Inspiration
Online reviews influence almost every consumer decision, yet fake, irrelevant, or promotional reviews often erode trust. We wanted to build a system that helps users and platforms surface credible, relevant reviews, especially for restaurants and local businesses, where authenticity matters most.
What it does
ReviewGuard is an AI-powered review quality filter that:
• Detects and flags advertisements, irrelevant content, and non-visitor rants.
• Uses ML + NLP + Transformers to understand both the text and the context of reviews.
• Provides explainable decisions, so platforms can see why a review was flagged.
• Improves trust in review ecosystems, benefiting both businesses and customers.
How we built it
• Designed a data pipeline for text cleaning, emoji handling, and entity removal.
• Engineered rich features (linguistic, sentiment, structural, contextual, behavioral).
• Trained multiple models: LogReg, Random Forest, LSTM, and fine-tuned BERT, combined via an ensemble.
• Built a policy enforcement engine with multi-level confidence thresholds.
• Used Colab GPUs, Hugging Face Transformers, spaCy, and scikit-learn for development and training.
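To make the ensemble and the multi-level thresholds concrete, here is a minimal sketch of how per-model probabilities could be combined and mapped to an action. The threshold values, the weighted-average combination, and the names `ensemble_score`, `enforce`, and `Decision` are all assumptions for illustration, not the values or code used in ReviewGuard.

```python
from dataclasses import dataclass
from typing import Dict

# Hypothetical thresholds; the real engine's values are not published here.
REMOVE_THRESHOLD = 0.90   # at or above this score: flag for removal
REVIEW_THRESHOLD = 0.60   # at or above this score: send to a human moderator


@dataclass
class Decision:
    action: str   # "remove", "manual_review", or "keep"
    score: float  # ensemble probability that the review violates policy


def ensemble_score(model_probs: Dict[str, float], weights: Dict[str, float]) -> float:
    """Weighted average of per-model violation probabilities (assumed scheme)."""
    total_weight = sum(weights[name] for name in model_probs)
    weighted_sum = sum(prob * weights[name] for name, prob in model_probs.items())
    return weighted_sum / total_weight


def enforce(score: float) -> Decision:
    """Map an ensemble score to one of three actions using two thresholds."""
    if score >= REMOVE_THRESHOLD:
        return Decision("remove", score)
    if score >= REVIEW_THRESHOLD:
        return Decision("manual_review", score)
    return Decision("keep", score)


# Made-up probabilities from the four model families named above.
probs = {"logreg": 0.70, "random_forest": 0.80, "lstm": 0.90, "bert": 0.95}
equal_weights = {name: 1.0 for name in probs}
decision = enforce(ensemble_score(probs, equal_weights))
```

With these made-up inputs the average lands between the two thresholds, so the review would go to manual review rather than being removed outright. The middle tier is one way to keep borderline cases out of automatic enforcement.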
Challenges we ran into
• Handling noisy, unstructured user-generated text (emojis, slang, mixed languages).
• Balancing precision vs. recall — avoiding over-flagging while still catching bad reviews.
• Creating gold-standard labeled data with consistent annotator agreement.
• Optimizing large models under resource constraints (Colab GPU limitations).
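The precision vs. recall trade-off above can be illustrated with a small sketch that measures both at different flagging thresholds. The labels and scores are made-up toy data, and `precision_recall` is a hypothetical helper, not part of the project's code.

```python
def precision_recall(y_true, scores, threshold):
    """Precision and recall when every review scoring >= threshold is flagged."""
    tp = sum(1 for y, s in zip(y_true, scores) if s >= threshold and y == 1)
    fp = sum(1 for y, s in zip(y_true, scores) if s >= threshold and y == 0)
    fn = sum(1 for y, s in zip(y_true, scores) if s < threshold and y == 1)
    # Convention assumed here: precision is 1.0 when nothing is flagged.
    precision = tp / (tp + fp) if (tp + fp) else 1.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    return precision, recall


# Toy data: 1 = bad review (ad, irrelevant, rant), 0 = legitimate review.
y_true = [1, 1, 1, 0, 0, 0]
scores = [0.95, 0.80, 0.55, 0.70, 0.30, 0.10]

lenient = precision_recall(y_true, scores, threshold=0.5)  # flags more reviews
strict = precision_recall(y_true, scores, threshold=0.9)   # flags fewer reviews
```

On this toy data the lenient threshold catches every bad review but also flags one legitimate review, while the strict threshold flags nothing legitimate but misses two of the three bad reviews.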
Accomplishments that we're proud of
• Built a robust multi-model ensemble that outperforms regex and baseline keyword methods.
• Developed an explainable AI layer so predictions are transparent, not black-box.
• Created a scalable architecture that can be deployed in real-world review systems.
What we learned
• The importance of hybrid approaches: traditional ML + deep learning + transformers complement each other.
• How domain-specific features (restaurant terms, personal experience indicators) dramatically improve performance.
• How trust in digital ecosystems depends not only on technical accuracy but also on explainability and fairness.
What's next for ReviewGuard
• Expand to other domains (e-commerce, travel, app stores).
• Build a real-time API service for platforms to plug into.
• Enhance detection with multimodal analysis (images, metadata, reviewer behavior).
• Introduce active learning loops to continuously improve with user feedback.
• Explore partnerships with platforms like Google Maps, Yelp, and TripAdvisor.
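One common way to drive the active learning loop mentioned above is uncertainty sampling: send the reviews the model is least sure about to human annotators first. The sketch below assumes a binary classifier that outputs a probability per review; `select_for_labeling` and the scores are hypothetical.

```python
def select_for_labeling(scores, k):
    """Return indices of the k reviews whose scores are closest to 0.5."""
    ranked = sorted(range(len(scores)), key=lambda i: abs(scores[i] - 0.5))
    return ranked[:k]


# Made-up model probabilities that each review violates policy.
scores = [0.97, 0.52, 0.10, 0.45, 0.80]
to_label = select_for_labeling(scores, k=2)
```

Here the reviews scored 0.52 and 0.45 are selected, since the model is closest to undecided on them. Labels collected for such cases would then be added to the training set before the next retraining round.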
Built With
- matplotlib
- numpy
- pandas
- python
- pytorch
- scikit-learn
- spacy