spilt coffee screens (search history, results, competitive analysis)
spilt coffee logo
GIF
spilt coffee animation

☕️ spilt coffee

Media monitoring and brand reputation trends powered by AI.
We help small businesses use wit.ai to take control of their brand by giving them instant access to and sentiment analysis on mentions across social, reviews, news and more.

✨ Mission

We believe everyone has room to grow and thrive. We commoditize big data for small businesses, reviews are better when they’re heard. No one should be left out because the cost is too great or the technology too complex. So we build easy tools to empower businesses to take control of their brand. Tools that make media monitoring, competitive analysis and reputation tracking effortless—spill the coffee.

📈 Features

SEARCH
We scrape data from Yelp, Twitter and News about your brand (and any brand), bucket reviews by sentiment and display results on user-friendly charts.

MONITOR
We automate searches to easily monitor brand sentiments over time, and display historical trends on this data.

SENTIMENT ANALYSIS
We utilize advanced sentiment detection tools like VADER and wit.ai to segment positive, negative and neutral mentions, and assign overall sentiment to each mention.

COMPETITIVE ANALYSIS
We allow businesses to compare and overlay competitor data with their own, keeping up to date with what the people are saying.

🧱 Architecture

A brief overview of our application, with some key features (green) on the left and how we handle them on the backend on the right. alt text

🔮 NLP Model

MODEL
Sentiment analysis was conducted based on an ensemble model aggregating both the VADER model and Facebook’s wit.ai NLP model. Train and test data was primarily formed from Yelp’s open data set (>8,000,000 user reviews) and the Sentiment140 Twitter dataset.

IMPLEMENTATION
We utilized wit.ai's sentiment analysis to rate and categorize reviews. We trained entity recognition to pull out and rank the most common entities in "very negative" and "very positive" reviews. We can then see what users tend to have issues with, and provide actionable recommendations to improve these businesses.

ACCURACY
Running our ensemble model on a subset (test) dataset, we achieved an accuracy 73.35% on a set of 2000 test Yelp points as well as a 74.25% for the Sentiment140 set on their given test set of 497 tweets.

NEXT STEPS
We hope to add more robust entity tagging as well as bi- and tri-gram recognition in order to better provide value for businesses. We also hope to extend the pre-trained wit.ai sentiment model to a five category model (very negative, negative, neutral, positive, very positive) for better bucketing.

💻 Tech Stack

UI frameworks: ElasticUI Recharts
Frontend: React.js
Backend: Django
DB: PostgreSQL
Authentication: Auth0
Model: VADER wit.ai

✔️ To Do

☐ Provide helpful feedback and insights for businesses (actionable recommendations!).
☐ Perform more in depth competitor sentiment analysis, and ability to recognize competitors.
☐ Allow users to mark wrong sentiments (and correct them). Our models aren't perfect, we have room to grow too!
☐ We already provide a set of content marked as "extremely negative" or "extremely positive". Now, it's time to extrapolate reasons and analyze severity.
☐ Scrape more platforms (Facebook, Instagram, more news sources, etc.)

👻 Fun Facts

In our first (virtual) meeting where we were struggling to decide on our product name, one member spilled coffee on himself—with that, "spilt coffee" was born.
"8 million rows [of yelp review data] is a lot of rows."

Built With

Submitted to

Facebook Hackathon: AI
- Winner First Place

Created by

I worked on building and restructuring the UI to be more user-friendly, combining wit.ai and VADER for sentiment analysis on the backend, and making most of the branding material.

Jonathan Dai
I mainly contributed to the backend portion of the app, creating the data collectors and aggregating data to display for analysis. It was my first time developing with Django, and it was nice to see how the different parts of our app interacted with each other while it was in development.

David Jin
I built the backend using Django and PostgreSQL. I also worked on the UI using tools like React, ElasticUI, and Recharts to build a streamlined, easy interface to use.

Min Hwan Kim
I worked on the frontend using React.js, the backend using Django, on the data collectors, implemented user authorization and sessions via Auth0, and on the creation, training, and testing of our
sentiment analysis model.

Kevin Wang
Cornell University '20 | UPenn '22
Jonathan Yiu