Movie Matcher

Home Page
Query
Deep Learning model
Deep Learning model
Prediction Model

movie-matcher

Entry for MetroStar's Movie Matcher track for GMU's PatriotHack 2019 Hackathon. It uses the LSTM architecture for Sentiment Analysis and Kaggle's Large Movie Review Dataset (https://www.kaggle.com/lakshmi25npathi/imdb-dataset-of-50k-movie-reviews) to determine whether reviews are positive or negative, calculates a "genre score", which is the positive reviews subtracted by the negative reviews for each actor featured in a movie.

How to set up

Movie Matcher relies on scikit-learn, numpy, nltk, keras, glob, requests, sqlite3, re, pandas, and json libraries and utilizes Python 3.7. The web appliction is a simple database query in order to visualize the data; however, it will require the scripts to be executed in order to update the database. You can do this by running process.py, which will train the model and save it to an h5 file, then the get_reviews.py, which will gather all reviews in the given dataset (in this case, we used MetroStar's dataset of movies) through the themoviedb API, and then run predict.py in order to run the Sentiment Analysis prediction on all available reviews, calculate genre scores, and store them to the SQLite3 databse.

Built With

Submitted to

PatriotHacks

Created by

I worked on the backend, including the deep learning model, the database design, and the php to visualize the data. It was my first time using Python for a large project and my first time doing anything with Machine Learning, so it was an intimidating process, but a fun learning experience!

Andy Nguyen
I did the front, I did the HTML and CSS. First time building from scratch in 36 hours! Tried to make site look user friendly and be responsive so it can be used on all platform.

Bryan Ramirez

Updates

Andy Nguyen started this project — Oct 20, 2019 12:51 AM EDT

Leave feedback in the comments!

Log in or sign up for Devpost to join the conversation.