We were inspired after seeing how prominent an issue digital illiteracy is. We saw our classmates struggle to use basic installers and thought, "Hey! People should probably know where their private data is going."

Okay, so basically, here's how it works:
- We parse the tags from a web page (specifically a privacy policy).
- We use fasttext to label the text into various categories.
- We use summa to summarize the text into the important, readable bits.

Aside from staying up until 5:00 am, we built this mainly by using FastText and training it to recognize text that fits into certain categories.

FastText has very little documentation for Windows or for its Python version. Not only that, but we were completely new to machine learning and had to learn a lot from scratch. Gathering the data was also very difficult.

We're proud of PPS being even a little bit accurate. It might not be much, but it's something.

We learned how to use machine learning, including the difference between supervised and unsupervised training.

We plan on expanding this even after the hackathon and making it an easily usable, open-source browser extension.
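For anyone curious, here is a rough sketch of what the parse, label, and summarize steps described above could look like in Python. It is not our actual code. It assumes the text lives in `<p>` tags (the write-up doesn't say which tags), and the helper names (`extract_paragraphs`, `to_fasttext_line`, `label_paragraphs`, `summarize`) are made up for illustration. The fasttext and summa imports are done lazily, so the parsing part runs with only the standard library.

```python
from html.parser import HTMLParser


class ParagraphExtractor(HTMLParser):
    """Collects the text found inside one kind of tag (assumed here: <p>)."""

    def __init__(self, tag="p"):
        super().__init__()
        self.tag = tag
        self.inside = False
        self.buffer = []
        self.paragraphs = []

    def _flush(self):
        # Join the collected pieces and collapse runs of whitespace.
        text = " ".join("".join(self.buffer).split())
        if text:
            self.paragraphs.append(text)
        self.buffer = []

    def handle_starttag(self, tag, attrs):
        if tag == self.tag:
            self._flush()  # also copes with <p> tags that were never closed
            self.inside = True

    def handle_endtag(self, tag):
        if tag == self.tag:
            self._flush()
            self.inside = False

    def handle_data(self, data):
        if self.inside:
            self.buffer.append(data)


def extract_paragraphs(html, tag="p"):
    """Step 1: pull the paragraph text out of a privacy policy page."""
    parser = ParagraphExtractor(tag)
    parser.feed(html)
    parser.close()
    parser._flush()
    return parser.paragraphs


def to_fasttext_line(label, text):
    """Format one training example the way fastText's supervised mode expects."""
    return "__label__{} {}".format(label, " ".join(text.split()))


def train(training_file):
    """Train a supervised fastText classifier from a file of labelled lines."""
    import fasttext  # third-party; imported lazily
    return fasttext.train_supervised(input=training_file)


def label_paragraphs(paragraphs, model):
    """Step 2: attach the top predicted category to each paragraph."""
    results = []
    for paragraph in paragraphs:
        # fastText predicts one line at a time, so newlines are replaced.
        labels, probs = model.predict(paragraph.replace("\n", " "))
        results.append((labels[0], float(probs[0]), paragraph))
    return results


def summarize(text, ratio=0.3):
    """Step 3: shrink the text with summa's TextRank summarizer."""
    from summa import summarizer  # third-party; imported lazily
    return summarizer.summarize(text, ratio=ratio)
```

The category names and the `ratio` value are up to whoever trains and tunes the model. Nothing here reflects the labels or settings we actually used.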
Built With
- fasttext
- python