DoYouKnowDataWey?

Inspiration

There are so many interesting ways to visualize data, from services like TwitterEarth to TagGalaxy. We thought about the difficulties of efficient data visualization in high dimensions and decided to solve this well-known issue in the most uncommon way possible.

Through partitioning of records into warring factions led by 2018's SPICIEST star. Ugandan Knuckles.

Together. We will find data way.

What it does

Performs principal component analysis on a dataset and applies K-means clustering to the result in order to partition the records into tribes represented by Ugandan Knuckles variations.
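The pipeline described above can be sketched roughly as follows. This is a minimal illustration, not the project's actual script: the function name `partition_into_tribes`, the standardization step, the choice of two components and three tribes, and the synthetic data are all assumptions made for the example.

```python
import numpy as np
from sklearn.cluster import KMeans
from sklearn.decomposition import PCA
from sklearn.preprocessing import StandardScaler


def partition_into_tribes(records, n_components=2, n_tribes=3, seed=0):
    """Project records onto principal components, then cluster the projection."""
    # Assumption: standardize first so no single feature dominates the components.
    scaled = StandardScaler().fit_transform(records)
    # Reduce to a small number of dimensions that are easy to plot.
    projected = PCA(n_components=n_components, random_state=seed).fit_transform(scaled)
    # Assign each projected record to one of n_tribes clusters.
    labels = KMeans(n_clusters=n_tribes, n_init=10, random_state=seed).fit_predict(projected)
    return projected, labels


# Synthetic stand-in data: three well-separated blobs in 5 dimensions.
rng = np.random.default_rng(0)
centers = np.array([[0.0] * 5, [10.0] * 5, [-10.0] * 5])
records = np.vstack([c + rng.normal(size=(50, 5)) for c in centers])

projected, tribes = partition_into_tribes(records)
```

Here `projected` holds the 2-D coordinates for plotting and `tribes` holds one cluster label per record, which is what would be mapped to a Knuckles variation.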

How I built it

All data mining was done in a Python script using pandas, NumPy, scikit-learn, and SciPy.

Graphs were produced with Plotly.

The frontend was a simple Flask app served from Amazon EC2.

Challenges I ran into

AWS did not support Python 3.6, which is honestly ridiculous and caused major setbacks.

I was hungry.

What I learned

Silhouette averaging for optimal K selection, as well as general insight into effective use of Python data cleaning and transformation tools.
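For readers unfamiliar with the idea, silhouette-based K selection can be sketched like this: run K-means for several candidate values of K and keep the one with the highest average silhouette score. The helper name `best_k_by_silhouette`, the candidate range 2 through 7, and the synthetic data are assumptions for illustration and not taken from the project.

```python
import numpy as np
from sklearn.cluster import KMeans
from sklearn.metrics import silhouette_score


def best_k_by_silhouette(points, k_values=range(2, 8), seed=0):
    """Return the K with the highest average silhouette score, plus all scores."""
    scores = {}
    for k in k_values:
        labels = KMeans(n_clusters=k, n_init=10, random_state=seed).fit_predict(points)
        # silhouette_score averages the per-sample silhouette coefficients.
        scores[k] = silhouette_score(points, labels)
    return max(scores, key=scores.get), scores


# Synthetic stand-in data: four tight, well-separated 2-D blobs.
rng = np.random.default_rng(1)
centers = np.array([[0, 0], [10, 0], [0, 10], [10, 10]], dtype=float)
points = np.vstack([c + rng.normal(scale=0.5, size=(40, 2)) for c in centers])

best_k, scores = best_k_by_silhouette(points)
```

Scores closer to 1 mean records sit well inside their own cluster and far from the next nearest one, so the highest-scoring K is a reasonable default for the number of tribes.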

Deeper understanding of the AWS services offered by Amazon.

How many variations a meme widely considered one-dimensional truly has.

What's next for DoYouKnowDataWey?

We fully intend to address scalability concerns for massive datasets.

In addition, we would like to expand the data analysis to offer more meaningful statistics about the observations.

And of course, we want to bring Ugandan Knuckles to the even greater spicy meme heights that he so clearly deserves.

Built With

Submitted to

Uncommon Hacks 2018
- Winner Most Innovative

Created by

I worked mostly on the front-end development and helped design and create the website.

Dylan Vo
Worked with scikit-learn and pandas to perform PCA and K-means clustering on the data. Performed standard data mining procedures to partition the records into meaningful sects.

Set up the Flask app and deployed it with Amazon EC2.

Daniel Tian
d:^)

Updates

Dylan Vo started this project — Feb 11, 2018 10:48 AM EST
