A Journey to Solving the Data Hackathon Challenge
Inspiration
As a mathematics student, I was always fascinated by the power of data and its potential to shape our world. I wanted to test my math skills.
What it does
This dashboard concept allows users to drill down into various areas to answer questions they have about the data. The data is presented in a multitude of charts.
With the help of Plotly, I was able to create some interactive visualizations that helped me answer hidden trends and patterns in the data. I was especially proud of the dashboard I created, which included a treemap that showed the distribution of high schools represented and the number of potential engineers attending those high schools broken up by grade classification. This was colored based on gpa.
How I built it
There are various data science libraries and tools available but I decided to use Jupyter Notebooks and the Pandas and Plotly libraries because of this method lends itself to rapid development and its ease to manipulate data.
Challenges I ran into
I faced a few challenges, including creating a 2D Gaussian distribution with the proper correlations within the data. I was determined to overcome these obstacles after multiple iterations and exploratory analysis of the dummy datasets I created.
What I learned
I learned a lot about data manipulation, visualization, and the math behind multivariate normal distribution in data.
What's next for Cultivating Coders with Data
Maybe this methodology could get extended to a real life survey with actual students from HISD and be used to coordinate a broader recruitment effort from TSU.
Log in or sign up for Devpost to join the conversation.