TLDR

We performed EDA on a dataset provided by the Philadelphia Bail Fund. Our results are deployed as a R Shiny application. Our process is viewable as a Google Colab Notebook.

Inspiration

Our team was interested in doing a data science related project using Pandas or R, so the Philadelphia Bail Fund's challenge provided us with the perfect opportunity to flex our EDA muscles.

Process

We wanted to generate insights from bail data so we split into two groups. One group focused on cleaning the all.csv file provided. Mainly we were interested in extracting how many felonies, misdemeanors and summaries were committed by each subject. Our other group worked on generating visual representation of our parsed dataset. To graph and deploy rapidly, we opted to use R Shiny, which allowed us to quickly deploy our visualizations in R as an interactive web app.

Challenges

We are fairly novice data scientists, and we found that our data science toolsets did not have much overlap. This, alongside remote work, made even EDA really challenging.

What's Next?

If time permits, we would be thrilled to work with CodeForPhilly to continue generating insights from bail data.

Built With

Share this project:

Updates