Understanding Austin B-Cycle Usage

Inspiration

Vehicle rental services like electric scooters and B-cycles are very popular in Austin and are thus a lucrative and competitive business in this city. Understanding when B-cycles in particular are in highest demand and the motivations as to why people use them is useful information to companies to improve the user experience and gain a competitive edge in this profitable venture.

What it does

We explore daily usage patterns and develop a model that uses very limited data to predict how many bicycles will be checked out at a given kiosk depending on the time of the day.

How we built it

We classified specific rides' purposes roughly by calculating the expected ride time using latlong coordinates and comparing that to the actual ride time. Rides with significantly longer times were inferred to be leisure rides (as shown by some following the path of the river), rides close to the predicted time were inferred to be utility rides (to work/class, etc.), and rides with almost 0 distance (returning to the same stop) were inferred to be short-term errands.

We transformed the provided B-cycle data for a particular kiosk to record the number of checkouts for each hour on particular dates. We then used weekday/weekend and time of day features to predict in which frequency class (low, med, high) the checkout rate at a given hour falls into. We achieved 65% cross-validated accuracy using our very limited feature set.

We also compared the distribution of B-cycle traffic following the introduction of the system to UT Austin's students.

Challenges we ran into

The map visualization tools did not have native support for some of the aspects of the data we wanted to highlight. Additionally, the dataset did not have very many features very relevant to the classification task, but with careful selection of classification models, we achieved a nontrivial accuracy.

Accomplishments that we're proud of

Decent classifier accuracy with limited feature set Identification of anomalies in the data and assessing the causes (e.g. ACL festival)

What we learned

How to collaboratively analyze data using Azure

What's next for Understanding Austin B-Cycle Usage

In the future, we would try to better understand the usage with more robust data and understand how data was collected (random selection, etc.)

Built With

azure
jupyter
python

Submitted to

Texas Datahack 2019
- Winner Best Visualization

Created by

I worked on classifying the trips into Utility, Leisure and Errands using analysis on the euclidian distance between the start and end point and the trip duration. I then plotted these trips by the hour of the day and day of the week and observed interesting spikes.

Anirudh Goyal
I worked to show how popular B-Cycle destinations in Austin have changed over time. I created the heatmap and analyzed trends in return frequency to various kiosks, and discovered the interesting trend in B-Cycle usage during ACL over the years.

Matthew Hoffman
I worked on predicting whether a B-Cycle checkout would be lengthy or not. I took in the type of membership as well as time of day to predict whether a B-Cycle trip would take longer than 20 minutes with ~73% accuracy.

Amit Joshi

Updates

Anirudh Goyal started this project — Apr 13, 2019 06:00 PM EDT

Leave feedback in the comments!

Log in or sign up for Devpost to join the conversation.