Purposes

The primary aim of our project was to predict the number of bike riders in a specific Bay Area city on a given day by analyzing various factors, including time, weather conditions, and rider behavior.

To achieve this, we focused on understanding travel trends for selected cities by thoroughly exploring and preparing the dataset. This process involved data manipulation, extraction, and preprocessing, followed by the application of regression algorithms and techniques to forecast ridership patterns.

Dataset Source: Bay Area Bike Share Dataset

Algorithms Selected to Train and Evaluate

  • RandomForestRegression
  • ExtraTreesRegressor
  • XGBoostRegressor
  • GradientBoostingRegressor

Running and Comparing all Algorithms

Model Performance Summary

Model Used RMSE Score Comment
RandomForestRegressor Mean: 244.9306 Relatively poor performance
ExtraTreesRegressor Mean: 190.8538 Good performance
XGBoostRegressor Mean: 190.0768 Good performance, better than Extra Trees
Gradient Boosting Mean: 190.0746 Good performance

Built With

Share this project:

Updates