Inspiration

The need for precise modeling of soil behavior, especially Ksat (saturated hydraulic conductivity), is critical in environmental and agricultural planning. We were inspired to build a predictive system that can assist researchers and engineers in sustainable water management.

What it does

KSAT Quest predicts the saturated hydraulic conductivity of soils using the UKSAT dataset. It evaluates model performance using RMSLE and R², and shows how dataset size affects model accuracy.

How we built it

We followed several steps for building our model such as :

  1. Cleaned the dataset to remove nulls and outliers
  2. Performed feature selection to retain only meaningful predictors
  3. Trained regression models using cross-validation
  4. Ran subset experiments with decreasing data sizes, repeated 50+ times each
  5. Evaluated and visualized using RMSLE and R² with Matplotlib and Seaborn

Challenges we ran into

Although we build our model, we faced certain challenges such as: Efficient handling of multiple randomized subset experiments Avoiding overfitting while tuning models for smaller datasets Maintaining reproducibility with a large number of experiment runs

Accomplishments that we're proud of

Built a robust pipeline from cleaning to evaluation Gained actionable insights on how data size impacts model performance Visualized error metrics that clearly reflect model behavior trends

What we learned

Throughout our project, we took several learning from our model such as: Subset experiments provide better generalization checks RMSLE is more effective than RMSE when dealing with wide-value ranges Automating repeated trials enhances consistency in ML evaluation

What's next for KSAT Quest: Regression Runoff

Extend the model to real-time Ksat predictions via an API Compare performance with deep learning models Integrate geospatial visualization for field-level hydrological insights

Built With

Share this project:

Updates