Digital-wellbeing

Mental Health and Digital Behavior

The goal of this project is to predict a user's digital_wellbeing_score based on various features like:

This helps in understanding how digital behavior affects mental wellness and can guide digital wellbeing recommendations.

The dataset (mental_health_digital_behavior_data.csv) is loaded using pandas.
It contains 500 rows of user behavior and mental health scores from 2020–2024.

Use .info(), .describe() and .head() to understand data types, ranges, and structure.
Checked for null values or data inconsistencies.

Box plots and scatter plots were used to inspect distributions and detect outliers.
seaborn and matplotlib were used for visualizations.

Used .corr() to analyze correlation of features with digital_wellbeing_score.
Found that anxiety_level was strongly negatively correlated.

Selected all features except the target (digital_wellbeing_score) for training.

Trained multiple regression models:

Used train_test_split to divide data into 80% training and 20% testing sets.

Evaluated all models using:

Used RandomForestRegressor.feature_importances_ to rank the importance of each input variable in prediction.

Linear Regression is overfitting (R² ≈ 1.00).
Random Forest and XGBoost performed well, showing robust predictions.
SVR and KNN showed relatively lower performance, possibly due to dataset size or scaling. Among all models Random Forest is the best model. This model is saved. ---

A Streamlit web app that helps understand and improve digital wellbeing.

Home - Tips: Learn helpful tips for reducing screen time, improving sleep, and managing digital habits.
Prediction: Enter your daily habits like sleep hours, screen time, mood, and so on to get a Digital Wellbeing Score.
What to Do: Based on your score, get suggestions on how to improve your digital wellbeing.

Leave feedback in the comments!

Log in or sign up for Devpost to join the conversation.