Multilingual Toxicity Classification System

ROC Curve illustrating the model’s ability to distinguish toxic and non-toxic text.

🧠 About the Project

💡 Inspiration

With the rapid growth of online platforms and social media, the spread of toxic and harmful content has become a serious concern. Moderating such content manually is slow and does not scale. This inspired us to build an AI system that automatically detects toxic language, especially in multilingual settings where users write in both English and Hindi.

⚙️ What it does

This project assigns each input text one of two labels:

0 → Non-toxic
1 → Toxic

It helps identify abusive, offensive, or harmful content in user-generated text.

🏗️ How we built it

We followed a structured machine learning pipeline:

Data Preprocessing
- Lowercased text
- Removed URLs and unwanted characters
- Preserved both Hindi and English text
Feature Extraction
- Used TF-IDF Vectorization
- Captured both single words and phrases using n-grams
Model Training
- Applied Logistic Regression
- Split data into training and validation sets
Evaluation
- Used ROC-AUC as the main metric
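
The steps above can be sketched end to end with scikit-learn. This is a minimal illustration, not the project's actual code: the cleaning regexes, the `ngram_range=(1, 2)` setting, the whitespace tokenizer, the 75/25 split, and the tiny made-up dataset are all assumptions chosen only to make the example runnable.

```python
import re

from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import roc_auc_score
from sklearn.model_selection import train_test_split
from sklearn.pipeline import make_pipeline

# Assumed cleaning rules: drop URLs, then drop everything except
# lowercase ASCII letters, digits, Devanagari (U+0900-U+097F), and whitespace.
URL_RE = re.compile(r"https?://\S+|www\.\S+")
UNWANTED_RE = re.compile(r"[^a-z0-9\u0900-\u097F\s]")


def clean_text(text: str) -> str:
    """Lowercase, strip URLs and unwanted characters, keep Hindi and English."""
    text = text.lower()
    text = URL_RE.sub(" ", text)
    text = UNWANTED_RE.sub(" ", text)
    return " ".join(text.split())  # collapse repeated whitespace


# Toy data invented for this sketch (1 = toxic, 0 = non-toxic).
texts = [
    "You are an idiot",
    "I hate you, stupid fool",
    "shut up you loser",
    "nobody likes you, get lost",
    "तुम बेवकूफ हो",
    "तुम बहुत बुरे इंसान हो",
    "tum bewakoof ho",
    "what a stupid, useless post",
    "Have a nice day",
    "Thank you for your help",
    "see you tomorrow at the office",
    "great work, well done",
    "आपका दिन शुभ हो",
    "बहुत अच्छा काम किया",
    "aap kaise ho",
    "this article was really helpful",
]
labels = [1] * 8 + [0] * 8

# Stratified split so both classes appear in the validation set.
X_train, X_val, y_train, y_val = train_test_split(
    texts, labels, test_size=0.25, stratify=labels, random_state=42
)

model = make_pipeline(
    TfidfVectorizer(
        preprocessor=clean_text,  # replaces the built-in lowercasing step
        token_pattern=r"\S+",     # split on whitespace after cleaning
        ngram_range=(1, 2),       # single words and two-word phrases
    ),
    LogisticRegression(max_iter=1000),
)
model.fit(X_train, y_train)

# ROC-AUC is computed from the predicted probability of the toxic class.
toxic_proba = model.predict_proba(X_val)[:, 1]
auc = roc_auc_score(y_val, toxic_proba)
print(f"Validation ROC-AUC: {auc:.3f}")
```

The whitespace `token_pattern` is a deliberate choice in this sketch: the vectorizer's default pattern relies on `\w`, which does not match Devanagari vowel signs and can therefore split Hindi words into fragments. On a dataset this small the printed score is not meaningful; it only shows where the metric is computed.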