🚀 About the Project: SynGenAI – Smart Synthetic Data Generator
🔍 Inspiration While working on AI projects, I repeatedly faced one major obstacle: lack of accessible, clean, and diverse data. In domains like healthcare or finance, real data is often restricted or unavailable due to privacy concerns. This sparked the idea for SynGenAI — a lightweight tool that creates realistic synthetic data on demand, without risking user privacy.
🏗️ How I Built It Frontend: Streamlit (for its simplicity and rapid prototyping)
Backend Logic: Python and the Faker library (for domain-specific synthetic data)
Data Handling: Pandas for formatting and CSV export
Deployment: Streamlit Cloud (serverless and fast)
I used modular Python scripts to structure the logic by domain (e.g., healthcare, retail, finance), and designed a user-friendly interface with one-click CSV download functionality.
📚 What I Learned How to build a serverless GenAI app using minimal tools
Creating domain-specific data logic using Python's Faker
Streamlit’s powerful capabilities for quick UI and deployment
⚙️ Challenges Faced Designing realistic and customizable prompts for different domains
Ensuring the generated data looked authentic and useful
Managing layout performance on Streamlit Cloud
Log in or sign up for Devpost to join the conversation.