Inspiration
In critical sectors like healthcare and finance, access to high-quality data is essential—but privacy concerns often restrict it. Inspired by this gap, we built HealthForge AI to generate realistic, privacy-compliant synthetic data to help train AI models safely and effectively.
What it does
HealthForge AI is a smart synthetic data generator that: Creates diverse, realistic datasets in domains like Healthcare and Finance Preserves data privacy (no real personal data used) Offers an easy-to-use UI to generate and download datasets instantly Helps developers and researchers test models without regulatory roadbloc
How we built it
Frontend: Built using Streamlit for a fast, interactive UI Backend: Powered by Python, Faker, and Pandas to create synthetic records Deployment: Containerized with Docker for seamless deployment Cloud Ready: Designed for hosting on Google Cloud or AWS
Challenges we ran into
Designing data that feels authentic yet is non-identifiable Balancing randomness with realism across multiple domains Ensuring compatibility with large-scale model training datasets Limited time and compute access without AWS credits
Accomplishments that we're proud of
A fully functional synthetic data generator within days Real-time generation and CSV download features Clean, containerized app ready for cloud deployment Adaptable architecture for expanding to more industries like Retail or Education
What we learned
How to generate domain-specific synthetic data while ensuring quality Leveraging Streamlit + Docker for rapid prototyping and deployment The importance of synthetic data in enabling ethical AI development
What's next for HealthForge AI
Add support for Retail, Education, and Cybersecurity data Integrate feedback mechanism to fine-tune realism Build an API to plug into existing ML training pipelines Add a module to compare real vs synthetic data fidelity Deploy fully on GCP/AWS with autoscaling
Built With
- amazon-web-services
- faker
- numpy
- pandas
- python
- streamlit
Log in or sign up for Devpost to join the conversation.