🏥 MedVision AI: The Future of Healthcare, Powered by Gemini 3
Bridging the gap between patients and doctors with the power of advanced AI.
Inspiration
The healthcare system often feels disconnected. Patients struggle to understand complex medical reports, while doctors are overwhelmed with data, wasting lots of unproductive time, leaving little time for deep analysis. We asked ourselves: What if a doctor had a "Second Brain" that never forgets a patient's history? And what if a patient had a companion to decode the scary medical jargon into simple terms?
That was the spark for MedVision AI. We wanted to create a system where technology doesn't replace the doctor, but empowers them—and where patients feel informed, not intimidated.
What it does
MedVision AI is a comprehensive Patient-Doctor Ecosystem that works seamlessly for both online and offline consultations.
- For Doctors: It acts as an intelligent assistant. Using Gemini 3, it analyzes historical scans, reports, and timeline data to predict disease trajectories and flag anomalies. It extracts data from X-rays and handwritten prescriptions instantly saving his time.
- For Patients: It decodes complex lab reports (like "Creatinine levels" or "Leukocytosis") into plain English. It also manages their hospital journey with a Live Token Tracking System, so they never have to wait blindly in a queue.
Key Features
- Gemini 3 Deep Reasoning: Analyzes years of patient history to provide confidence-scored clinical insights.
- Multimodal Vision: Upload any medical document or scan; our AI extracts and structures the data.
- Live Queue Management: Real-time status updates for patients (e.g., "You are #5 in line").
- Hybrid Appointments: Unifies walk-in and digital appointments in one dashboard.
How we built it
We built MedVision AI as a high-performance full-stack application, deeply integrating Google's latest AI models.
The Tech Stack
- Frontend: React and Next.js for a responsive, premium UI.
- Backend: Python (FastAPI/Flask) handling complex logic.
- AI Core: Google Gemini 3 (2M token context window) for reasoning and Vision for image analysis.
- Database: Hybrid setup with SQLite (local development) and Firestore (production real-time data).
- Deployment: Vercel (Frontend) linked with Render (Backend).
Gemini 3 Architecture
At the core is our custom Gemini Service. We utilize the massive 2M token context window to feed the entire patient history into the model, ensuring no detail is missed.
Challenges we ran into
The most significant technical hurdle was the Live Token-Based Appointment System.
- Real-Time Concurrency: Managing a live queue where "Walk-in" patients and "Online" bookings merge into a single stream was complex. We had to ensure that when a doctor updates a status, every patient's "Live Track" updates instantly without server lag.
- Gemini Context Management: Handling the massive text output from medical PDFs and staying within token limits (before Gemini 3's 2M update) was tough. We had to write efficient "Context Builders" to structure data effectively.
- Vision Accuracy: Fine-tuning the prompt for the Vision model to accurately read handwritten Indian doctor prescriptions required multiple iterations and "few-shot" prompting strategies.
Accomplishments that we're proud of
- Seamless Full-Stack Integration: Successfully linking a complex Python backend with a modern React frontend and deploying it live.
- Gemini 3 "Thinking Mode": We implemented a transparency layer where the AI shows why it made a diagnosis, building trust with doctors.
- The UI/UX: Transformations from a simple hackathon prototype to a product that looks and feels professional (Thanks to Parth!).
- Real-World Utility: The "Medical Library" feature that simplifies reports is something we believe can genuinely help non-medical users immediately.
What we learned
- System Design is Key: Planning the database schema for a hybrid (online/offline) system taught us the importance of solid architectural decisions early on.
- AI Transparency: We learned that in healthcare, "Black Box" AI is useless. Building the "Confidence Meter" and showing the AI's reasoning path was a crucial lesson in AI ethics.
- Team Synergy: As a team of second-year engineering students, we learned how to divide and conquer—merging distinct roles (Frontend, Backend, AI, Database) into a cohesive unit.
What's next for MedVision AI
Near Future
- Daily Patient Limiter: Tools for doctors to set daily caps based on estimated treatment times.
- Storybook Health Library: An interactive, illustrated library to bust medical myths.
- Smart Scheduling: Google Calendar integration for surgery management.
Far Scope (The Big Vision)
- Global Medical Network: A social platform connecting doctors and patients worldwide.
- Advanced Anomaly Detection: Training custom image processing algorithms on massive datasets to detect tumors/fractures, verified by Gemini's OCR for a double-layer safety check.
The Team (Contributors)
1. Sarthak Dhonde - Lead Programmer & Architect
- Integrated the entire Patient-Doctor ecosystem.
- Engineered the Gemini 3 Analysis engine (Context mapping, Image processing).
- Built the robust Authentication & Deployment pipeline (Vercel + Render).
2. Parth Vaishampayan - Lead Frontend Developer
- Designed the entire UI/UX from scratch.
- Created the stunning animations and responsive layouts.
3. Tanmay Pansare - Database & Deployment Lead
- Architected the secure SQLite and Firebase database structures.
- Managed the complex backend deployment on Render.
4. Harshal Pednekar - Backend Programmer
- Built the entire Medical Library feature.
- Optimized application performance and fixed critical bugs.
5. Vedang Mendhurwar - Backend Programmer
- Developed the AI summarization for the patient records.
- Contributed to the booking and scheduling logic.
6. Ruturaj Rajwade - Backend Programmer
- Established database connections between Doctor and Patient portals.
- Managed local database operations.
MedVision AI - See the Future of Health.
Built With
- firebase
- gemini
- nextjs
- python
- render
- sqlite
- typescript
- vercel
Log in or sign up for Devpost to join the conversation.