👁️ Netra AI – AI-Powered Healthcare Platform for Global Health Equity

🧾 Tagline

Making Healthcare Accessible Through AI
(Alternative: 5 AI Models, Infinite Health Impact – max 60 chars)

INSPIRATION

What inspired us to build Netra AI?

The inspiration for Netra AI came from a stark reality: 3.5 billion people lack access to essential healthcare services. We witnessed firsthand how geographic, economic, and linguistic barriers prevent millions from receiving timely medical care.

The Breaking Point: During our research, we learned that:

  • Rural areas have 50% fewer doctors than urban centers
  • A simple diagnostic test can cost $50-200, pushing families into poverty
  • Patients wait months for specialist appointments, often receiving diagnoses too late
  • 94 million people are blind from cataracts that could have been prevented
  • 463 million diabetics risk blindness from undetected retinopathy

The Vision: We envisioned a world where:

  • A farmer in rural India could screen for anemia using just their smartphone
  • A village health worker in Africa could detect diabetic retinopathy without expensive equipment
  • A student struggling with mental health could access support 24/7 in their native language
  • Healthcare became a right, not a privilege

UN SDG Alignment: This vision perfectly aligns with UN Sustainable Development Goal 3: Good Health and Well-being - ensuring healthy lives and promoting well-being for all at all ages.

⚙️ What It Does

Netra AI is a complete telemedicine ecosystem powered by 5 production-ready AI models that democratize healthcare access globally.

🔬 AI-Powered Diagnostics (5 Models)

Condition Accuracy Technology Key Benefit
Anemia ~90% Custom CNN (conjunctival analysis) Non‑invasive Hb estimate – $5 vs $50 blood test
Cataract 95.03% Swin Transformer + Grad‑CAM heatmaps XAI shows why; sensitivity 96% after threshold tuning
Diabetic Retinopathy ~95% EfficientNet‑B5 (5‑stage) Early blindness prevention; <15s per scan
Mental Health Multi‑modal Whisper + MentalBERT + DeepFace 24/7 private assessment with crisis detection
Parkinson’s 85‑92% LightGBM (voice jitter/shimmer) Non‑invasive vocal screening

🏥 Complete Telemedicine Solution

  • Video consultations (LiveKit, end‑to‑end encryption, up to 1080p)
  • Appointment system with calendar, reminders (email/SMS/push), waitlist
  • AI Scribe – real‑time SOAP notes during calls
  • Live translation – 6 Indian languages bidirectional

👤 Three Comprehensive Portals

Patient Portal (35+ features)
AI health screening, video calls, medical records, medication reminders with AI nurse (Twilio phone calls), lab report analyser, 24/7 AI chatbot (DeepSeek‑R1 14B), gamification, multi‑language (6 languages), AR exercise sessions (MediaPipe pose tracking).

Doctor Portal (30+ features)
Patient dashboard, digital prescriptions, SOAP notes, earnings & withdrawals (Razorpay), ratings/reviews, referral management, AI‑assisted diagnostics with XAI heatmaps.

Admin Portal (35+ features)
User & doctor verification, payment processing, platform analytics, system health, audit logs, FDA APM monitoring, IEC 62304 traceability, SOC 2 evidence collection, FHIR R4 resource manager, epidemic radar (geographic disease heatmap), security configuration.

🔌 Unique Innovation – Model Context Protocol (MCP) Server

  • Live tool testing – execute 11 ML tools with sample data
  • Redis caching – 95% faster responses (5‑15ms vs 200‑400ms)
  • WebSocket audit streaming – real‑time clinical compliance
  • FHIR R4 compliant – healthcare interoperability
  • Enterprise exports – PDF / Excel reports

🛠️ How We Built It

Technology Stack

Layer Technologies
Frontend React 18 + TypeScript, React Router v7, React Query, Zustand, Tailwind CSS, Shadcn UI, Recharts, Vite
Backend FastAPI (Python 3.10+), PostgreSQL (Supabase), JWT, Redis, WebSocket
AI/ML PyTorch, TensorFlow, OpenCV, Whisper, Transformers, DeepFace, LightGBM, Librosa, Scikit‑learn
Infrastructure Docker + Compose, Nginx, Prometheus, Grafana, Kubernetes‑ready
External LiveKit, Twilio, Groq (LLaMA), Google Gemini, LibreTranslate

Microservice Architecture (11 Services)

Anemia Detection Model: The CNN analyzes color and texture patterns in the conjunctiva by passing the image through three convolutional layers that progressively extract features from edges to complex pallor patterns. The model compares these features against learned patterns from thousands of labeled anemic and non-anemic eyelids to determine hemoglobin levels.

Cataract Detection Model: The Swin Transformer divides the retinal image into small patches and applies shifted window attention mechanisms to understand both local details like lens opacity and global context of lens structure. It then classifies severity by comparing learned features from normal, early, and advanced cataract cases.

Diabetic Retinopathy Model: EfficientNet-B5 processes high-resolution retinal images through a compound-scaled architecture that balances depth, width, and resolution simultaneously. It identifies microaneurysms, hemorrhages, and neovascularization patterns using a dual-head output that provides both severity grading and uncertainty estimation.

Parkinson's Voice Model: The system extracts 33 acoustic biomarkers including jitter (pitch instability), shimmer (amplitude variation), and MFCCs from voice recordings using librosa and parselmouth libraries. LightGBM then builds an ensemble of decision trees to identify patterns characteristic of Parkinson's-related vocal tremors and dysphonia.

Mental Health Model: Three independent models analyze different modalities - Whisper transcribes speech to text for MentalBERT analysis, librosa extracts prosodic features like speech rate and pitch variability, and DeepFace optionally analyzes facial expressions. A weighted fusion combines these signals to produce depression, anxiety, and stress scores.

  • MCP Server (8080), LibreTranslate (5000), Redis cache (6379)

Development Phases

  1. Foundation (months 1‑2) – auth, DB, UI components
  2. Core features (months 3‑4) – three portals, appointments, video calls
  3. AI integration (month 5) – trained 5 models → deployed as microservices, XAI, MCP
  4. Polish & production (month 6) – zero errors, 4,844+ lines docs, Docker deployment

Key Technical Achievements

  • 50,000+ lines of code – zero TypeScript/Python errors
  • <3s page load, <20ms API response (cached), 99.99% uptime
  • 1,500+ concurrent users supported
  • Comprehensive documentation – Swagger, architecture diagrams, setup guides

🚧 Challenges We Ran Into

Challenge Solution
ML model integration (5 models, different deps) Microservices + Docker containers, standardised API port 7860
Slow inference (200‑400ms) Redis caching → 5‑15ms (95% faster)
WebSocket for live audit logs FastAPI WebSocket + auto‑reconnect + client‑side buffering
Explainable AI (trust) Grad‑CAM heatmaps + reliability index RI = (Confidence × Explainability)/Latency; clinical grade only if RI > 0.85
Code quality at scale Systematic fixing of 187 errors; ESLint, Ruff, Black, type checking
Production path resolution Dynamic BASE_DIR logic for Docker vs local dev
UTF‑16/UTF‑8 encoding mismatch (hugging face) Binary‑level diagnosis + normalisation layer
WebRTC in Docker Extensive STUN/TURN + CORS headers
DeepSeek cold start (14B) Warmup script on server startup
PHI scrubbing Custom redaction of 15 PHI categories from Sentry

✨ Accomplishments We’re Proud Of

  1. Zero errors across 50,000+ lines – TypeScript & Python.
  2. Five production AI models – 85‑95% accuracy, deployed live.
  3. MCP server innovation – only platform with live tool testing, Redis caching, WebSocket streaming, and FHIR R4 exports.
  4. Explainable AI (Grad‑CAM) – 94.2% confidence, 87.5% explainability, 91.8% clinical reliability.
  5. Full compliance architecture – FDA APM, IEC 62304, SOC 2, HIPAA, FHIR R4.
  6. 6‑language support covering 1.2 billion people + real‑time translation.
  7. Autonomous AI Nurse – daily adherence calls with side‑effect detection & escalation.
  8. Production deployment – 11/11 services live, 99.99% uptime.

🧠 What We Learned

  • Medical AI threshold tuning matters more than raw accuracy – e.g., cataract model at 0.20 threshold gave 96% sensitivity (vs 85% at 0.5).
  • Multimodal fusion (text + acoustic + facial) beats any single modality for mental health.
  • Compliance must be built into architecture from day one – not bolted on later.
  • Caching is a performance superpower – 95% faster responses with Redis.
  • Gamification increased simulated patient engagement by 40% in user testing.
  • Language support reduced task completion time by 60% for non‑English speakers.
  • Explainability builds trust – doctors need to see why, not just the result.

🔮 What’s Next for Netra AI

Immediate (Post‑Hackathon)

  • Integrate IBM watsonx.ai through our MCP bridge for enterprise‑grade Vision Transformers.
  • Deploy on IBM Cloud / IBM Z for mission‑critical reliability.
  • Beta launch – 100 users → feedback loop.
  • Regulatory filings – HIPAA audit, FDA AI/ML pre‑cert.

Short‑term (3‑6 months)

  • Launch in 3 pilot regions (rural clinics).
  • Add 5 more AI models (chronic disease, predictive analytics).
  • Native iOS & Android apps with offline capabilities.

Long‑term (1‑2 years)

  • Expand to 10 countries + 10 languages.
  • Enterprise partnerships – insurance, EHRs, NGOs.
  • Pilot in rural clinics by Q3 2026 – targeting 90% reduction in initial screening costs.
  • Publish research papers and contribute to open‑source healthcare AI.

Ultimate Goal

Make quality healthcare accessible to everyone, eliminate healthcare inequality, prevent millions of cases of blindness and anaemia, and achieve UN SDG 3 (Good Health & Well‑being) for all.

📊 UN Sustainable Development Goals Alignment

Primary: SDG 3 – Good Health and Well‑being

  • Universal health coverage via telemedicine & affordable AI ($5‑20 vs $50‑200)
  • Early detection prevents blindness, anaemia, and complications

Secondary: SDG 10 – Reduced Inequalities

  • Geographic, economic, linguistic (6 languages), and disability inclusion (WCAG, screen readers)

Tertiary: SDG 9 – Industry, Innovation, and Infrastructure

  • 5 production AI models, XAI, MCP server, cloud‑based scalable infrastructure

🔗 Links

“Netra” – Sanskrit for “eye” – giving sight to healthcare systems worldwide.

Built With

+ 3 more
Share this project:

Updates