MediMemo

🧠 Inspiration

Healthcare data is fragmented, hard to interpret, and often overwhelming for both patients and clinicians. We were inspired to build MediMemo to simplify medical understanding by transforming complex records into clear, actionable insights. Our goal was to create a system that not only retrieves information but also remembers, connects, and explains it over time.

🚀 What it does

MediMemo is an AI-powered medical assistant that enables users to interact with healthcare data in a conversational way.

Upload and analyze medical documents (PDFs, reports, notes)
Perform intelligent search using hybrid retrieval (vector + keyword)
Provide contextual answers with precise citations
Maintain longitudinal memory of patient history
Support multimodal inputs (text, images, future audio/video)
Deliver real-time, streaming responses through an intuitive UI

⚙️ How we built it

We designed MediMemo as a modular, scalable system:

Frontend: Next.js + assistant-ui for real-time conversational UI
Backend: FastAPI handling ingestion, querying, and orchestration
RAG Pipeline:
- Chunking (500–800 tokens with overlap)
- Hybrid retrieval (BM25 + vector search)
- Reranking for relevance
Vector DB: Weaviate / Qdrant for semantic search
Storage: MinIO for documents and multimodal assets
LLM Layer: Self-hosted / API-based LLM for reasoning and response generation
Streaming: SSE for real-time responses
Deployment: Docker + Azure Container Apps (Terraform for IaC)

⚠️ Challenges we ran into

Multimodal complexity: Handling text, images, and structured medical data consistently
Latency vs accuracy trade-offs: Balancing fast responses with deep retrieval
Chunking strategy: Avoiding context loss while keeping retrieval efficient
Frontend-backend sync: Managing streaming responses and state updates reliably
Environment configuration: Fixing hardcoded localhost dependencies during cloud deployment