Inspiration

Education inequality is a global crisis. Students in remote areas or with limited resources struggle to access quality STEM education. Traditional learning materials are static, one-size-fits-all, and fail to adapt to individual learning styles. We envisioned MINERVA as a solution that democratizes STEM education by transforming any document into a personalized, interactive learning experience powered by AI.

The name MINERVA comes from the Roman goddess of wisdom and strategic warfare—symbolizing our mission to empower students with knowledge and strategic thinking skills.

What it does

MINERVA is an AI-powered learning platform that revolutionizes STEM education through:

Intelligent Course Generation

  • OCR Technology: Upload PDFs, images, or documents in any format
  • AI Course Builder: Automatically generates structured learning paths with modules, topics, and summaries
  • Adaptive Content: Choose between simple summaries or detailed explanations based on your level

Interactive Learning Experience

  • AI-Generated Quizzes: Automatic quiz generation for each topic and module
  • Progress Tracking: Real-time monitoring of completed topics and quiz scores
  • Visual Knowledge Maps: See how concepts connect and build upon each other

3D Avatar Tutor (MINERVA)

  • Immersive Learning: Interactive 3D avatar with realistic animations
  • Lip-Sync Technology: Natural speech synchronized with facial expressions
  • Emotional Intelligence: Avatar responds with appropriate expressions (happy, surprised, focused)

AI Voice Conversations

  • Real-Time Voice Chat: Talk directly with MINERVA using OpenAI Realtime API
  • Natural Dialogue: Ask questions, get explanations, discuss concepts naturally
  • Multilingual Support: Available in English and French

Contextual Chatbot

  • Course-Aware Assistant: Chatbot understands your current course context
  • Instant Help: Get answers without leaving your learning flow
  • Persistent History: Conversations saved across sessions

Challenges we ran into

1. Real-Time Voice Streaming

Implementing bidirectional audio streaming with OpenAI's Realtime API was complex. We had to:

  • Handle WebSocket connections with proper error recovery
  • Manage audio buffer synchronization between capture and playback
  • Implement interrupt handling when users speak over the AI
  • Optimize for mobile network conditions

Solution: Built a robust state machine with automatic reconnection and buffer management.

2. 3D Avatar Performance

Rendering a VRM avatar with real-time animations while maintaining 60fps was challenging:

  • VRM model loading and parsing overhead
  • Lip-sync calculations from audio amplitude
  • Smooth animation blending between states
  • Mobile GPU limitations

Solution: Implemented efficient animation caching, reduced polygon count, and optimized shader usage.

3. OCR Accuracy & Speed

Processing large PDFs (50MB+) with multiple languages required:

  • Handling various document formats and encodings
  • Balancing accuracy vs. processing time
  • Managing memory for large files
  • Supporting both text and image-heavy documents

Solution: Deployed PaddleOCR on Hugging Face Spaces with 5-minute timeout and progress feedback.

4. Course Generation Quality

Ensuring AI-generated courses were pedagogically sound:

  • Maintaining logical topic progression
  • Generating relevant, challenging quizzes
  • Adapting to different education levels
  • Avoiding hallucinations in technical content

Solution: Crafted detailed system prompts with educational frameworks and implemented multi-step validation.

5. Internationalization at Scale

Supporting multiple languages across 80+ UI strings:

  • Managing translation keys across 14+ components
  • Ensuring consistent terminology
  • Handling dynamic content (user names, dates, counts)
  • Mobile vs. desktop text constraints

Solution: Centralized translation system with next-intl and structured key organization.

What we learned

  • WebRTC & Audio Processing: Deep dive into real-time audio streaming and Web Audio API
  • 3D Graphics Programming: Mastered Three.js, VRM format, and animation systems
  • AI Prompt Engineering: Learned to craft effective prompts for educational content generation
  • Supabase Advanced Features: Implemented complex RLS policies and real-time subscriptions
  • Next.js 16 Best Practices: Leveraged Server Components, Server Actions, and Turbopack
  • User-Centered Design: Importance of testing on real devices and gathering user feedback

MINERVA isn't just a hackathon project—it's the future of accessible, personalized STEM education.

Built With

Share this project:

Updates