Vio - AI-Native Learning Companion

Inspiration

The inspiration for Vio came from a fundamental problem in modern education: information overload and fragmented learning experiences. In an age where knowledge is scattered across countless documents, videos, websites, and multimedia content, learners struggle to create cohesive, personalized learning paths from diverse sources.

We were inspired by the vision of creating a "second brain" for learners - an AI-native platform that could understand, organize, and transform any type of content into meaningful educational experiences. The idea was to build something that could take a student's collection of PDFs, YouTube videos, web articles, and images, and intelligently weave them together into personalized learning journeys.

The TiDB AgentX Hackathon provided the perfect opportunity to explore this vision, combining cutting-edge database technology with advanced AI capabilities to create something truly innovative in the educational technology space.

What it does

Vio is an AI-native learning companion that transforms how people learn from diverse content sources. It's designed to be your "second brain" - a platform that understands, organizes, and learns from any type of content you throw at it.

Core Capabilities

Universal Content Intelligence: Upload and process documents (PDF, Word, PowerPoint, Excel), images, YouTube videos, and website links. AI extracts, analyzes, and organizes all content types intelligently using advanced OCR, transcript extraction, and web scraping.

AI-Powered Learning Agents: Multi-step AI workflows that create personalized learning paths, conduct comprehensive research, and orchestrate adaptive study sessions. The system includes specialized agents for different learning tasks.

Intelligent Content Creation: Generate educational video scripts, video, quizzes, and summaries from your documents automatically. The platform can create engaging educational content from any source material.

Advanced Search & Discovery: TiDB-powered semantic search across all documents and content with web search integration, helping you find relevant information instantly.

Interactive Learning Features: Take AI-generated quizzes with multiple question types, participate in audio-based listening tests with text-to-speech integration, and engage in real-time study sessions with immediate feedback and explanations.

Performance Analytics: Detailed quiz results with topic-based performance analysis, learning trends, and improvement recommendations.

Workspace Organization: Create folders and organize your learning materials with hierarchical structure. Manage your content efficiently with hover-based UI for easy deletion and organization.

Smart Summarization: Get automatic content summarization using Gemini AI for better comprehension and quick overviews of complex materials.

Responsive Design: Enjoy an optimized experience across desktop and mobile devices with scrollable content areas and intuitive navigation.

Key Features

  • Multi-format Content Support: Documents, images, YouTube videos, web links
  • AI Chat Interface: Context-aware conversations about your content
  • Smart Summarization: Automatic content analysis using Gemini AI
  • Intelligent Quizzing: Generate and take quizzes with multiple question types
  • Listening Tests: Audio-based comprehension with text-to-speech
  • Workspace Organization: Hierarchical folder structure for content
  • Real-time Study Sessions: Interactive learning with immediate feedback -** Learning Script Studio**: Generates educational video scripts and video from content

How we built it

Architecture Overview

Vio is built on a modern, scalable architecture that combines multiple cutting-edge technologies:

Frontend: Next.js 14 with React 18, TypeScript, and Tailwind CSS for a responsive, modern interface Backend: Next.js API Routes with serverless functions Database: TiDB Serverless as the primary database with full-text search capabilities Authentication: Appwrite for user management and file storage AI Services: Multi-model architecture with Groq (primary), OpenAI (fallback), and Gemini AI File Processing: Advanced PDF processing, OCR, YouTube transcript extraction, and web scraping

Technical Implementation

1. Universal Content Processing Pipeline

We built a sophisticated content processing system that can handle any type of learning material:

// Multi-format content extraction
const processContent = async (file: File, type: string) => {
  switch (type) {
    case 'pdf': return await extractPDFContent(file);
    case 'image': return await performOCR(file);
    case 'youtube': return await extractTranscript(url);
    case 'web': return await scrapeWebContent(url);
    // ... other formats
  }
};

2. Advanced RAG (Retrieval-Augmented Generation) System

The core of Vio's intelligence lies in its RAG system built on TiDB:

  • Content Indexing: All uploaded content is processed and indexed in TiDB
  • Semantic Search: TiDB's full-text search capabilities enable intelligent content discovery
  • Context Assembly: Relevant content chunks are retrieved and assembled for AI processing
  • Response Generation: Multi-model AI generates contextual responses based on retrieved content

3. Multi-Model AI Architecture

We implemented a robust AI system with multiple fallback layers:

// AI Model Selection with Fallbacks
const generateResponse = async (messages: ChatMessage[]) => {
  try {
    // Try Groq first (primary)
    return await callGroqAPI(messages);
  } catch (groqError) {
    try {
      // Fallback to OpenAI
      return await callOpenAIAPI(messages);
    } catch (openaiError) {
      // Final fallback to Gemini
      return await callGeminiAPI(messages);
    }
  }
};

4. TiDB Integration

TiDB serves as the backbone of our data architecture:

  • Schema Design: Comprehensive database schema supporting users, workspaces, content, and learning analytics
  • Search Implementation: Native full-text search with LIKE queries for content matching
  • User Isolation: Proper data segregation and access control
  • Performance Optimization: Optimized queries for fast response times

5. AI Agent Workflows

We built several specialized AI agents for different learning tasks:

Learning Path Generator: Analyzes user content and goals to create personalized learning journeys Research Assistant: Conducts comprehensive research across documents with AI-powered analysis Study Session Orchestrator: Creates adaptive study sessions with real-time difficulty adjustment Video Script Studio: Generates educational video scripts from content

Development Process

  1. Planning Phase: Designed comprehensive database schema and API architecture
  2. Core Development: Built content processing pipeline and TiDB integration
  3. AI Integration: Implemented multi-model AI system with fallback mechanisms
  4. Agent Development: Created specialized AI agents for different learning tasks
  5. UI/UX Implementation: Built responsive interface with modern design principles
  6. Testing & Optimization: Comprehensive testing and performance optimization

Challenges we ran into

1. Multi-Format Content Processing

Challenge: Building a unified system that could process PDFs, images, YouTube videos, and web content with consistent quality.

Solution: Created a modular content processing pipeline with specialized extractors for each format, including advanced PDF parsing, OCR for images, YouTube API integration, and web scraping with Firecrawl.

2. AI Model Reliability and Fallbacks

Challenge: Ensuring consistent AI responses when different models have varying availability and performance characteristics.

Solution: Implemented a sophisticated fallback system with Groq as primary, OpenAI as secondary, and Gemini as tertiary. Added response validation and error recovery mechanisms.

3. TiDB Integration and Search

Challenge: Migrating from Appwrite search to TiDB while maintaining performance and implementing proper user data isolation.

Solution: Built a comprehensive user ID mapping system between Appwrite and TiDB, implemented workspace-scoped search, and optimized queries for better performance.

4. Complex AI Agent Workflows

Challenge: Creating multi-step AI workflows that could handle complex learning tasks like path generation and research assistance.

Solution: Designed modular agent architecture with clear separation of concerns, implemented state management for multi-step processes, and added comprehensive error handling.

5. Real-time Performance

Challenge: Ensuring fast response times for AI-powered features while maintaining quality.

Solution: Implemented caching strategies, optimized database queries, and used streaming responses for better user experience.

6. Content Security and User Isolation

Challenge: Ensuring user data privacy and proper access control across all content types.

Solution: Implemented comprehensive authentication checks, workspace-based data isolation, and proper authorization for all operations.

Accomplishments that we're proud of

1. Universal Content Intelligence

We successfully built a system that can process and understand any type of learning content - from complex PDFs to YouTube videos to web articles. This creates a truly unified learning experience.

2. Advanced AI Agent Architecture

Created sophisticated AI agents that can handle complex, multi-step learning workflows. The Learning Path Generator, Research Assistant, and Study Session Orchestrator represent significant innovation in educational AI.

3. Robust Multi-Model AI System

Implemented a production-ready AI system with multiple fallback layers, ensuring high reliability and consistent performance across different AI services.

4. TiDB Integration Excellence

Successfully integrated TiDB as the primary database, leveraging its full-text search capabilities and building a scalable data architecture that supports complex learning analytics.

5. Production-Ready Implementation

Built a comprehensive, production-ready application with proper error handling, security measures, and user experience considerations.

6. Innovative Learning Features

Created unique features like AI-powered video script generation, adaptive study sessions, and intelligent content summarization that push the boundaries of educational technology.

What we learned

Technical Learnings

Database Architecture: Learned to design comprehensive schemas that support complex learning workflows while maintaining performance and scalability.

AI Integration: Gained deep understanding of multi-model AI systems, prompt engineering, and response validation techniques.

Content Processing: Mastered various content extraction techniques including OCR, transcript processing, and web scraping.

TiDB Capabilities: Discovered the power of TiDB's full-text search and learned to optimize queries for educational use cases.

Product Learnings

User Experience: Learned the importance of seamless content upload and organization in educational platforms.

AI Reliability: Understood the critical importance of fallback mechanisms and error handling in AI-powered applications.

Learning Science: Gained insights into how AI can enhance learning through personalized content and adaptive difficulty.

Process Learnings

Iterative Development: Learned to build complex features incrementally, testing and refining each component.

Error Handling: Developed robust error handling strategies that maintain user experience even when services fail.

Performance Optimization: Learned to balance feature richness with performance requirements.

What's next for Vio

Immediate Enhancements (Next 1 months )

Vector Search Implementation: Integrate TiDB's vector search capabilities for improved semantic similarity and content discovery.

Mobile Application: Develop native mobile apps for iOS and Android to make learning accessible anywhere.

Real-time Collaboration: Add multi-user study sessions and shared workspaces for collaborative learning.

Advanced Analytics: Implement machine learning insights and personalized learning recommendations.

Medium-term Goals

Integration Ecosystem: Build APIs for integration with popular learning management systems and educational platforms.

Content Marketplace: Create a marketplace where users can share and discover educational content.

AI Model Fine-tuning: Customize AI models specifically for educational content and learning patterns.

Advanced Video Features: Implement video generation capabilities with visual elements and interactive components.

Long-term Vision

Global Learning Platform: Scale Vio to support millions of learners worldwide with localized content and languages.

AI Tutoring System: Develop advanced AI tutoring capabilities that can provide personalized instruction and feedback.

Research Platform: Create tools for educational researchers to study learning patterns and optimize educational content.

Enterprise Solutions: Build enterprise-grade solutions for schools, universities, and corporate training programs.

Technical Roadmap

Microservices Architecture: Decompose the monolithic application into microservices for better scalability.

Event Streaming: Implement real-time updates and notifications using event streaming technologies.

Advanced Caching: Integrate Redis for improved performance and reduced database load.

CDN Integration: Implement global content delivery for faster access to educational materials.

Security Enhancements: Add advanced encryption, audit logging, and compliance features for enterprise use.


Vio represents our vision for the future of education - where AI and advanced database technology come together to create truly personalized, intelligent learning experiences. We're excited to continue building and improving this platform to help learners worldwide achieve their educational goals.

TiDB Cloud Account :

23f3003794@ds.study.iitm.ac.in

Built With

Share this project:

Updates