-
-
home page
-
home page
-
dashboard
-
Learning Script Studio Generate Video Scripts
-
Learning Script Studio Generate Video Scripts
-
Learning Script Studio Generate Video
-
Video Generated
-
generated video player
-
generated video player settings
-
dashboard page with ai chat
-
ai chat and saved items view page take notes, generate quick summary
-
saved item view page take notes, generate quick summary
-
saved item view page view up next, navigate, take notes, generate quick summary
-
quiz taken result page, view all your quiz results score, insights and question answers, retake the quiz
-
quiz page
-
listening test page
-
learning path generator
-
research assistant
-
study orchestrator
-
performance dashboard
-
folder page
-
settings page, select beautiful themes
-
search page, search saved items, quiz results, folders and web search
Vio - AI-Native Learning Companion
Inspiration
The inspiration for Vio came from a fundamental problem in modern education: information overload and fragmented learning experiences. In an age where knowledge is scattered across countless documents, videos, websites, and multimedia content, learners struggle to create cohesive, personalized learning paths from diverse sources.
We were inspired by the vision of creating a "second brain" for learners - an AI-native platform that could understand, organize, and transform any type of content into meaningful educational experiences. The idea was to build something that could take a student's collection of PDFs, YouTube videos, web articles, and images, and intelligently weave them together into personalized learning journeys.
The TiDB AgentX Hackathon provided the perfect opportunity to explore this vision, combining cutting-edge database technology with advanced AI capabilities to create something truly innovative in the educational technology space.
What it does
Vio is an AI-native learning companion that transforms how people learn from diverse content sources. It's designed to be your "second brain" - a platform that understands, organizes, and learns from any type of content you throw at it.
Core Capabilities
Universal Content Intelligence: Upload and process documents (PDF, Word, PowerPoint, Excel), images, YouTube videos, and website links. AI extracts, analyzes, and organizes all content types intelligently using advanced OCR, transcript extraction, and web scraping.
AI-Powered Learning Agents: Multi-step AI workflows that create personalized learning paths, conduct comprehensive research, and orchestrate adaptive study sessions. The system includes specialized agents for different learning tasks.
Intelligent Content Creation: Generate educational video scripts, video, quizzes, and summaries from your documents automatically. The platform can create engaging educational content from any source material.
Advanced Search & Discovery: TiDB-powered semantic search across all documents and content with web search integration, helping you find relevant information instantly.
Interactive Learning Features: Take AI-generated quizzes with multiple question types, participate in audio-based listening tests with text-to-speech integration, and engage in real-time study sessions with immediate feedback and explanations.
Performance Analytics: Detailed quiz results with topic-based performance analysis, learning trends, and improvement recommendations.
Workspace Organization: Create folders and organize your learning materials with hierarchical structure. Manage your content efficiently with hover-based UI for easy deletion and organization.
Smart Summarization: Get automatic content summarization using Gemini AI for better comprehension and quick overviews of complex materials.
Responsive Design: Enjoy an optimized experience across desktop and mobile devices with scrollable content areas and intuitive navigation.
Key Features
- Multi-format Content Support: Documents, images, YouTube videos, web links
- AI Chat Interface: Context-aware conversations about your content
- Smart Summarization: Automatic content analysis using Gemini AI
- Intelligent Quizzing: Generate and take quizzes with multiple question types
- Listening Tests: Audio-based comprehension with text-to-speech
- Workspace Organization: Hierarchical folder structure for content
- Real-time Study Sessions: Interactive learning with immediate feedback -** Learning Script Studio**: Generates educational video scripts and video from content
How we built it
Architecture Overview
Vio is built on a modern, scalable architecture that combines multiple cutting-edge technologies:
Frontend: Next.js 14 with React 18, TypeScript, and Tailwind CSS for a responsive, modern interface Backend: Next.js API Routes with serverless functions Database: TiDB Serverless as the primary database with full-text search capabilities Authentication: Appwrite for user management and file storage AI Services: Multi-model architecture with Groq (primary), OpenAI (fallback), and Gemini AI File Processing: Advanced PDF processing, OCR, YouTube transcript extraction, and web scraping
Technical Implementation
1. Universal Content Processing Pipeline
We built a sophisticated content processing system that can handle any type of learning material:
// Multi-format content extraction
const processContent = async (file: File, type: string) => {
switch (type) {
case 'pdf': return await extractPDFContent(file);
case 'image': return await performOCR(file);
case 'youtube': return await extractTranscript(url);
case 'web': return await scrapeWebContent(url);
// ... other formats
}
};
2. Advanced RAG (Retrieval-Augmented Generation) System
The core of Vio's intelligence lies in its RAG system built on TiDB:
- Content Indexing: All uploaded content is processed and indexed in TiDB
- Semantic Search: TiDB's full-text search capabilities enable intelligent content discovery
- Context Assembly: Relevant content chunks are retrieved and assembled for AI processing
- Response Generation: Multi-model AI generates contextual responses based on retrieved content
3. Multi-Model AI Architecture
We implemented a robust AI system with multiple fallback layers:
// AI Model Selection with Fallbacks
const generateResponse = async (messages: ChatMessage[]) => {
try {
// Try Groq first (primary)
return await callGroqAPI(messages);
} catch (groqError) {
try {
// Fallback to OpenAI
return await callOpenAIAPI(messages);
} catch (openaiError) {
// Final fallback to Gemini
return await callGeminiAPI(messages);
}
}
};
4. TiDB Integration
TiDB serves as the backbone of our data architecture:
- Schema Design: Comprehensive database schema supporting users, workspaces, content, and learning analytics
- Search Implementation: Native full-text search with LIKE queries for content matching
- User Isolation: Proper data segregation and access control
- Performance Optimization: Optimized queries for fast response times
5. AI Agent Workflows
We built several specialized AI agents for different learning tasks:
Learning Path Generator: Analyzes user content and goals to create personalized learning journeys Research Assistant: Conducts comprehensive research across documents with AI-powered analysis Study Session Orchestrator: Creates adaptive study sessions with real-time difficulty adjustment Video Script Studio: Generates educational video scripts from content
Development Process
- Planning Phase: Designed comprehensive database schema and API architecture
- Core Development: Built content processing pipeline and TiDB integration
- AI Integration: Implemented multi-model AI system with fallback mechanisms
- Agent Development: Created specialized AI agents for different learning tasks
- UI/UX Implementation: Built responsive interface with modern design principles
- Testing & Optimization: Comprehensive testing and performance optimization
Challenges we ran into
1. Multi-Format Content Processing
Challenge: Building a unified system that could process PDFs, images, YouTube videos, and web content with consistent quality.
Solution: Created a modular content processing pipeline with specialized extractors for each format, including advanced PDF parsing, OCR for images, YouTube API integration, and web scraping with Firecrawl.
2. AI Model Reliability and Fallbacks
Challenge: Ensuring consistent AI responses when different models have varying availability and performance characteristics.
Solution: Implemented a sophisticated fallback system with Groq as primary, OpenAI as secondary, and Gemini as tertiary. Added response validation and error recovery mechanisms.
3. TiDB Integration and Search
Challenge: Migrating from Appwrite search to TiDB while maintaining performance and implementing proper user data isolation.
Solution: Built a comprehensive user ID mapping system between Appwrite and TiDB, implemented workspace-scoped search, and optimized queries for better performance.
4. Complex AI Agent Workflows
Challenge: Creating multi-step AI workflows that could handle complex learning tasks like path generation and research assistance.
Solution: Designed modular agent architecture with clear separation of concerns, implemented state management for multi-step processes, and added comprehensive error handling.
5. Real-time Performance
Challenge: Ensuring fast response times for AI-powered features while maintaining quality.
Solution: Implemented caching strategies, optimized database queries, and used streaming responses for better user experience.
6. Content Security and User Isolation
Challenge: Ensuring user data privacy and proper access control across all content types.
Solution: Implemented comprehensive authentication checks, workspace-based data isolation, and proper authorization for all operations.
Accomplishments that we're proud of
1. Universal Content Intelligence
We successfully built a system that can process and understand any type of learning content - from complex PDFs to YouTube videos to web articles. This creates a truly unified learning experience.
2. Advanced AI Agent Architecture
Created sophisticated AI agents that can handle complex, multi-step learning workflows. The Learning Path Generator, Research Assistant, and Study Session Orchestrator represent significant innovation in educational AI.
3. Robust Multi-Model AI System
Implemented a production-ready AI system with multiple fallback layers, ensuring high reliability and consistent performance across different AI services.
4. TiDB Integration Excellence
Successfully integrated TiDB as the primary database, leveraging its full-text search capabilities and building a scalable data architecture that supports complex learning analytics.
5. Production-Ready Implementation
Built a comprehensive, production-ready application with proper error handling, security measures, and user experience considerations.
6. Innovative Learning Features
Created unique features like AI-powered video script generation, adaptive study sessions, and intelligent content summarization that push the boundaries of educational technology.
What we learned
Technical Learnings
Database Architecture: Learned to design comprehensive schemas that support complex learning workflows while maintaining performance and scalability.
AI Integration: Gained deep understanding of multi-model AI systems, prompt engineering, and response validation techniques.
Content Processing: Mastered various content extraction techniques including OCR, transcript processing, and web scraping.
TiDB Capabilities: Discovered the power of TiDB's full-text search and learned to optimize queries for educational use cases.
Product Learnings
User Experience: Learned the importance of seamless content upload and organization in educational platforms.
AI Reliability: Understood the critical importance of fallback mechanisms and error handling in AI-powered applications.
Learning Science: Gained insights into how AI can enhance learning through personalized content and adaptive difficulty.
Process Learnings
Iterative Development: Learned to build complex features incrementally, testing and refining each component.
Error Handling: Developed robust error handling strategies that maintain user experience even when services fail.
Performance Optimization: Learned to balance feature richness with performance requirements.
What's next for Vio
Immediate Enhancements (Next 1 months )
Vector Search Implementation: Integrate TiDB's vector search capabilities for improved semantic similarity and content discovery.
Mobile Application: Develop native mobile apps for iOS and Android to make learning accessible anywhere.
Real-time Collaboration: Add multi-user study sessions and shared workspaces for collaborative learning.
Advanced Analytics: Implement machine learning insights and personalized learning recommendations.
Medium-term Goals
Integration Ecosystem: Build APIs for integration with popular learning management systems and educational platforms.
Content Marketplace: Create a marketplace where users can share and discover educational content.
AI Model Fine-tuning: Customize AI models specifically for educational content and learning patterns.
Advanced Video Features: Implement video generation capabilities with visual elements and interactive components.
Long-term Vision
Global Learning Platform: Scale Vio to support millions of learners worldwide with localized content and languages.
AI Tutoring System: Develop advanced AI tutoring capabilities that can provide personalized instruction and feedback.
Research Platform: Create tools for educational researchers to study learning patterns and optimize educational content.
Enterprise Solutions: Build enterprise-grade solutions for schools, universities, and corporate training programs.
Technical Roadmap
Microservices Architecture: Decompose the monolithic application into microservices for better scalability.
Event Streaming: Implement real-time updates and notifications using event streaming technologies.
Advanced Caching: Integrate Redis for improved performance and reduced database load.
CDN Integration: Implement global content delivery for faster access to educational materials.
Security Enhancements: Add advanced encryption, audit logging, and compliance features for enterprise use.
Vio represents our vision for the future of education - where AI and advanced database technology come together to create truly personalized, intelligent learning experiences. We're excited to continue building and improving this platform to help learners worldwide achieve their educational goals.
Log in or sign up for Devpost to join the conversation.