-
landing page
-
feature page
-
sign up page
-
login page
-
stories lazy loading list
-
add a new story
-
stories found
-
story details
-
story pipeline processing 1
-
story pipeline processing 2
-
story pipeline processing 3
-
story pipeline successfully processed
-
story details after upload 1
-
story details after upload 1 , listening + reading captions
-
stories translations
Inspiration
Our inspiration came from witnessing the rapid loss of oral traditions and cultural heritage in our increasingly digital world. We realized that countless stories, folktales, and cultural narratives were disappearing as older generations passed away without their wisdom being preserved. Traditional storytelling methods were being replaced by modern media, leaving behind rich cultural legacies that could teach us valuable lessons about community, wisdom, and human connection.
We were particularly inspired by the West African griot tradition - master storytellers who preserved history, culture, and wisdom through oral narratives. In today's digital age, we saw an opportunity to combine this ancient tradition with cutting-edge AI technology to create a platform that could preserve these stories for future generations while making them accessible to a global audience.
What it does
AI Griot is a comprehensive digital platform that preserves and shares oral traditions through AI-enhanced storytelling. The platform automatically processes audio recordings of traditional stories, converting them into immersive multimedia experiences.
Core Features:
- Audio Processing: Upload and process stories in multiple audio formats with intelligent quality assessment
- AI Transcription: Convert speech to text with precise timestamps using Google Cloud Speech-to-Text
- Intelligent Enhancement: Analyze and improve content using Google Gemini AI for cultural context and sentiment analysis
- Multi-language Translation: Automated translation supporting Swahili, English, French, Spanish, and Arabic
- AI-Generated Illustrations: Create contextual visual illustrations for each story paragraph using Gemini AI
- Synchronized Playback: Audio-visual synchronization with real-time paragraph highlighting and cultural art styles
- Cultural Preservation: Maintains authentic voices while making content globally accessible
How we built it
We built AI Griot using a modern full-stack architecture with emphasis on scalability and cultural sensitivity.
Backend (FastAPI + Python):
- FastAPI framework for high-performance async API endpoints
- PostgreSQL database with asyncpg driver for robust data storage
- Redis for background task processing and session management
- Google Cloud integration (Speech-to-Text, Translate API, Gemini AI)
- JWT-based authentication with secure token management
- Comprehensive 7-step processing pipeline from upload to publication
Frontend (React + TypeScript):
- React 18 with TypeScript for type-safe development
- Vite for fast development and optimized production builds
- Tailwind CSS with custom "Griot" color palette and responsive design
- React Router DOM v6 for client-side navigation
- React Hook Form with Zod validation for robust form handling
- Web MediaRecorder API for browser-based audio recording
- Real-time synchronization with WebSocket connections
AI Processing Pipeline:
- Audio upload and validation
- Speech-to-text transcription with word-level timestamps
- Content enhancement and cultural analysis
- Multi-language translation
- Intelligent paragraph segmentation
- AI-generated contextual illustrations
- Synchronized audio-visual presentation
Challenges we ran into
Technical Challenges:
- Audio Synchronization: Achieving precise audio-visual synchronization within 100ms was complex, requiring careful timing calculations and WebSocket implementation
- AI Integration: Integrating multiple Google Cloud services while maintaining performance and handling API rate limits required sophisticated error handling and retry mechanisms
- Cultural Sensitivity: Ensuring AI-generated illustrations respected cultural contexts and avoided stereotypes required careful prompt engineering and cultural validation
- Multi-language Support: Implementing robust translation and transcription for diverse languages, especially Swahili, required extensive testing and optimization
Development Challenges:
- Real-time Processing: Managing background tasks for AI processing while providing real-time status updates required complex state management
- File Handling: Supporting multiple audio formats while maintaining quality and processing efficiency
- Responsive Design: Creating a seamless experience across devices while maintaining cultural authenticity
- Performance Optimization: Balancing AI processing quality with fast loading times and smooth user experience
Accomplishments that we're proud of
Technical Achievements:
- Successfully integrated multiple Google Cloud AI services into a cohesive processing pipeline
- Achieved sub-100ms audio-visual synchronization for immersive storytelling experience
- Implemented intelligent paragraph segmentation that maintains narrative flow
- Created culturally-sensitive AI illustration generation with language-specific art styles
- Built a responsive, accessible web application that works across all devices
Cultural Impact:
- Developed a platform that genuinely preserves cultural heritage while making it globally accessible
- Created AI systems that respect and enhance cultural contexts rather than homogenizing them
- Established a foundation for preserving endangered oral traditions
- Built a system that bridges traditional storytelling with modern technology
User Experience:
- Designed an intuitive interface that serves both storytellers and listeners
- Created a seamless upload-to-publication workflow for content creators
- Implemented comprehensive search and discovery features for story exploration
- Developed real-time processing status tracking for transparency
What we learned
Technical Insights:
- The importance of async processing for AI-heavy applications to maintain responsive user experience
- How to balance AI automation with human cultural oversight
- The complexity of maintaining cultural authenticity in AI-generated content
- The value of comprehensive error handling and graceful degradation in AI applications
Cultural Learnings:
- The deep importance of preserving oral traditions in maintaining cultural identity
- How technology can enhance rather than replace traditional storytelling methods
- The need for cultural sensitivity in AI applications
- The power of combining traditional wisdom with modern accessibility
Development Lessons:
- The importance of early user testing with diverse cultural backgrounds
- How to design systems that scale from individual stories to global cultural libraries
- The value of comprehensive documentation for cultural preservation projects
- The need for flexible architecture that can adapt to different cultural contexts
What's next for AI Griot
Immediate Roadmap (3-6 months):
- Community Features: Add collaborative storytelling and community moderation tools
- Advanced AI Models: Integrate DALL-E, Midjourney, or Stable Diffusion for enhanced illustration quality
- Mobile App: Develop native iOS and Android applications for mobile storytelling
- Analytics Dashboard: Comprehensive analytics for storytellers and cultural organizations
Medium-term Goals (6-12 months):
- Global Expansion: Support for 50+ languages and cultural regions
- Educational Integration: Partner with schools and cultural institutions for educational content
- AI Voice Cloning: Preserve authentic storyteller voices for future generations
- Interactive Features: Add clickable illustrations with cultural context and explanations
Long-term Vision (1-2 years):
- Cultural AI Assistant: AI-powered cultural context and translation assistance
- Virtual Reality: Immersive VR storytelling experiences
- Global Cultural Network: Connect storytellers and listeners worldwide
- Cultural Heritage Database: Comprehensive digital library of human stories and traditions
- UNESCO Partnership: Collaborate with cultural preservation organizations globally
Technology Evolution:
- Advanced AI Integration: Next-generation AI models for even more authentic cultural representation
- AR/VR Experiences: Augmented and virtual reality storytelling
- AI Cultural Training: Machine learning models specifically trained on cultural preservation
Our vision is to become the world's premier platform for cultural preservation, ensuring that every story, every tradition, and every voice has a place in our shared digital heritage.
Built With
- fastapi
- geminiai
- python
- react
- speech-to-text
- translateapi
- typescript


Log in or sign up for Devpost to join the conversation.