StoryGRIOT AI

landing page
feature page
sign up page
login page
stories lazy loading list
add a new story
stories found
story details
story pipeline processing 1
story pipeline processing 2
story pipeline processing 3
story pipeline successfully processed
story details after upload 1
story details after upload 1 , listening + reading captions
stories translations

Inspiration

Our inspiration came from witnessing the rapid loss of oral traditions and cultural heritage in our increasingly digital world. We realized that countless stories, folktales, and cultural narratives were disappearing as older generations passed away without their wisdom being preserved. Traditional storytelling methods were being replaced by modern media, leaving behind rich cultural legacies that could teach us valuable lessons about community, wisdom, and human connection.

We were particularly inspired by the West African griot tradition - master storytellers who preserved history, culture, and wisdom through oral narratives. In today's digital age, we saw an opportunity to combine this ancient tradition with cutting-edge AI technology to create a platform that could preserve these stories for future generations while making them accessible to a global audience.

What it does

AI Griot is a comprehensive digital platform that preserves and shares oral traditions through AI-enhanced storytelling. The platform automatically processes audio recordings of traditional stories, converting them into immersive multimedia experiences.

Core Features:

Audio Processing: Upload and process stories in multiple audio formats with intelligent quality assessment
AI Transcription: Convert speech to text with precise timestamps using Google Cloud Speech-to-Text
Intelligent Enhancement: Analyze and improve content using Google Gemini AI for cultural context and sentiment analysis
Multi-language Translation: Automated translation supporting Swahili, English, French, Spanish, and Arabic
AI-Generated Illustrations: Create contextual visual illustrations for each story paragraph using Gemini AI
Synchronized Playback: Audio-visual synchronization with real-time paragraph highlighting and cultural art styles
Cultural Preservation: Maintains authentic voices while making content globally accessible

How we built it

We built AI Griot using a modern full-stack architecture with emphasis on scalability and cultural sensitivity.

Backend (FastAPI + Python):

FastAPI framework for high-performance async API endpoints
PostgreSQL database with asyncpg driver for robust data storage
Redis for background task processing and session management
Google Cloud integration (Speech-to-Text, Translate API, Gemini AI)
JWT-based authentication with secure token management
Comprehensive 7-step processing pipeline from upload to publication

Frontend (React + TypeScript):

React 18 with TypeScript for type-safe development
Vite for fast development and optimized production builds
Tailwind CSS with custom "Griot" color palette and responsive design
React Router DOM v6 for client-side navigation
React Hook Form with Zod validation for robust form handling
Web MediaRecorder API for browser-based audio recording
Real-time synchronization with WebSocket connections

AI Processing Pipeline:

Audio upload and validation
Speech-to-text transcription with word-level timestamps
Content enhancement and cultural analysis
Multi-language translation
Intelligent paragraph segmentation
AI-generated contextual illustrations
Synchronized audio-visual presentation

Challenges we ran into

Technical Challenges:

Audio Synchronization: Achieving precise audio-visual synchronization within 100ms was complex, requiring careful timing calculations and WebSocket implementation
AI Integration: Integrating multiple Google Cloud services while maintaining performance and handling API rate limits required sophisticated error handling and retry mechanisms
Cultural Sensitivity: Ensuring AI-generated illustrations respected cultural contexts and avoided stereotypes required careful prompt engineering and cultural validation
Multi-language Support: Implementing robust translation and transcription for diverse languages, especially Swahili, required extensive testing and optimization

Development Challenges:

Real-time Processing: Managing background tasks for AI processing while providing real-time status updates required complex state management
File Handling: Supporting multiple audio formats while maintaining quality and processing efficiency
Responsive Design: Creating a seamless experience across devices while maintaining cultural authenticity
Performance Optimization: Balancing AI processing quality with fast loading times and smooth user experience

Accomplishments that we're proud of

Technical Achievements:

Successfully integrated multiple Google Cloud AI services into a cohesive processing pipeline
Achieved sub-100ms audio-visual synchronization for immersive storytelling experience
Implemented intelligent paragraph segmentation that maintains narrative flow
Created culturally-sensitive AI illustration generation with language-specific art styles
Built a responsive, accessible web application that works across all devices

Cultural Impact:

Developed a platform that genuinely preserves cultural heritage while making it globally accessible
Created AI systems that respect and enhance cultural contexts rather than homogenizing them
Established a foundation for preserving endangered oral traditions
Built a system that bridges traditional storytelling with modern technology

User Experience:

Designed an intuitive interface that serves both storytellers and listeners
Created a seamless upload-to-publication workflow for content creators
Implemented comprehensive search and discovery features for story exploration
Developed real-time processing status tracking for transparency

What we learned

Technical Insights:

The importance of async processing for AI-heavy applications to maintain responsive user experience
How to balance AI automation with human cultural oversight
The complexity of maintaining cultural authenticity in AI-generated content
The value of comprehensive error handling and graceful degradation in AI applications

Cultural Learnings:

The deep importance of preserving oral traditions in maintaining cultural identity
How technology can enhance rather than replace traditional storytelling methods
The need for cultural sensitivity in AI applications
The power of combining traditional wisdom with modern accessibility

Development Lessons:

The importance of early user testing with diverse cultural backgrounds
How to design systems that scale from individual stories to global cultural libraries
The value of comprehensive documentation for cultural preservation projects
The need for flexible architecture that can adapt to different cultural contexts

What's next for AI Griot

Immediate Roadmap (3-6 months):

Community Features: Add collaborative storytelling and community moderation tools
Advanced AI Models: Integrate DALL-E, Midjourney, or Stable Diffusion for enhanced illustration quality
Mobile App: Develop native iOS and Android applications for mobile storytelling
Analytics Dashboard: Comprehensive analytics for storytellers and cultural organizations

Medium-term Goals (6-12 months):

Global Expansion: Support for 50+ languages and cultural regions
Educational Integration: Partner with schools and cultural institutions for educational content
AI Voice Cloning: Preserve authentic storyteller voices for future generations
Interactive Features: Add clickable illustrations with cultural context and explanations

Long-term Vision (1-2 years):

Cultural AI Assistant: AI-powered cultural context and translation assistance
Virtual Reality: Immersive VR storytelling experiences
Global Cultural Network: Connect storytellers and listeners worldwide
Cultural Heritage Database: Comprehensive digital library of human stories and traditions
UNESCO Partnership: Collaborate with cultural preservation organizations globally

Technology Evolution:

Advanced AI Integration: Next-generation AI models for even more authentic cultural representation
AR/VR Experiences: Augmented and virtual reality storytelling
AI Cultural Training: Machine learning models specifically trained on cultural preservation

Our vision is to become the world's premier platform for cultural preservation, ensuring that every story, every tradition, and every voice has a place in our shared digital heritage.

Built With

fastapi
geminiai
python
react
speech-to-text
translateapi
typescript

Submitted to

AI HACKATHON POWERED BY GOOGLE
- Winner 2nd Place

Created by

I worked on the backend with FastAPI it was a little bit challenging. Because It was my first to use this technology. but I tried to do my best with my team mate.
Hope we are going to win and Launch this project that can help you to preserve our culture

Georges Kennel Kassi
Gosse Yannick Gbaka
BROU DAVID YAO