What it does
StillTale is an AI-powered video generation platform that transforms text prompts into complete videos with:
Automatic story generation - Enter a simple prompt and AI expands it into a full narrative
Character consistency - AI identifies characters and generates reference images to maintain visual consistency across scenes
Scene-by-scene visualization - Stories are broken into scenes with AI-generated images
Voiceover narration - Text-to-speech converts narration into audio
Automatic video assembly - Images and audio are merged into a polished MP4 video
Users simply log in, enter a prompt like "A brave knight rescues a dragon from a princess," and receive a complete video within minutes.
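The flow above, and in particular the reference-first ordering behind character consistency, can be sketched roughly as follows. This is an illustration only, not StillTale's actual code: `Scene`, `run_pipeline`, and the three injected callables (`expand_story`, `make_reference`, `render_scene`) are hypothetical names standing in for the story model, the text-to-image call, and the image-to-image call.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List, Optional


@dataclass
class Scene:
    narration: str
    characters: List[str]
    image: Optional[str] = None  # path or ID of the generated scene image


def run_pipeline(
    prompt: str,
    expand_story: Callable[[str], List[Scene]],             # hypothetical: story model
    make_reference: Callable[[str], str],                   # hypothetical: text-to-image
    render_scene: Callable[[Scene, Dict[str, str]], str],   # hypothetical: image-to-image
) -> List[Scene]:
    """Generate one reference image per character before rendering any scene."""
    scenes = expand_story(prompt)

    # Collect character names in first-appearance order, without duplicates.
    names = list(dict.fromkeys(n for s in scenes for n in s.characters))

    # Each character gets exactly one reference image, reused by every scene.
    references = {name: make_reference(name) for name in names}

    for scene in scenes:
        refs = {n: references[n] for n in scene.characters}
        scene.image = render_scene(scene, refs)
    return scenes
```

Passing the model calls in as parameters keeps the ordering logic testable with stubs, without any network access.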
How we built it
Backend (Python/FastAPI):
FastAPI for the REST API, with JWT authentication
Google Gemini AI for story generation, character identification, and scene creation
Bria API for AI image generation (text-to-image and image-to-image)
gTTS for text-to-speech narration
OpenCV and FFmpeg for video assembly
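As a rough idea of what the assembly step can look like, the sketch below builds FFmpeg command lines that turn one still image plus one narration file into a clip, then join the clips. It is an assumption about one workable approach using only the FFmpeg CLI, not the project's actual code (which also uses OpenCV); the function names and file paths are made up for illustration.

```python
from typing import List


def scene_clip_cmd(image: str, audio: str, out: str) -> List[str]:
    """FFmpeg arguments for a clip showing one still image for the audio's duration."""
    return [
        "ffmpeg", "-y",
        "-loop", "1", "-i", image,   # repeat the still image as a video stream
        "-i", audio,                 # narration track
        "-c:v", "libx264", "-tune", "stillimage",
        "-pix_fmt", "yuv420p",       # widely compatible pixel format
        "-c:a", "aac",
        "-shortest",                 # stop when the audio ends
        out,
    ]


def concat_list(clips: List[str]) -> str:
    """Contents of the text file read by FFmpeg's concat demuxer."""
    return "".join(f"file '{clip}'\n" for clip in clips)


def concat_cmd(list_file: str, out: str) -> List[str]:
    """FFmpeg arguments that join the clips listed in list_file without re-encoding."""
    return ["ffmpeg", "-y", "-f", "concat", "-safe", "0",
            "-i", list_file, "-c", "copy", out]
```

Each command list can be handed to `subprocess.run(cmd, check=True)`. Stream copy in the final step works here because every clip is encoded with the same settings.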
Frontend (React):
React with React Router for navigation
Tailwind CSS for modern, responsive UI
Challenges we ran into
Character consistency - Maintaining the same character appearance across multiple scenes was difficult. We solved this by generating character reference images first and using image-to-image generation for scenes.
API rate limiting - The Bria image API has rate limits. We implemented retry logic with exponential backoff and polling for async image generation.
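A minimal sketch of the retry and polling pattern mentioned above. It does not reproduce the Bria client: `RateLimited` is a hypothetical exception that a client wrapper would raise on a rate-limit response, and `call` and `check` are placeholders for the real request and status functions. The delay values are illustrative defaults.

```python
import random
import time
from typing import Callable, Optional, TypeVar

T = TypeVar("T")


class RateLimited(Exception):
    """Hypothetical error raised by a client wrapper when the API rate-limits us."""


def with_backoff(call: Callable[[], T], max_attempts: int = 5,
                 base_delay: float = 1.0, max_delay: float = 30.0,
                 sleep: Callable[[float], None] = time.sleep) -> T:
    """Retry call() on RateLimited, doubling the wait after each failure."""
    if max_attempts < 1:
        raise ValueError("max_attempts must be at least 1")
    attempt = 0
    while True:
        try:
            return call()
        except RateLimited:
            attempt += 1
            if attempt >= max_attempts:
                raise
            delay = min(max_delay, base_delay * 2 ** (attempt - 1))
            # Up to 10% jitter so parallel workers do not retry in lockstep.
            sleep(delay + random.uniform(0, delay * 0.1))


def poll_until_ready(check: Callable[[], Optional[T]], interval: float = 2.0,
                     timeout: float = 120.0,
                     sleep: Callable[[float], None] = time.sleep) -> T:
    """Call check() until it returns a non-None result or the timeout passes."""
    deadline = time.monotonic() + timeout
    while True:
        result = check()
        if result is not None:
            return result
        if time.monotonic() >= deadline:
            raise TimeoutError("image generation did not finish in time")
        sleep(interval)
```

In use, the submit request would be wrapped in `with_backoff`, and the returned job would then be passed to `poll_until_ready` with a `check` function that returns the image URL once it is available and `None` otherwise.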
What's next for StillTale
Add a unique voice for each character
Built With
- bria
- javascript
- python
- vertex-ai