## Inspiration

We wanted to push interactive storytelling into a new era by combining text, visuals, video, and voice narration. Traditional story generators are limited to text or static images. Our vision was to create a fully immersive AI-powered storyworld where users see, hear, and interact with stories in real time, powered by FLUX.1 Kontext [dev] through FAL.ai.


## What it does

Kontext Storyworld Generator transforms user inputs into multi-modal AI stories:

  • Generates dynamic narratives via FLUX.1 Kontext [dev]
  • Produces AI-generated images and allows inline image editing
  • Creates video clips combining images, animations, and narration
  • Generates natural voice narration via ElevenLabs TTS
  • Exports complete stories (text, images, videos, audio) as JSON

All content flows seamlessly from FAL.ai / FLUX.1 Kontext pipelines, making it a fully interactive storytelling experience.
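
To make the JSON export concrete, here is a minimal sketch of what an exported story could look like. The `StoryScene` and `StoryExport` shapes and the `exportStory` function are assumptions for illustration; the write-up does not show the app's actual export schema.

```typescript
// Hypothetical shapes: the project's real export schema is not shown in this write-up.
interface StoryScene {
  text: string;
  imageUrl?: string; // generated or edited image
  videoUrl?: string; // generated clip
  audioUrl?: string; // TTS narration
}

interface StoryExport {
  title: string;
  exportedAt: string; // ISO 8601 timestamp
  scenes: StoryScene[];
}

// Serialize a story and its media references into a pretty-printed JSON string.
function exportStory(
  title: string,
  scenes: StoryScene[],
  now: Date = new Date()
): string {
  const payload: StoryExport = {
    title,
    exportedAt: now.toISOString(),
    // Copy each scene so later edits in the UI do not mutate the exported payload.
    scenes: scenes.map((scene) => ({ ...scene })),
  };
  return JSON.stringify(payload, null, 2);
}
```

In a browser, the returned string could be wrapped in a `Blob` and offered as a file download.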


## How we built it

  • Frontend: React + TypeScript, dual-panel interface, Framer Motion animations
  • State Management: Custom useStoryState hook for stories, images, videos, and audio
  • API Integration:
    • FLUX.1 Kontext [dev] via FAL.ai for story and image generation, image editing, and video creation
    • ElevenLabs TTS for voice narration
  • Production-ready: Modular architecture, type-safe, environment variable protection
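
As a rough sketch of the state management described above, the core of a hook like `useStoryState` could be a pure reducer that tracks scenes and their attached media. Everything below (`StoryState`, `StoryAction`, `storyReducer`, and the action names) is hypothetical; the actual hook is not shown in this write-up.

```typescript
// Hypothetical state model: the real useStoryState internals are not shown here.
type MediaKind = "image" | "video" | "audio";

interface SceneState {
  id: string;
  text: string;
  media: Partial<Record<MediaKind, string>>; // media kind -> URL
}

interface StoryState {
  scenes: SceneState[];
  loading: boolean;
}

type StoryAction =
  | { type: "generation_started" }
  | { type: "scene_added"; id: string; text: string }
  | { type: "media_attached"; id: string; kind: MediaKind; url: string }
  | { type: "generation_finished" };

const initialStoryState: StoryState = { scenes: [], loading: false };

// Return a copy of the scene with one media URL set; the input is not mutated.
function attachMedia(scene: SceneState, kind: MediaKind, url: string): SceneState {
  const media: Partial<Record<MediaKind, string>> = { ...scene.media };
  media[kind] = url;
  return { ...scene, media };
}

// Pure reducer: every branch returns a new state object.
function storyReducer(state: StoryState, action: StoryAction): StoryState {
  switch (action.type) {
    case "generation_started":
      return { ...state, loading: true };
    case "scene_added":
      return {
        ...state,
        scenes: [...state.scenes, { id: action.id, text: action.text, media: {} }],
      };
    case "media_attached":
      return {
        ...state,
        scenes: state.scenes.map((scene) =>
          scene.id === action.id ? attachMedia(scene, action.kind, action.url) : scene
        ),
      };
    case "generation_finished":
      return { ...state, loading: false };
    default:
      return state;
  }
}
```

Inside a React component, a custom hook could wrap this with `useReducer(storyReducer, initialStoryState)`. Keeping the reducer pure makes it testable without rendering anything.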


## Challenges we ran into

  • Coordinating multi-modal content (story text, images, videos, audio) in real time without performance drops
  • Ensuring image editing and video generation could run dynamically
  • Making the app demo-friendly without exposing API keys
  • Designing a scalable, type-safe architecture for future extensions
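
One way to approach the API key challenge is to keep user-supplied keys in browser storage only. The sketch below assumes a small `KeyValueStore` interface that `window.localStorage` happens to satisfy. The `Provider` names, the `storyworld.apiKey.` prefix, and all function names are made up for illustration and are not taken from the app's code.

```typescript
// Minimal storage interface; window.localStorage satisfies it in a browser.
interface KeyValueStore {
  getItem(key: string): string | null;
  setItem(key: string, value: string): void;
  removeItem(key: string): void;
}

// Hypothetical provider names and storage key prefix.
type Provider = "fal" | "elevenlabs";
const storageKey = (provider: Provider): string => `storyworld.apiKey.${provider}`;

// Save a trimmed key, or clear the entry when the input is empty.
function saveApiKey(store: KeyValueStore, provider: Provider, key: string): void {
  const trimmed = key.trim();
  if (trimmed === "") {
    store.removeItem(storageKey(provider));
    return;
  }
  store.setItem(storageKey(provider), trimmed);
}

function loadApiKey(store: KeyValueStore, provider: Provider): string | null {
  return store.getItem(storageKey(provider));
}

// Show only the last four characters, e.g. in a settings panel.
function maskKey(key: string): string {
  if (key.length <= 4) return "****";
  return "*".repeat(key.length - 4) + key.slice(-4);
}
```

In the browser this would be called as `saveApiKey(window.localStorage, "fal", inputValue)`. Passing the store as a parameter keeps the functions testable with an in-memory stand-in.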

## Accomplishments that we're proud of

  • Fully functional multi-modal story pipeline: text → images → edited images → video → TTS
  • Clean, professional, responsive UI/UX with smooth animations

## What we learned

  • How to combine multiple AI modalities into a cohesive platform
  • How to manage dynamic AI content generation in real time

## What's next for Kontext Storyworld Generator

  • Implement branching narratives and interactive decision-making
  • Add storybook saving and collaborative story editing
  • Extend video generation for cinematic sequences with transitions
  • Explore additional LoRA fine-tunes, voice styles, and AI visual styles for richer storytelling

I've deployed the app live under "Try it out" below so that everyone can test it with their own API keys. The keys are not stored on any server, only in your browser. To enter them, open the settings panel in the top header, as shown in the screenshot.
