Inspiration

Video storytelling is one of the most powerful ways to communicate, but producing high-quality video still requires expensive tools, production teams, and technical expertise. We were inspired by how difficult it is for individuals, startups, and small teams to turn a simple idea into something that looks truly cinematic.

We asked a simple question: what if creating a cinematic video felt more like directing a scene than editing a timeline?
That idea became AdMotion AI.


What it does

AdMotion AI is an AI-powered platform that transforms ideas into cinematic videos, narrative scenes, and audio storyboards.

Users can:

  • Generate commercial videos from a single image
  • Direct narrative scenes using AI characters, dialogue, and cinematic settings
  • Create narrated, slide-based stories with AI-generated visuals and voiceovers

Instead of managing complex production tools, users describe what they want to happen—and AdMotion AI handles the visuals, audio, and sequencing.


How we built it

We built AdMotion AI as a structured creative system rather than a single prompt-based generator.

The platform integrates:

  • AI image and video generation models for cinematic visuals
  • Voice synthesis models for dialogue and narration
  • A director-style interface that guides users through casting, scene design, and cinematography

The workflow mirrors how filmmakers think, while hiding technical complexity behind intuitive creative controls.

APIs Used

  • gemini-3-pro-preview
    Used for complex image analysis and reasoning, including understanding visual context and scene structure in analyzeImageForVideo.

  • gemini-3-pro-image-preview
    Used for high-quality image generation in generateBaseImage and generateStoryboardImage, ensuring cinematic and visually consistent outputs.

  • veo-3.1-fast-generate-preview
    Used for video generation in generateCommercialVideo and generateSceneVideo, transforming images and scene descriptions into dynamic video content.

  • gemini-2.5-flash-preview-tts
    Used for text-to-speech generation in generateDialogueAudio, producing natural-sounding dialogue and narration for characters and storyboards.


Challenges we ran into

One of our biggest challenges was balancing creative freedom with simplicity. Too many controls overwhelm users, while too few limit storytelling.

Other challenges included:

  • Maintaining character and scene consistency
  • Designing an interface that feels creative, not technical
  • Ensuring AI outputs feel intentional rather than random

We solved these through iterative UX design and constant testing.


Accomplishments that we're proud of

  • Built an end-to-end pipeline combining video, image, and audio generation
  • Designed a director-focused UI accessible to non-technical users
  • Enabled cinematic-quality output without traditional editing tools
  • Unified multiple AI modalities into one cohesive creative platform

What we learned

We learned that structure enhances creativity. Guided storytelling workflows help users produce higher-quality results and feel more confident experimenting.

We also learned that:

  • Users think in intent, not technical settings
  • Fast iteration is essential for creative tools
  • UX quality is as important as model performance

What's next for AdMotion

Next, we plan to expand AdMotion AI with:

  • Multi-scene and longer-form storytelling
  • Stronger character consistency across projects
  • Multi-language voices and narration
  • Scalable, personalized video generation

Our long-term vision is to make AdMotion AI a creative operating system for cinematic storytelling, where anyone can turn imagination into production-ready content.

Built With

  • gemini-2.5-flash-preview-tts
  • gemini-3-pro-image-preview
  • gemini-3-pro-preview
  • react
  • tailwind
  • typestript
  • veo-3.1-fast-generate-preview
Share this project:

Updates