Screenshots: Onboarding Home Page, Dashboard, Fetch Agent completed, Ideation Agent completed, Media Upload Page, Media Selection Page, Storyboarding Agent completed, Architecture Diagram

CreatorPilot

What Inspired This Project

Modern content creation is still a long, manual pipeline: research trends, ideate, gather assets, edit, render, write metadata, publish, repeat.

CreatorPilot explores whether that pipeline can become multi-agent and autonomous, with agents coordinating across tools and outputs rather than answering one prompt at a time.

What I Learned

Agents work best when they orchestrate tools rather than generate everything themselves. The system is more reliable when the work is broken into stages and coordinated than when a single model call is asked to do everything.
Creative pipelines require deterministic components. AI generates ideas and text, but rendering, file management, and media handling need deterministic systems.
The biggest challenge is orchestration, not generation. Generation is easy; coordinating assets, story structure, rendering, and publishing outputs is the hard part.

How I Built It

CreatorPilot is a modular multi-agent pipeline implemented inside a Next.js monolith (no external orchestration infrastructure).

A central Orchestrator Agent owns shared workflow state and decides which specialist agent runs next. A Profile / Memory Agent (Prisma-backed local storage) loads creator preferences, source settings, recent renders, and publishing history.
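To make the "orchestrator owns state and picks the next agent" idea concrete, here is a minimal sketch. The stage names, the WorkflowState fields, and the fixed stage order per mode are all assumptions for illustration. The write-up does not show the project's actual decision logic, which may be more dynamic.

```typescript
// Hypothetical stage names, loosely following the agents described in this write-up.
type Stage =
  | "profile"
  | "trends"
  | "mediaSelection"
  | "ideation"
  | "storyboard"
  | "render"
  | "metadata"
  | "publish";

type WorkflowMode = "trend-led" | "media-led";

// Shared workflow state owned by the orchestrator (field names are assumptions).
interface WorkflowState {
  mode: WorkflowMode;
  completed: Stage[];
}

// Assumed fixed order per mode; a real orchestrator could branch on agent outputs.
const STAGE_ORDER: Record<WorkflowMode, Stage[]> = {
  "trend-led": ["profile", "trends", "ideation", "storyboard", "render", "metadata", "publish"],
  "media-led": ["profile", "mediaSelection", "ideation", "storyboard", "render", "metadata", "publish"],
};

// Returns the first stage in the mode's order that has not completed yet,
// or null when every stage is done.
function nextStage(state: WorkflowState): Stage | null {
  for (const stage of STAGE_ORDER[state.mode]) {
    if (state.completed.indexOf(stage) === -1) return stage;
  }
  return null;
}

// Usage: after the profile is loaded, a media-led run moves on to media selection.
const upcoming = nextStage({ mode: "media-led", completed: ["profile"] }); // "mediaSelection"
```

Keeping the decision in a pure function like this makes the orchestrator easy to test without running any agent.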

Core agents

Orchestrator Agent: coordinates trend discovery, ideation, storyboarding, rendering, metadata generation, and publishing workflows.
Profile / Memory Agent: loads creator profile data and history from Prisma-backed local storage.
Trend Discovery Agent: syncs enabled sources, fetches RSS entries, clusters related trends, and ranks them for creator fit.
Content Ideation Agent: converts trends or uploaded creator media into structured video concepts, hooks, and narrative outlines.
Media Selection Agent: resolves uploaded creator assets and prepares them for downstream planning and rendering.
Storyboard Agent: builds beat-by-beat coverage plans, evaluates whether existing media is sufficient, and prepares generated preview support when needed.
Render Agent: assembles narration, overlays, subtitles, and support media into FFmpeg-based video variants.
Metadata Agent: generates YouTube-ready titles, descriptions, tags, captions, and posting recommendations.
Publishing Agent: validates the selected render, checks readiness for upload, and performs live or mock YouTube publishing.
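As one illustration of the deterministic side of the Render Agent, the sketch below builds an argument list for the ffmpeg command-line tool. The RenderJob shape and the buildFfmpegArgs helper are hypothetical, and the project's real render commands are not shown in this write-up. The flags themselves (-i, -vf, -map, -c:v, -c:a, -shortest) are standard ffmpeg options. The subtitles filter needs an ffmpeg build with libass.

```typescript
// Hypothetical inputs for one render variant; these names are not from the project.
interface RenderJob {
  videoPath: string;      // creator footage or support media
  narrationPath: string;  // narration audio track
  subtitlePath?: string;  // optional .srt file to burn into the frames
  outputPath: string;
}

// Builds the argument array for an ffmpeg invocation (to be handed to a
// process spawner). It only assembles strings and does not run anything.
function buildFfmpegArgs(job: RenderJob): string[] {
  const args: string[] = ["-y", "-i", job.videoPath, "-i", job.narrationPath];
  if (job.subtitlePath) {
    // Assumes a path with no characters that need filter-level escaping.
    args.push("-vf", "subtitles=" + job.subtitlePath);
  }
  args.push(
    "-map", "0:v:0",   // video stream from the first input
    "-map", "1:a:0",   // audio stream from the narration input
    "-c:v", "libx264",
    "-c:a", "aac",
    "-shortest",       // stop when the shorter stream ends
    job.outputPath
  );
  return args;
}

// Usage: one variant with burned-in captions.
const exampleArgs = buildFfmpegArgs({
  videoPath: "clip.mp4",
  narrationPath: "voice.mp3",
  subtitlePath: "captions.srt",
  outputPath: "variant-a.mp4",
});
```

Building the arguments as an array, rather than one shell string, avoids quoting problems when the list is passed to a process spawner.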

Challenges I Ran Into

1. Managing multi-agent workflow state

State needs to persist across steps so specialist agents can use consistent inputs and outputs.
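A minimal sketch of one way to keep state consistent across steps: each agent's output is recorded under its own key, and the whole state can be serialized and restored. The PipelineState shape and the helper names are assumptions. In the real project the serialized state would presumably be stored through Prisma rather than kept as an in-memory string.

```typescript
// Hypothetical state shape: one entry per agent output, plus a simple log.
interface PipelineState {
  runId: string;
  outputs: Record<string, unknown>;
  log: string[];
}

// Returns a new state with the agent's output recorded; the input is not mutated,
// so earlier snapshots stay valid for debugging or retries.
function recordOutput(state: PipelineState, agent: string, output: unknown): PipelineState {
  return {
    runId: state.runId,
    outputs: { ...state.outputs, [agent]: output },
    log: state.log.concat(agent + " completed"),
  };
}

// Serialize and restore so a run can resume later. Assumes outputs are JSON-safe.
function saveState(state: PipelineState): string {
  return JSON.stringify(state);
}

function loadState(raw: string): PipelineState {
  return JSON.parse(raw) as PipelineState;
}

// Usage: record an ideation result, persist it, and resume from the saved copy.
const initial: PipelineState = { runId: "run-1", outputs: {}, log: [] };
const afterIdeation = recordOutput(initial, "ideation", { title: "Example concept" });
const resumed = loadState(saveState(afterIdeation));
```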

2. Managing media pipelines

Different media formats and asset sources need consistent handling so downstream rendering doesn't break.
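One way to get consistent handling is to normalize every uploaded path into a small descriptor before any planning or rendering step sees it, and to reject unsupported files early. The sketch below assumes classification by file extension. The extension table and all names are illustrative, not the project's actual rules.

```typescript
type MediaKind = "video" | "image" | "audio";

interface NormalizedAsset {
  path: string;
  kind: MediaKind;
  extension: string; // lower-cased, without the dot
}

// Illustrative extension table, not an exhaustive or project-specific list.
const KIND_BY_EXTENSION: Record<string, MediaKind> = {
  mp4: "video", mov: "video", webm: "video",
  jpg: "image", jpeg: "image", png: "image",
  mp3: "audio", wav: "audio", m4a: "audio",
};

// Returns a normalized descriptor, or null for unsupported or extensionless paths.
function normalizeAsset(path: string): NormalizedAsset | null {
  const dot = path.lastIndexOf(".");
  if (dot < 0 || dot === path.length - 1) return null;
  const extension = path.slice(dot + 1).toLowerCase();
  // Own-property check so names like "constructor" are not treated as known types.
  if (!Object.prototype.hasOwnProperty.call(KIND_BY_EXTENSION, extension)) return null;
  return { path, kind: KIND_BY_EXTENSION[extension], extension };
}

// Splits uploads into usable assets and rejected paths, so bad inputs are
// reported before rendering instead of failing in the middle of it.
function partitionAssets(paths: string[]): { accepted: NormalizedAsset[]; rejected: string[] } {
  const accepted: NormalizedAsset[] = [];
  const rejected: string[] = [];
  for (const p of paths) {
    const asset = normalizeAsset(p);
    if (asset) accepted.push(asset);
    else rejected.push(p);
  }
  return { accepted, rejected };
}

// Usage: mixed uploads are sorted into accepted assets and rejected paths.
const sorted = partitionAssets(["clips/Intro.MP4", "cover.png", "notes.txt"]);
```

A production version would more likely inspect the file contents, for example with ffprobe, since extensions can be wrong.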

3. Coordinating agent decisions

Actions like storyboarding, metadata, and publishing depend on decisions made earlier in trend discovery and ideation.
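These dependencies can be made explicit with a small prerequisite table that the orchestrator checks before running a step. The table below is a hypothetical reading of the pipeline described here (for example, that publishing needs both a render and metadata). It is a sketch, not the project's actual dependency graph.

```typescript
// Hypothetical dependency table: each step lists the steps whose outputs it needs.
const REQUIRES: Record<string, string[]> = {
  trendDiscovery: [],
  ideation: ["trendDiscovery"],
  storyboard: ["ideation"],
  render: ["storyboard"],
  metadata: ["ideation"],
  publish: ["render", "metadata"],
};

// Returns the prerequisites of a step that have not completed yet.
function missingPrerequisites(step: string, completed: string[]): string[] {
  if (!Object.prototype.hasOwnProperty.call(REQUIRES, step)) {
    throw new Error("Unknown step: " + step);
  }
  return REQUIRES[step].filter((dep) => completed.indexOf(dep) === -1);
}

// Lists every step that has not run yet and whose prerequisites are all satisfied.
function runnableSteps(completed: string[]): string[] {
  return Object.keys(REQUIRES).filter(
    (step) =>
      completed.indexOf(step) === -1 &&
      missingPrerequisites(step, completed).length === 0
  );
}

// Usage: once ideation is done, storyboarding and metadata can both proceed.
const ready = runnableSteps(["trendDiscovery", "ideation"]); // ["storyboard", "metadata"]
```

A check like this also gives a clear error when a step such as publishing is requested before its inputs exist.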

Why CreatorPilot Matters

CreatorPilot demonstrates how AI agents can move beyond chat interfaces and become active collaborators in creative workflows.

Instead of generating isolated outputs, the system acts as a production assistant that plans, generates, and assembles a full video package end-to-end.

What's next for CreatorPilot

CreatorPilot is currently a local-first, single-instance prototype with explicit orchestration, specialized agents, creator memory, persisted render outputs, and visible agent-level workflow logging. The implementation is intentionally monolithic and demo-oriented, which makes it practical for rapid iteration and hackathon presentation. Logical next steps include:

Multi-user and session-aware isolation instead of the current single local-user workflow
Stronger memory summarization and performance-aware feedback from completed renders and publishing outcomes
Smarter orchestrator decision policies for choosing between trend-led and media-led strategies
Better ranking signals for idea quality, render quality, and publishing recommendations
Richer observability for agent decisions, tool usage, and failure recovery
Optional distributed execution if render or upload workloads eventually outgrow the current in-process job model

Built With

ffmpeg
gemini
google-cloud
google-cloud-youtube
googleapis
next.js
prisma
rss-parser
sqlite
tailwind
typescript
vue

Updates

Priyam Singhal started this project — Mar 18, 2026 04:20 PM EDT
