Inspiration
I’ve always been captivated by the clarity of creators like 3Blue1Brown, who turn impenetrable equations into intuitive visual stories. However, creating high-quality mathematical animations with Manim is notoriously difficult, requiring both deep subject-matter expertise and precise coding skills. I wanted to build a tool that makes this "explainer" magic accessible to everyone.
Chalkline was born from a simple question: Can a multi-agent system think from first principles and then "draw" its reasoning in real-time?
What it does
Chalkline is an AI-powered "explainer engine" that transforms complex technical topics into clear, 2D animations. it performs a deep logical breakdown of a subject to ensure the final output is educationally sound and visually intuitive.
How we built it
Chalkline is built on a Hierarchical Multi-Agent Architecture powered by Gemini 3 . The Orchestrator: Acts as the "Director," managing the flow between specialized workers.
The Tutor Agent: This is the "Brain." It breaks down users request from First Principles. For example, when asked about the Pythagorean theorem, it doesn't just provide a formula; it reasons about the areas of squares: $$a^2 + b^2 = c^2$$.
The Manim Coder: A specialized agent that converts the Tutors's logic into modular Python code.
The Parallel Renderer: To solve the hardware bottleneck (rendering on 16GB RAM), I implemented a "Divide and Conquer" strategy. The orchestrator splits the script into individual scenes and "fans them out" to parallel workers for rendering, before "conquering" them back into a single video via FFmpeg.
Challenges we ran into
The "Hallucination" Gap: Initially, a single agent would try to write 500 lines of Manim code at once, leading to syntax errors. I solved this by forcing the Script agent to write a "Visual Roadmap" first, which the coder then follows scene-by-scene.
Hardware Constraints: Rendering complex 2D animations is CPU-heavy. My breakthrough was moving from a sequential local render to a distributed cloud-native approach, treating Google Cloud Run as a scalable "Animation Factory."
API Handoffs: Managing function calling between agents required precise prompt engineering to ensure the "Script Writer" could successfully hand off a "Visual Plan" to the "Coder."
Accomplishments that we're proud of
I am incredibly proud of the Hierarchical Multi-Agent Orchestration. Seeing the "Tutor Agent" successfully break down a concept into first principles, and then seeing the "Coder Agent" pick up that visual roadmap and execute it, was a "wow" moment.
Getting the parallel rendering pipeline to work was another huge win. By splitting the video into independent classes and stitching them with FFmpeg, I effectively turned a 20-minute render into a 4-minute "sprint," proving that agentic workflows can overcome physical hardware limitations.
What we learned
I learned that the future of education isn't just "generative" it's agentic. It’s about AI that doesn't just tell you the answer but builds a visual world to show you why it’s true.
What's next for Chalkline
The next steps for Chalkline include:
Dynamic Feedback Loops: Implementing an "Editor Agent" that watches the rendered video, catches visual bugs, and sends "Fix-it" prompts back to the Coder Agent.
Direct PDF-to-Video: Expanding the Physicist Agent’s ability to ingest massive textbooks and automatically generate an entire "First Principles" course syllabus as a series of videos.
Built With
- blaxel
- google-adk
- manim
- python
Log in or sign up for Devpost to join the conversation.