-
-
Self-Evolving Loop: Gemini 3 Flash for speed, Gemini 3 Pro for reasoning-based evaluation, and Redis persists the evolving behavioral state.
-
The system detected a hyperbole failure (top) and autonomously updated its primitives to generate a professional script seconds later
-
Behavioral primitives stored in Redis dictate personality, while Gemini 3 Pro's reasoning chains provide transparent auditing in W&B Weave.
GameScript AI
Inspiration
The Spark
We observed significant hallucinations in live AI sports commentary during NFL Weeks 14 through 18 of the 2025 season.
The Problem
We identified a critical flaw in high stakes AI deployment: real time errors in live broadcasts are public, irreversible, and damage viewer trust.
The Realization
Manual prompt engineering and expensive model retraining are insufficient for the zero retry requirements of live media.
The Solution
We shifted from static, stateless prompts to a multi agent cognitive architecture powered by the Neuron framework with persistent, self evolving behavioral primitives.
What It Does
Self Evolving Commentary
GameScript AI is a broadcasting system that autonomously improves its output quality without human intervention through a cognitive agent circuit.
Multi Agent Architecture
The system orchestrates five specialized Neuron agents (Topic In, Generate, Evaluate, Gate, Learn) that work together through a SynapticBus messaging system.
Dual Model Generation
Gemini 3 Flash powers the generation agent for sub second latency, while Gemini 3 Pro drives the evaluation agent for deep quality analysis. Supports bilingual output (English/Spanish).
Automated Quality Control
A 70% quality threshold is enforced by the Gate agent. If a script fails (for example excessive hyperbole or a factual error), it is rejected before broadcast.
Self Correction Through Learning
A dedicated LearningAgent identifies why a script failed and updates numeric behavioral primitives in Redis (the system's long term memory) to prevent recurrence.
Instant Regeneration
The circuit automatically triggers regeneration with updated constraints until production standards are met, all orchestrated by Neuron's confidence based routing.
How We Built It
Neuron Cognitive Framework
The foundation of the entire system. Neuron provides the agent orchestration layer, circuit definitions, SynapticBus for inter agent messaging, and episodic memory management.
Five Agent Circuit
- Topic In (ReflexAgent): Fast pattern matching for input ingestion with memory retrieval
- Generate (DeliberativeAgent): Deep reasoning for prompt construction and Gemini API calls
- Evaluate (ReflexAgent): Pattern based quality scoring with typed issue detection
- Gate (ReflexAgent): Threshold comparison and conditional routing logic
- Learn (LearningAgent): Failure pattern analysis and primitive weight updates
Gemini 3 Flash
Used for the generation layer to achieve sub second latency and ensure 99.9% reliable JSON output via structured schemas.
Gemini 3 Pro
Acts as the evaluation engine for deep reasoning, quality assessment, and failure pattern analysis.
Redis with Neuron Memory
Stores behavioral primitives such as hyperbole_control and fact_verification as persistent, mutable weights accessible through Neuron's episodic memory system.
Vercel Edge Runtime
Provides the infrastructure backbone for consistent performance under high load.
Note: Vercel was used instead of Google AI Studio because the maximum number of allowable projects had been reached.
Weights & Biases Weave
Enables full observability with a forensic audit trail of every agent decision, primitive mutation, and circuit transition without complex instrumentation.
Adaptive Mutation Logic
A custom algorithm in the LearningAgent scales primitive corrections (+0.10 to +0.15 per issue) based on failure severity detected during evaluation.
SynapticBus Messaging
After each episode completes, the circuit broadcasts results to all agents through Neuron's event system, maintaining coordination across the agent network.
Challenges We Ran Into
Latency vs Accuracy
Balancing sub 2 second live commentary with deep reasoning compute. Solved by splitting tasks between a fast DeliberativeAgent (Gemini 3 Flash) and a thorough ReflexAgent evaluator (Gemini 3 Pro).
Malformed Outputs
Early JSON inconsistencies were eliminated by switching to Gemini 3 Structured Output Mode and implementing proper error handling in the agent circuit.
Balancing Personality
Suppressing hyperbole often made commentary too flat. The LearningAgent's mutation logic was tuned to find the Goldilocks zone between engaging and accurate.
State Management
Moving from stateless prompts to stateful memory required integrating Neuron's episodic memory system with Redis for persistent primitive storage across serverless invocations.
Agent Coordination
Ensuring proper message flow and routing between five agents required careful CircuitDefinition design and implementing Neuron's confidence based routing strategy.
Graceful Degradation
Implementing fallback logic (Gemini Pro to Flash) through Neuron's graceful degradation strategy to maintain 100% uptime even when individual models experience issues.
Accomplishments We're Proud Of
Super Bowl LIX Validation
Successfully deployed during Super Bowl LIX, serving 2000 content creators simultaneously with bilingual (English/Spanish) commentary and zero hallucinations.
Broadcast Speed
1.5 second end to end latency (including all five agent steps), meeting live TV constraints.
Autonomous Improvement
Gate pass rate improved from 62% to 79% through the LearningAgent's self optimization alone, with no human intervention.
Forensic Auditing
Every agent decision, primitive mutation, and circuit transition is fully traceable through W&B Weave, meeting high stakes industry requirements.
Circuit Reliability
Neuron's graceful degradation strategy achieved 100% uptime, automatically handling model fallbacks and edge cases.
What We Learned
Cognitive Architecture Matters
A multi agent system with specialized roles (reflex for speed, deliberative for reasoning, learning for adaptation) outperforms monolithic models.
Structure Is Safety
Structured outputs are production critical, especially when routing decisions between agents depend on well formed responses.
Two Brains Are Better
A fast generator plus a smart critic, orchestrated through a cognitive circuit, outperforms monolithic models.
Memory Matters
External state management through Neuron's episodic memory beats prompt engineering for long term behavior control.
Self Reflection Works
Modern LLMs can accurately analyze their own failures when given proper reasoning schemas and a dedicated LearningAgent to coordinate the process.
Agent Orchestration Simplifies Complexity
Neuron's CircuitDefinition and SynapticBus eliminated hundreds of lines of manual routing code and made the system easier to reason about.
Confidence Based Routing Scales
Neuron's routing strategy allowed the circuit to automatically adapt flow based on evaluation confidence without hardcoded thresholds.
What's Next for GameScript AI
Global Sports Expansion
Extending beyond NFL to NBA, MLB, NHL, and international soccer leagues with sport specific agent capabilities.
Expanded Agent Network
Adding CoordinatorAgent instances to manage multi game parallel processing and resource allocation across concurrent broadcasts.
Multimodal Ingest
Integrating Gemini 3 Vision into the Topic In agent to extract scoreboard data and context from video feeds.
Advanced Circuit Topologies
Experimenting with parallel evaluation paths and consensus mechanisms using multiple Evaluate agents with different specializations.
Global Expansion
Multi language support including Portuguese, Japanese, French, and German with culture specific primitives stored in Neuron's semantic memory.
Voice Synthesis
Mapping behavioral primitives directly to speech parameters such as pace and energy through a new SpeechAgent in the circuit.
Enterprise SaaS
Adapting the self correcting Neuron architecture for finance, healthcare, and legal sectors with domain specific agent capabilities.
Circuit Designer UI
Building a visual interface for non technical users to design custom agent circuits and behavioral primitives without coding.
Federated Learning
Enabling multiple GameScript AI instances to share learned primitives through Neuron's memory synchronization protocols.
Technical Architecture
Agent Circuit Flow
Topic In (ReflexAgent)
↓
Generate (DeliberativeAgent, Gemini Flash)
↓
Evaluate (ReflexAgent, Gemini Pro)
↓
Gate (ReflexAgent, threshold check)
├─ Pass → Final Output
└─ Fail → Learn (LearningAgent)
↓
Update Primitives
↓
Regenerate (back to Generate)
Neuron Components Used
CircuitDefinition.create(): Declares the five agent topologyBaseAgent: Foundation for all custom agentsReflexAgent: Fast pattern matching (Topic In, Evaluate, Gate)DeliberativeAgent: Deep reasoning (Generate)LearningAgent: Adaptation and evolution (Learn)SynapticBus: Inter agent messaging and event broadcastingcore.memory_manager: Episodic memory for primitive persistencecore.circuit_designer: Circuit compilation and deploymentcore.neuro_monitor: Real time circuit health monitoring
Impact
For Broadcasters
A system that gets better with every broadcast, eliminating the need for constant human supervision and manual prompt tuning.
For Viewers
More accurate, engaging commentary that maintains trust while delivering the excitement they expect.
For AI Development
A proven architecture showing that cognitive multi agent systems can achieve production reliability in zero retry, high stakes environments.
For the Industry
A blueprint for self improving AI systems that combine the speed of Gemini Flash, the intelligence of Gemini Pro, and the orchestration power of Neuron.
Conclusion
GameScript AI demonstrates that the future of production AI is not bigger models or more complex prompts. It's cognitive architectures with specialized agents, persistent memory, and autonomous learning, all orchestrated through frameworks like Neuron that handle the complexity of multi agent coordination.
By combining Gemini's speed and intelligence with Neuron's orchestration capabilities, we've built a system that doesn't just generate content; it learns, adapts, and gets better every single time it runs.
This is AI that thinks, learns, and improves, just like we do.
Note: Vercel was used instead of Google AI Studio because the maximum number of allowable projects had been reached.
Built With
- gemini3
- neuron
- redis
- vercel
Log in or sign up for Devpost to join the conversation.