Inspiration
The global power grid is struggling. We've got way more renewables now, but that means more unpredictability, storage headaches, and old infrastructure that just can't keep up. Traditional systems are too rigid for the dynamic, decentralized grid of the future.
We were inspired by recent advances in Reinforcement Learning (RL) and multi-agent systems. We thought: what if the grid could learn to balance itself? We wanted to build a system where AI agents don't just automate decisions, they actively collaborate to maintain real-time reliability, maximize renewable use, and cut emissions. This is us reimagining grid balancing.
What It Does
Meet our prototype: an AI-powered hybrid grid balancer built with multi-agent reinforcement learning (specifically, MADDPG with centralized training and decentralized execution (CTDE), plus self-attention).
It's basically a highly-coordinated team of AI agents:
- Supply agents (solar, wind, thermal)
- Storage agents (battery, hydro)
- Flexible demand agents
They all coordinate in real time to meet demand, minimize blackouts, and use as much green energy as possible, keeping the grid sustainable.
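To make the coordination concrete, here is a heavily simplified sketch of what one balancing timestep looks like. This is an illustrative toy, not our MADDPG policy: the function name, agent set, and numbers are all hypothetical, and in the real system each quantity would be an action chosen by a learned agent.

```python
# Illustrative single-timestep grid balance (hypothetical simplification):
# supply, storage, and flexible-demand agents each contribute, and the
# resulting imbalance is what the learned policies work to minimize.

def balance_step(solar, wind, thermal, battery_dispatch, flexible_demand_cut, demand):
    """Return (imbalance, renewable_share) for one timestep, all in MW."""
    # battery_dispatch > 0 means discharging into the grid, < 0 means charging
    supply = solar + wind + thermal + battery_dispatch
    effective_demand = demand - flexible_demand_cut  # demand agents shed load
    imbalance = supply - effective_demand            # 0 is a perfectly balanced grid
    renewable_share = (solar + wind) / supply if supply > 0 else 0.0
    return imbalance, renewable_share

# Example: 40 MW solar, 25 MW wind, 20 MW thermal, battery discharging 5 MW,
# 2 MW of flexible demand shed, against 88 MW of raw demand.
imb, share = balance_step(40.0, 25.0, 20.0, 5.0, 2.0, 88.0)
```

In the full system, the agents' reward shaping penalizes |imbalance| (blackout risk) and rewards renewable_share, which is what drives the coordinated behavior.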
We ran a one-week energy simulation, and our agent-based system consistently outperformed conventional rule-based algorithms. You can see the whole thing in action (agent interactions, grid status, and energy flows) on our front end: https://visionary-sfogliatella-cbdabc.netlify.app/.
Challenges We Faced
- Grid Realism: It's tough to capture the insane variability of renewables, demand, and storage in a simple demo. We tackled this by designing representative, focused synthetic scenarios for our prototype.
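For a sense of what "representative synthetic scenarios" means, here is a minimal sketch of the kind of daily profile generator one might use. The shapes, magnitudes, and noise levels below are illustrative assumptions, not our actual scenario data.

```python
import math
import random

def synthetic_day(seed=0, hours=24):
    """Generate one day of hourly solar output and demand (illustrative MW scale).

    Assumptions (not real grid data): solar follows a half-sine between
    06:00 and 18:00 with Gaussian cloud noise; demand is a flat baseline
    with morning and evening peaks.
    """
    rng = random.Random(seed)
    solar, demand = [], []
    for h in range(hours):
        # Solar: half-sine daylight curve, clipped at zero, plus cloud noise.
        s = max(0.0, math.sin(math.pi * (h - 6) / 12)) * 50.0
        s = max(0.0, s + rng.gauss(0.0, 4.0))
        # Demand: 60 MW baseline with Gaussian bumps around 08:00 and 19:00.
        d = 60.0 + 15.0 * math.exp(-((h - 8) ** 2) / 8.0) \
                 + 20.0 * math.exp(-((h - 19) ** 2) / 8.0)
        solar.append(s)
        demand.append(d)
    return solar, demand

solar, demand = synthetic_day()
```

Varying the seed and the peak parameters yields a family of focused scenarios (cloudy days, demand spikes) for stress-testing the agents.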
- Theory to Code: Taking complex MARL concepts (and that cool self-attention coordination!) and making them work in practice was a huge lift. Lots of simplifying, parameter tuning, and late-night debugging went into this.
- Visualizing the Chaos: How do you make multi-agent coordination look intuitive and clear? Building the front end required thoughtful UI/UX and non-stop communication between team members with completely different skill sets.
- Demo Data: We used simulated data, not live grid feeds, for this initial prototype. We had to be strategic to make sure our demo was still relevant and clearly showed the power of our approach.
How We're Different
- We ditch static, rule-based systems. Our system's policies adapt over time, using self-attention to dynamically focus on the most influential agents and grid states.
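The self-attention idea can be sketched with standard scaled dot-product attention over per-agent state embeddings. This is a minimal NumPy illustration of the mechanism itself; the dimensions and random weight matrices are stand-ins, not our trained critic.

```python
import numpy as np

def self_attention(states, Wq, Wk, Wv):
    """Scaled dot-product self-attention across agents.

    states: (n_agents, d) matrix of per-agent state embeddings.
    Returns (n_agents, d) context vectors: each agent's row is a softmax-
    weighted mix of all agents' values, so the critic can weight the most
    relevant peers more heavily at each step.
    """
    Q, K, V = states @ Wq, states @ Wk, states @ Wv
    scores = Q @ K.T / np.sqrt(K.shape[1])            # (n, n) pairwise relevance
    weights = np.exp(scores - scores.max(axis=1, keepdims=True))
    weights /= weights.sum(axis=1, keepdims=True)     # row-wise softmax
    return weights @ V

# Illustrative sizes: 6 agents (3 supply, 2 storage, 1 demand), 8-dim states.
rng = np.random.default_rng(0)
n, d = 6, 8
states = rng.normal(size=(n, d))
Wq, Wk, Wv = (rng.normal(size=(d, d)) for _ in range(3))
ctx = self_attention(states, Wq, Wk, Wv)
```

In a MADDPG-style centralized critic, these context vectors would replace a naive concatenation of all agents' observations, which is what lets the attention weights shift toward whichever agents matter most in the current grid state.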
- We're proving that multi-agent RL can automate incredibly complex balancing decisions. This is a clear roadmap to achieving higher renewable utilization, lower emissions, and ultimately, a smarter, more resilient grid.
What's Next
We're just getting started! Our next steps include:
- Refining our simulation with more realistic grid models and much larger networks of agents.
- Adding options for live data integration and more advanced ML training capabilities.
- Expanding the front end with detailed visualizations and scenario controls for grid operators and stakeholders.
- Working with energy experts to explore pilot deployments beyond this idea!
Built With
- maddpg
- marl
- self-attention
