Aegis AI: Strategic Multimodal Crisis Response Framework
1. Inspiration: The Information Gap in Crisis
The inspiration for Aegis AI stems from a critical observation of modern emergency response: Information is abundant, but clarity is scarce. During a crisis—be it a wildfire, a flash flood, or a structural collapse—first responders are bombarded with fragmented data: frantic audio calls, shaky drone footage, social media reports, and erratic sensor telemetry.
We realized that the bottleneck in saving lives isn't a lack of data; it's the cognitive load required to synthesize that data into a strategic plan. We were inspired to build a "Mission Control" that acts as a cognitive force multiplier, leveraging Google Gemini's multimodal intelligence to turn chaos into actionable tactical directives.
2. The Problem: Multimodal Chaos
In high-stakes environments, decision-making suffers from Multimodal Asynchronicity.
The Analytical Gap
Traditional systems often process data in silos. A text-based reporting system doesn't "see" the smoke in a satellite image; an image-recognition system doesn't "hear" the urgency in a dispatcher's voice. This leads to a fragmented situational awareness where:
- Urgency is miscalculated: Without visual context, a text report might seem low-priority.
- Resources are misallocated: Static allocation models fail to account for the dynamic nature of evolving crises.
- Trust is eroded: Black-box AI models provide answers without "showing their work," making it difficult for human commanders to trust high-stakes recommendations.
Mathematically, the problem of resource allocation can be viewed as a multi-objective optimization problem under uncertainty:
$$ \min_{R} \sum_{i=1}^{n} (R_i - D_i)^2 + \lambda \cdot T_{\text{response}} $$
Where $R_i$ is the allocated resource, $D_i$ is the actual demand (often unknown), and $T_{\text{response}}$ is the time to action. Aegis AI aims to minimize this gap by using multimodal inference to better estimate $D_i$.
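The objective above can be sketched numerically. This is an illustrative sketch only: the function name, sample values, and default `lambda` are assumptions, not taken from the Aegis codebase.

```typescript
// Illustrative sketch of the allocation objective:
//   cost = Σ (R_i - D_i)^2 + λ · T_response
// All names and sample values here are hypothetical.
function allocationCost(
  allocated: number[],   // R_i: resources assigned per sector
  demand: number[],      // D_i: estimated demand per sector
  responseTime: number,  // T_response: time to action
  lambda = 0.5           // λ: weight on response time
): number {
  const mismatch = allocated.reduce(
    (sum, r, i) => sum + (r - demand[i]) ** 2,
    0
  );
  return mismatch + lambda * responseTime;
}

// Over-allocating sector 0 and under-allocating sector 1 both add cost:
const cost = allocationCost([5, 2], [3, 4], 10); // (5-3)^2 + (2-4)^2 + 0.5*10 = 13
```

Better estimates of `demand` (the $D_i$ term) shrink the quadratic mismatch, which is exactly where multimodal inference helps.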
3. The Approach: Human-AI Strategic Synergy
Our approach is founded on the principle of Strategic Human-AI Collaboration. We did not build a system to replace the commander; we built one to augment the commander's strategic judgment.
Methodology
We implemented a four-stage pipeline:
- Multimodal Ingestion: Simultaneous processing of text and visual data.
- Contextual Synthesis: Using Gemini's large context window to correlate visual evidence with textual reports.
- Strategic Reasoning (XAI): Generating a natural language "Reasoning Path" that explains why a specific resource allocation was suggested.
- Dynamic Visualization: Providing a high-density dashboard that updates in real-time as new telemetry arrives.
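The four stages above can be sketched as a typed pipeline of transforms. Every type and function name below is hypothetical; this only illustrates the staged data flow, not the actual Aegis implementation.

```typescript
// Hypothetical sketch of the four-stage pipeline as typed transforms.
interface FieldInput { report: string; imageBase64?: string }
interface SynthesizedContext { report: string; visualFindings: string[] }
interface StrategicPlan { reasoningPath: string; allocation: number[] }

type Stage<A, B> = (input: A) => B;

// Compose stages left-to-right so the pipeline reads in execution order.
function pipeline<A, B, C>(f: Stage<A, B>, g: Stage<B, C>): Stage<A, C> {
  return (input) => g(f(input));
}

const ingest: Stage<FieldInput, SynthesizedContext> = (input) => ({
  report: input.report,
  // Placeholder finding; the real system derives this from the model.
  visualFindings: input.imageBase64 ? ["smoke plume detected"] : [],
});

const reason: Stage<SynthesizedContext, StrategicPlan> = (ctx) => ({
  reasoningPath: `Report: ${ctx.report}; visual: ${ctx.visualFindings.join(", ") || "none"}`,
  allocation: [25, 50, 15, 10], // [medical, fire, police, logistics] percentages
});

const run = pipeline(ingest, reason);
```

Typing each stage boundary keeps Contextual Synthesis and Strategic Reasoning decoupled, so either can be swapped without touching the dashboard.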
4. The Solution: Aegis AI Architecture
Aegis AI is a full-stack, multimodal intelligence platform designed for high-performance, real-time decision support.
Core Components
- The Ingestion Engine: A flexible interface that accepts raw field reports and visual context (images/drone feeds).
- The Reasoning Core: Powered by Gemini 3.1 Flash, this core performs high-level reasoning. It uses a custom system instruction set to act as a "Senior Strategic Coordinator."
- The XAI Dashboard: A specialized UI that visualizes the AI's internal logic. It includes:
- Tactical Directives: Step-by-step instructions for field units.
- Geospatial Tactical Overlay: A Leaflet-powered interactive map for real-time incident tracking.
- Resource Allocation Matrix: A Recharts-powered visualization of unit deployment.
- Environmental Telemetry: Real-time tracking of thermal, seismic, and grid metrics.
5. How It Works: The Pipeline
When a user interacts with Aegis AI, the following sequence occurs:
- Data Capture: The user inputs a text report (e.g., "Multiple structural fires in Sector 4") and optionally uploads an image (e.g., a drone shot of the area).
- Multimodal Inference: The data is sent to the Gemini API. The model performs a cross-modal analysis:
- Text Analysis: Identifies keywords like "structural fires" and "Sector 4."
- Visual Analysis: Detects smoke plumes, identifies road blockages, and estimates fire intensity.
- Strategic Calculation: The model calculates a resource allocation vector $\vec{V}$: $$ \vec{V} = [m, f, p, l] $$ where $m$ = Medical, $f$ = Fire, $p$ = Police, and $l$ = Logistics.
- UI Rendering: The dashboard updates instantly. The Resource Allocation Matrix shows the new vector, and the Tactical Directives provide the "Reasoning Path."
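Before the Resource Allocation Matrix renders the vector $\vec{V} = [m, f, p, l]$, it is useful to normalize it to percentages. The sketch below is hypothetical (field names and rounding are assumptions), but shows the shape of that step.

```typescript
// Normalize a raw [medical, fire, police, logistics] vector so the
// dashboard always displays percentages of the total.
// Field names are illustrative, not from the Aegis codebase.
interface ResourceVector {
  medical: number;
  fire: number;
  police: number;
  logistics: number;
}

function normalizeVector([m, f, p, l]: number[]): ResourceVector {
  const total = m + f + p + l;
  if (total <= 0) throw new Error("empty allocation vector");
  const pct = (x: number) => Math.round((x / total) * 100);
  return { medical: pct(m), fire: pct(f), police: pct(p), logistics: pct(l) };
}
```

Normalizing at this boundary means the charts never need to know whether the model emitted raw unit counts or percentages.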
6. Tech Stack: The Aegis Engine
We selected a stack that prioritizes performance, scalability, and developer velocity:
- Frontend Framework: React 19 (for cutting-edge hook support and performance).
- Styling: Tailwind CSS 4 (using the new `@theme` engine for a hardware-grade aesthetic).
- AI SDK: `@google/genai` (direct integration with Gemini 3.1 Flash and Pro).
- Animations: `motion/react` (for fluid state transitions and a "Mission Control" feel).
- Mapping: `Leaflet` & `react-leaflet` (for interactive geospatial tactical overlays).
- Visualization: `recharts` (for high-density, responsive data grids).
- Icons: `lucide-react` (for clean, technical iconography).
7. Challenges Faced: The "Hole in the Wall"
Building a crisis-response tool in a hackathon environment presented several "Principal Engineer"-level challenges:
- Multimodal Alignment: Ensuring that the AI correctly weighted visual evidence against text reports. We solved this through rigorous System Instruction Tuning, forcing the model to explicitly cite visual evidence in its reasoning.
- Latency vs. Accuracy: Real-time response requires low latency. We optimized this by using Gemini 3.1 Flash for the primary reasoning loop, which provides sub-second inference while maintaining high strategic accuracy.
- Data Serialization: Firestore and standard JSON structures have limits on complex nested data. We implemented a strict Schema Validation Layer using TypeScript interfaces to ensure the AI's output always matched the UI's expectations.
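A schema validation layer of this kind can be sketched as a runtime type guard that rejects malformed model output before it reaches the UI. The `AegisResponse` shape below is a simplified assumption, not the project's actual interface.

```typescript
// Sketch of a runtime guard over the model's JSON output.
// The AegisResponse shape is hypothetical.
interface AegisResponse {
  reasoningPath: string;
  allocation: { medical: number; fire: number; police: number; logistics: number };
}

function isAegisResponse(value: unknown): value is AegisResponse {
  if (typeof value !== "object" || value === null) return false;
  const v = value as Record<string, unknown>;
  if (typeof v.reasoningPath !== "string") return false;
  const a = v.allocation;
  if (typeof a !== "object" || a === null) return false;
  // Every unit field must be present and numeric.
  return ["medical", "fire", "police", "logistics"].every(
    (k) => typeof (a as Record<string, unknown>)[k] === "number"
  );
}
```

TypeScript interfaces alone vanish at runtime; a guard like this is what actually enforces the contract on data arriving from the model or Firestore.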
8. Lessons Learned: The Power of XAI
The most significant lesson we learned was that AI is only as good as its explainability. In early testing, the AI would suggest resource allocations that seemed counter-intuitive. However, once we added the Explainable AI (XAI) reasoning panel, we realized the AI was picking up on subtle visual cues (like a blocked road in the corner of an image) that a human might miss.
This taught us that in high-stakes Human-AI collaboration, the "Reasoning Path" is just as important as the "Result."
9. Future Scalability: From Local to Global
Aegis AI is designed to scale horizontally and vertically:
- Vertical Scaling: Integration with Vertex AI for fine-tuning on specific disaster datasets (e.g., historical wildfire patterns).
- Horizontal Scaling: Deploying Aegis as an Edge-to-Cloud system, where drone-side AI performs initial triage before sending high-priority data to the central Aegis Command.
- IoT Integration: Connecting directly to city-wide sensor grids (Smart Cities) to automate the "Environmental Telemetry" module with real-world data.
The Vision
Aegis AI isn't just a hackathon project; it's a blueprint for the future of Responsible AI Governance. By keeping humans in the loop and providing transparent, multimodal reasoning, we can build systems that don't just process data—they save lives.
10. Technical Deep Dive: The Aegis Reasoning Engine
To achieve "Hackathon-Winning" performance, we didn't just call an API; we engineered a Strategic Reasoning Pipeline.
10.1. Multimodal Contextualization
The Aegis engine uses a Context-Aware Prompting strategy. When an image is provided, the system instruction forces Gemini to perform a "Visual Triage" before generating the resource vector. This prevents the model from relying solely on text reports, which are often biased or incomplete.
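A "Visual Triage" prompt of this kind can be assembled as an ordered list of content parts. The part shape below mirrors the Gemini API's `text` / `inlineData` convention, but the instruction wording and function name are illustrative assumptions.

```typescript
// Sketch of assembling multimodal content parts so the model must triage
// the image before reading the text report. Wording is hypothetical.
type Part =
  | { text: string }
  | { inlineData: { mimeType: string; data: string } };

function buildVisualTriageParts(report: string, imageBase64?: string): Part[] {
  const parts: Part[] = [];
  if (imageBase64) {
    parts.push({ inlineData: { mimeType: "image/jpeg", data: imageBase64 } });
    // Ordering matters: the triage instruction precedes the text report.
    parts.push({
      text: "Perform a Visual Triage of the image above, citing specific visual evidence.",
    });
  }
  parts.push({ text: `Field report: ${report}` });
  return parts;
}
```

Because the triage instruction is only inserted when an image exists, text-only reports skip straight to strategic reasoning.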
10.2. The Resource Allocation Algorithm
Our resource allocation is modeled as a weighted priority queue. The AI calculates a Crisis Intensity Score (CIS) based on:
- Urgency (U): 0.25 (Low) to 1.0 (Critical)
- Population Density (P): Estimated from visual context.
- Hazard Level (H): Based on identified threats (Fire, Flood, etc.).
The final allocation for a unit $i$ is derived as: $$ A_i = \frac{W_i \cdot (U + P + H)}{\sum W_j} \times 100 $$ where $W_i$ is the unit-specific weight (e.g., Fire units have higher weight in a "Fire" hazard).
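The formula above can be implemented directly. The weight values in the example are made up for illustration; only the formula $A_i = W_i \cdot (U + P + H) / \sum W_j \times 100$ comes from the text.

```typescript
// Sketch of the weighted allocation A_i = W_i·(U+P+H)/ΣW_j × 100.
// Weight values and inputs below are illustrative.
interface UnitWeights { medical: number; fire: number; police: number; logistics: number }

function allocate(
  urgency: number,     // U: 0.25 (Low) to 1.0 (Critical)
  population: number,  // P: estimated from visual context
  hazard: number,      // H: based on identified threats
  weights: UnitWeights
): Record<string, number> {
  const intensity = urgency + population + hazard; // Crisis Intensity Score inputs
  const totalWeight = Object.values(weights).reduce((s, w) => s + w, 0);
  const out: Record<string, number> = {};
  for (const [unit, w] of Object.entries(weights)) {
    // Divide last to keep the arithmetic exact for integer weights.
    out[unit] = (w * intensity * 100) / totalWeight;
  }
  return out;
}
```

For a "Fire" hazard, giving fire units a higher $W_i$ (here, double the others) shifts the allocation toward them without any special-case logic.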
10.3. UI Performance Optimization
To ensure the "Mission Control" feel, we implemented several performance optimizations:
- Memoized Chart Rendering: Using `React.memo` on the Recharts components to prevent unnecessary re-renders during telemetry updates.
- Throttled Telemetry: The environmental sensors update on a 3-second interval, balancing a "live" feel with CPU efficiency.
- CSS Hardware Acceleration: Using Tailwind's `transform-gpu` and `will-change` utilities for the motion-based status badges.
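The interval throttling can be sketched as a small wrapper. The injectable clock is an assumption added here for deterministic testing; the real module may simply use a timer.

```typescript
// Sketch of interval-throttled telemetry: forward at most one reading
// per interval. The injectable clock and all names are illustrative.
type Clock = () => number;

function makeThrottled<T>(
  intervalMs: number,
  onUpdate: (reading: T) => void,
  now: Clock = Date.now
): (reading: T) => void {
  let lastEmit = -Infinity;
  return (reading: T) => {
    const t = now();
    if (t - lastEmit >= intervalMs) {
      lastEmit = t;
      onUpdate(reading); // Readings inside the interval are simply dropped.
    }
  };
}
```

Dropping intermediate readings (rather than queueing them) is the right trade-off here: only the latest telemetry matters to the dashboard.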
10.4. Geospatial Tactical Overlay (GTO)
The GTO is a high-performance mapping module built with Leaflet and React-Leaflet. It provides:
- Dynamic Filtering: Users can filter incidents by type (Fire, Medical, Police) and minimum urgency level (Low to Critical). This is implemented using a memoized filtering logic to ensure smooth map updates.
- Intelligent Map Navigation: When filters are applied, the map automatically performs a smooth fly-to animation. It zooms and pans to fit the cluster of filtered incidents or centers precisely on a single incident if only one matches.
- Heatmap Visualization: Using `leaflet.heat`, we visualize the density of sensor data points. This provides an immediate visual cue for "hotspots" where sensor activity is highest, often correlating with the epicenter of a crisis.
- Custom SVG Markers: To maintain the "Hardware" aesthetic, we replaced standard Leaflet markers with custom SVG-based `divIcon` elements that feature a glowing pulse effect.
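The memoized filtering behind Dynamic Filtering can be sketched as a pure function with a last-result cache, so an unchanged filter returns the same array reference and the map layer skips the update. All names and shapes below are illustrative, not the project's actual code.

```typescript
// Sketch of GTO incident filtering with last-result memoization.
// Types and field names are hypothetical.
type IncidentType = "Fire" | "Medical" | "Police";
interface Incident { id: string; type: IncidentType; urgency: number } // 0.25–1.0

function makeIncidentFilter() {
  let lastKey = "";
  let lastInput: Incident[] | null = null;
  let lastResult: Incident[] = [];
  return (incidents: Incident[], types: IncidentType[], minUrgency: number): Incident[] => {
    const key = `${[...types].sort().join(",")}|${minUrgency}`;
    // Same data and same filters: return the cached reference so the
    // map layer sees referential equality and skips re-rendering.
    if (incidents === lastInput && key === lastKey) return lastResult;
    lastInput = incidents;
    lastKey = key;
    lastResult = incidents.filter(
      (i) => types.includes(i.type) && i.urgency >= minUrgency
    );
    return lastResult;
  };
}
```

Referential equality is the cheap signal React (and `react-leaflet` layers) use to decide whether anything changed, which is why caching the array itself, not just its contents, keeps map updates smooth.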
11. Conclusion: The Aegis Legacy
Aegis AI represents a paradigm shift in emergency response. By combining Google Gemini's multimodal intelligence with a Human-Centric Strategic Framework, we have created a platform that is not only technically superior but ethically grounded and practically applicable.
Through this collaboration, we have demonstrated that AI is most powerful when it acts as a transparent partner in human decision-making. Aegis AI is ready for the Weber State AI Hackathon, and more importantly, it is ready for the real world.
Built With
- css
- geminiapi
- html
- python
- react
- typescript