UrbanMind

Inspiration

Cities are growing faster than planners can keep up. Tampa alone faces rising flood risk, aging infrastructure, and a booming population that demands smarter energy, mobility, and land use. Traditional urban planning is siloed -- solar analysts don't talk to traffic engineers, flood modelers work separately from cybersecurity teams. We wanted to build a system where AI agents handle each domain simultaneously and present a unified, defensible city plan in seconds.

What it does

UrbanMind is a multi-agent AI platform that generates optimal city layouts for Tampa 2050. Users ask natural language questions through a chat interface overlaid on a Google Maps 3D view of Tampa. Four specialized AI agents run in parallel:

CityRAG retrieves context from Tampa planning documents using vector search
GeoSense analyzes 4 zones across solar potential, flood risk, traffic congestion, EV infrastructure, and accident rates
Visionary generates top-down city layout maps using Google Imagen showing building footprints, solar arrays, green corridors, and road networks
GridShield scans proposed smart city infrastructure for SCADA and IoT cybersecurity vulnerabilities

The AI highlights the recommended zone on the 3D map, defends its choice with specific data comparisons, and generates a city masterplan layout. Users can then dive into close-up views of solar arrays, EV networks, green corridors, and smart roads.

How we built it

Frontend: React 19 + TypeScript + Vite, Tailwind CSS, Framer Motion, Google Maps Photorealistic 3D Tiles, Recharts
Backend: FastAPI with WebSocket streaming for real-time agent status updates
AI: Google Gemini 2.5 Flash for reasoning and synthesis, Google Imagen for city layout generation, ChromaDB for RAG vector search
Architecture: Async multi-agent orchestration -- CityRAG and GeoSense run sequentially to establish context, then Visionary and GridShield run in parallel, followed by a Gemini synthesis step that produces a persuasive, data-backed response

Challenges we ran into

Agent coordination: Getting 4 agents with different latencies to stream status updates to the frontend in real-time without blocking each other required careful async orchestration with WebSocket chunked responses
Making the AI persuasive: Early responses were generic. We had to engineer the synthesis prompt to include zone-by-zone data comparisons so the AI could defend recommendations with specific metrics like congestion indices and accident rates
3D map integration: Google Maps 3D Tiles require a specific Map ID configuration and the alpha Maps3D API. Getting the camera to fly smoothly to recommended zones took iteration on the coordinate and tilt parameters

Accomplishments that we're proud of

The AI genuinely defends its recommendations -- ask "why not Ybor City?" and it will zoom to Ybor, acknowledge its strengths (8/10 solar), then explain why its flood risk of 5/10 and congestion index of 5.4 make it inferior to New Tampa's 3/10 and 3.2 respectively
Real-time multi-agent orchestration with live status streaming -- users watch each agent activate, process, and complete
Top-down city layout generation that shows actual building footprints, solar placement, green corridors, and road networks as a professional urban planning visualization
The entire 4-agent pipeline completes in under 30 seconds

What we learned

Multi-agent systems are powerful but coordination is the hard part -- knowing which agents can run in parallel vs sequentially is critical for perceived performance
Geospatial AI needs structured data grounding, not just LLM generation -- combining RAG with hard-coded zone metrics prevented hallucinated coordinates and scores
Users trust AI recommendations significantly more when they can see the 3D map zoom to the location AND read the data-driven justification side by side
Prompt engineering for image generation is fundamentally different from text -- getting Imagen to produce consistent top-down layout maps required very specific style directives

What's next for UrbanMind

Real sensor integration: Connect to Tampa's live IoT sensor network for real-time solar irradiance, traffic flow, and flood gauge data instead of mock data
Google Solar API: Overlay real per-building solar potential data directly on the 3D map
3D layout generation: Use AI to generate interactive 3D city models rather than 2D top-down images
Multi-city expansion: The agent architecture is city-agnostic -- swap the zone data and RAG documents for any city
Community feedback: Let residents vote on AI-proposed layouts and feed preferences back into the optimization
Woolpert integration: Connect to Woolpert's LiDAR point cloud data and GIS platforms for production-grade geospatial accuracy

Built With

chromadb
fastapi
framer-motion
google-adk
google-gemini-2.5-flash
google-imagen
google-maps-3d-tiles-api
leaflet.js
lucide-react
python
react
recharts
tailwind-css
typescript
vite
websocket

Updates

Harshith Gujjeti started this project — Apr 11, 2026 04:58 PM EDT

Leave feedback in the comments!

Log in or sign up for Devpost to join the conversation.