Project goal: To see how AI can bridge the gap between digital convenience and physical enterprise spaces.

The main mission is to engineer a functional agent that solves a real-world challenge—specifically targeting industrial and commercial environments to eliminate energy inefficiencies.

Spatial Agent uses tools and capabilities to accomplish tasks (e.g., managing a local database, automating high-fidelity engineering workflows, and interacting with live database services).

My system can handle complex goals: the agent plans the steps and uses the tools at its disposal to finish the job.

Partner Power: Supported by the MongoDB for Startups program, my solution demonstrates a meaningful integration with MongoDB using MCP to give the agent its "superpowers" through high-performance data grounding.

For the build, I am using Google Cloud Agent Builder (rapid prototyping, building, and scaling).

Spatial Agent is a functional agent—powered by Gemini and Google Cloud Agent Builder—that integrates a MongoDB MCP server to solve a real industrial challenge.

💥 Inspiration

Traditional architectural evaluations are slow, disconnected, and lack real-time optical feedback. When designing spaces, architects have to manually calculate lux deficits, spatial efficiency, and lighting coefficients across static drawing boards and local databases. I wanted to bridge the digital convenience of generative AI with high-fidelity physical blueprint feedback. "Spatial Optician" was born to automate architectural visual analysis by letting an intelligent agent reason, plan, and analyze blueprints directly inside an immersive dashboard. I decided to build a specialized B2B vertical—Spatial Optician—to solve the architectural lighting crisis.

📐 What it does

Spatial Optician is a complete visual calibration and spatial dynamics console.

  1. Interactive Blueprint Workspace: A stunning, retro-futuristic glassmorphic blueprint interface featuring handwriting typography ("Architects Daughter") and pixel-perfect engineering grids.

  2. Intelligent Spatial Agent: Powered by Gemini and Google Cloud Agent Builder, my agent performs multi-step missions: analyzes uploaded site photos, extracts depth buffers, and evaluates lighting anomalies.

  3. Real-time Calibrations: Calculates optical scales, environmental factors (diffusion & Rayleigh scattering), and lux deficits.

  4. MongoDB Grounding & Self-Healing Catalog (Partner Power): Supported by MongoDB for Startups, my agent interacts directly with a MongoDB cluster via MCP. If a requested light fixture is missing from the database, the agent automatically executes a live web search to find exact technical specifications, dynamically writes the new fixture back into the MongoDB catalog, and completes the ROI calculations on the fly.

🛠️ How I built it

  • Frontend: React, TypeScript, Vite, and Tailwind CSS v4 for the highly-interactive design. Powered by Spatial Engine core components for highly-interactive engineering grids and spatial visualization.

  • Animations: Framer Motion for smooth blueprint grids transitions.

  • Backend: FastAPI (Python) exposing spatial analysis and agentic reasoning endpoints.

  • Agent Core & Compliance: Built fully within the Google Cloud Agent Builder ecosystem using the official Google ADK (Agent Development Kit) in Python to coordinate the reasoning loops, sub-agents, and MCP tool execution.

  • Database & Integrations: MongoDB Atlas (powered by the MongoDB for Startups tier) and local instances, bridged via a custom TypeScript-based Model Context Protocol (MCP) Server.

🧠 Challenges I ran into

  • Environment Caching: Synchronizing path configurations and environment variables inside sandboxed shell runners during local setup. I bypassed this by utilizing uv for seamless Python workspace environment management.

  • Tailwind v4 Monorepo Config: Aligning path aliases and custom grid backgrounds in Tailwind v4 without traditional tailwind config files. I achieved this via raw css theme overlays (@theme) directly in index.css.

  • MCP Communication: Ensuring robust JSON-RPC communication between the Agent Builder and my MongoDB instance through safe MCP tooling within a unified cloud environment.

🏆 Accomplishments that I am proud of

  • Implementing a self-healing database feedback loop: building an agent that autonomously expands its own MongoDB product catalog when faced with missing technical specifications, using real-time search data and secure writebacks.

  • Designing an interface that wows at first glance, making static code look like a premium, active architectural terminal.

  • Fully automating the schema inspection, querying, and aggregate pipelines of MongoDB via a custom TypeScript MCP Server.

  • Zero manual environment setup requirement on the backend thanks to modern uv run packaging.

🎓 What I learned

I learned the immense power of the Model Context Protocol (MCP) in making LLMs "database-aware." Instead of writing brittle database API wrappers, I let Gemini autonomously reason about schemas, perform paginated queries, and inspect collections.

🔮 What's next for Spatial Optician

  • Evolve the visual analysis pipeline into automated 3D Digital Twin generation, using dedicated spatial AI models to process complex room geometries.

  • Scale the unified Cloud Run architecture globally with advanced API key authentication for enterprise B2B clients.

  • Expand the dashboard to support multi-layered CAD drawings rendering.

Built With

Share this project:

Updates