Dynamic AI Agentic Ecosystem on GKE

💡 Inspiration & Vision

My inspiration for this project began with a question: How can we evolve standard e-commerce sites from static catalogs into intelligent, conversational partners? I saw an opportunity to build more than just an AI feature; I set out to create a truly dynamic, manageable, and scalable AI agentic ecosystem. My goal was to design a platform that could bring intelligent assistants to any website in a non-invasive way, all orchestrated on the power and scalability of GKE.

✨ What It Does

This project transforms the standard Online Boutique into an interactive, personalized shopping experience powered by a team of AI agents. The core of my submission is a complete, cloud-native platform that includes:

An Injectable AI Assistant Widget: A single <script> tag, hosted on GCS, adds a floating AI assistant to the Online Boutique product pages. It offers real-time, conversational advice, powered by a Preact and TypeScript frontend.
A Dynamic & Asynchronous Agent Backend: Running on GKE Autopilot, a ProxyAgent intelligently handles user conversations. For complex tasks, it delegates them asynchronously via Pub/Sub to a pool of specialist agents, ensuring the user experience is always fast and responsive.
A Real-time Management Dashboard: A comprehensive Vue 3 frontend that allows for the dynamic creation of new AI agents, management of the RAG knowledge base, and live monitoring of all AI tasks and their token consumption via a real-time connection to Firestore.

🏗️ How I Built It

This project is a cloud-native application built from the ground up on Google Cloud, using a unique Human-AI collaborative development methodology.

Backend: The core is an event-driven system of Python microservices built with FastAPI and CrewAI. This system runs on a GKE Autopilot cluster, leveraging Workload Identity for secure, keyless access to GCP services. Pub/Sub acts as the Message Coordination Platform (MCP) for asynchronous Agent-to-Agent (A2A) tasking, and Firestore provides real-time state persistence.
Intelligence: All AI capabilities are powered by Google's Gemini models via Vertex AI. I implemented a RAG (Retrieval-Augmented Generation) pipeline using Vertex AI Vector Search to ensure all agent responses are grounded in factual, up-to-date information crawled from specified websites.
Human-AI Collaboration: This project was a true partnership. As the human architect (Aaron Chuang), I was responsible for the core vision, architectural decisions, and final code review/refactoring. My AI partners (Gemini 2.5 Pro and Claude 3 Sonnet) acted as a pair-programming team, generating the majority of boilerplate code, writing technical specifications, and assisting in debugging complex cloud-native issues.

🚧 Challenges & Architectural Pivots

The primary challenge was architecting and implementing a highly elastic, manageable AI agent platform within the hackathon's tight timeframe.

My initial concept was a fully asynchronous model where a central Orchestrator handled all requests. While scalable, I quickly realized this could introduce unacceptable latency for real-time user conversations.

Therefore, I made a critical architectural pivot to a more sophisticated Proxy/Orchestrator pattern. A lightweight Proxy Agent now handles all user-facing conversations for instant interaction, while complex tasks are asynchronously delegated. This refactoring cost precious time and meant I couldn't fully implement the multi-agent delegation feature, but it was the right decision to prioritize a seamless user experience and build a more robust foundation. This hands-on process of building, testing, and making tough architectural decisions under pressure was the most significant challenge and learning experience.

✅ Accomplishments I'm Proud Of

Frankly, what I'm most proud of is successfully building and deploying this complete, end-to-end cloud-native AI platform from scratch. I didn't just build a single feature; I built an entire ecosystem. This includes:

A real-time monitoring dashboard built with Vue 3.
A non-invasive, injectable frontend widget built with Preact.
A scalable, asynchronous AI backend orchestrated on GKE Autopilot.

I am particularly proud of successfully navigating and resolving complex, real-world cloud engineering challenges, especially in the GKE Ingress networking layer. Furthermore, validating the Human-AI collaborative development paradigm was a game-changer, proving that such a sophisticated system could be architected and largely built by a solo developer in a matter of days.

🚀 What's Next

The current platform provides a robust foundation for a truly autonomous and scalable AI workforce. My roadmap is focused on bringing the "AI as a Company" concept to life:

Full Implementation of the "AI HR Department" (Dynamic Agent Management): Completing the automated pipeline where the OrchestratorAgent can "hire" new AI specialists by triggering a serverless CI/CD process to provision GKE deployments and Pub/Sub topics automatically.
Building the "Corporate IT Department" (Centralized Tool Management): Creating a UI for registering new capabilities (e.g., new APIs) as shareable "tools" that can be dynamically assigned to agents, allowing the ecosystem to learn new skills without redeployments.
Introducing the "Strategic Planning Department" (AI-Powered Auditing): Creating a high-level "Auditor Agent" that analyzes operational data to provide proactive strategic advice, such as recommending the creation of new agents based on market trends identified in the RAG knowledge base.
Enhancing Long-Term Conversational Memory: Evolving beyond single-session memory by leveraging a persistent store like Firestore or Cloud Memorystore to provide a truly continuous and personalized user experience across multiple visits.

Built With

artifact-registry
cloud-build
cloud-storage
crewai
css3
docker
fastapi
firestore
gemini-2.5-pro
gke-autopilot
html5
langchain
nginx
preact
pub/sub
python
typescript
vector-search
vertex-ai
vite
vue.js
workload-identity

Updates

季儒莊 started this project — Sep 22, 2025 11:05 AM EDT

Leave feedback in the comments!

Log in or sign up for Devpost to join the conversation.