landing page
product image upload live demo matching
user product search based recommendations
products selected and added in cart with live matches

💡 Inspiration

With the massive shift towards digital shopping, brick-and-mortar retail often struggles to provide the same level of instant, personalized search. We wanted to build an agent that brings the digital concierge experience into the physical retail world—helping shoppers find exactly what they need, even if they only have a photo of it.

⚙️ What it does

The Retail Concierge is a real-time, multimodal AI agent that helps shoppers find products in a store catalog.

Multimodal Search: Users can type what they are looking for or upload a photo of a product.
Semantic Matching: Using Gemini embeddings and MongoDB Vector Search, it finds the most relevant products instantly based on deep semantic meaning, not just keyword matching.
Interactive Chat: The agent converses naturally with the user, referencing live prices, features, and inventory stock before allowing them to add items directly to their cart.

🛠️ How we built it

Frontend & Backend: Next.js (React)
AI Brain & Embeddings: Google Gemini (Gemini 2.5 Flash / Flash-Lite, and gemini-embedding-2)
Vector Database (Partner Tech): MongoDB Atlas Vector Search
Deployment: Google Cloud Run

We generated product embeddings using Google Gemini's embedding model and stored them in MongoDB Atlas. When a user queries the app or uploads an image, the backend uses Gemini to describe the image, generates an embedding for the search context, and performs a $vectorSearch aggregation against our MongoDB products collection. Finally, Gemini 2.5 Flash is fed the search results as context to power the conversational interface.

🧠 Challenges we ran into

Multimodal Context: Getting the AI to perfectly blend an uploaded image with a text query required an intermediate step where Gemini first described the image before we vectorized the description.
Deployment Permissions: We had to configure the correct IAM permissions for Google Cloud Build to properly access our storage buckets and deploy the container to Cloud Run.

🏆 Accomplishments that we're proud of

Seamlessly integrating MongoDB Vector Search with Google Gemini's latest embedding models to return sub-second search results.
Successfully deploying a polished, production-ready application to Google Cloud Run.

📚 What we learned

We learned how incredibly powerful combining MongoDB Atlas Vector Search with Google Gemini can be for building agents. The ability to retrieve precise catalog data and feed it to a reasoning engine completely changes how we can build user experiences.

🚀 What's next for Retail Concierge

In-Store Navigation: Integrating indoor mapping so the agent can tell the user exactly which aisle the product is located in.
Voice Interface: Adding speech-to-text so users can talk to the concierge hands-free.

Built With

gemini
google-cloud-run
mongodb
next.js

Submitted to

Google Cloud Rapid Agent Hackathon

Created by

I designed and built the entire Real-Time Retail Concierge from scratch —
architecture, frontend, and backend. I integrated Google Gemini's multimodal
vision and embedding APIs to enable image-based product search, built the
MongoDB Atlas Vector Search pipeline using $vectorSearch aggregation, and
developed the conversational AI chat interface with real-time product matching.
I also handled the full Google Cloud Run deployment, IAM configuration, and
containerization. This was my first time combining multimodal AI with vector
search, and seeing it work in real-time was incredibly rewarding!

Swati Gupta

Updates

Swati Gupta posted an update — Jun 09, 2026 04:07 PM EDT

Real-Time Retail Concierge is Live!

Excited to share that our submission for the Google Cloud Rapid Agent Hackathon is fully deployed and ready to try!

What we built:

Multimodal Search - upload any product photo and get instant semantic matches powered by Gemini Vision
Conversational AI - chat naturally to find products, check prices, and manage your cart
Sub-second results - MongoDB Atlas Vector Search with $vectorSearch aggregation
Production deployed - live on Google Cloud Run

Try it now:

https://retail-concierge-262035403755.us-central1.run.app

Watch the demo:

https://youtu.be/q0fNnq0IDWQ

Built with Google Gemini 2.5 Flash, MongoDB Atlas, Next.js, and Google Cloud Run.

GoogleCloud #Gemini #MongoDB #VectorSearch #AI #RetailTech

Log in or sign up for Devpost to join the conversation.

Swati Gupta started this project — Jun 09, 2026 04:01 PM EDT

Leave feedback in the comments!

Log in or sign up for Devpost to join the conversation.