πŸ’‘ Inspiration

With the massive shift towards digital shopping, brick-and-mortar retail often struggles to provide the same level of instant, personalized search. We wanted to build an agent that brings the digital concierge experience into the physical retail worldβ€”helping shoppers find exactly what they need, even if they only have a photo of it.

βš™οΈ What it does

The Retail Concierge is a real-time, multimodal AI agent that helps shoppers find products in a store catalog.

  • Multimodal Search: Users can type what they are looking for or upload a photo of a product.
  • Semantic Matching: Using Gemini embeddings and MongoDB Vector Search, it finds the most relevant products instantly based on deep semantic meaning, not just keyword matching.
  • Interactive Chat: The agent converses naturally with the user, referencing live prices, features, and inventory stock before allowing them to add items directly to their cart.

πŸ› οΈ How we built it

  • Frontend & Backend: Next.js (React)
  • AI Brain & Embeddings: Google Gemini (Gemini 2.5 Flash / Flash-Lite, and gemini-embedding-2)
  • Vector Database (Partner Tech): MongoDB Atlas Vector Search
  • Deployment: Google Cloud Run

We generated product embeddings using Google Gemini's embedding model and stored them in MongoDB Atlas. When a user queries the app or uploads an image, the backend uses Gemini to describe the image, generates an embedding for the search context, and performs a $vectorSearch aggregation against our MongoDB products collection. Finally, Gemini 2.5 Flash is fed the search results as context to power the conversational interface.

🧠 Challenges we ran into

  • Multimodal Context: Getting the AI to perfectly blend an uploaded image with a text query required an intermediate step where Gemini first described the image before we vectorized the description.
  • Deployment Permissions: We had to configure the correct IAM permissions for Google Cloud Build to properly access our storage buckets and deploy the container to Cloud Run.

πŸ† Accomplishments that we're proud of

  • Seamlessly integrating MongoDB Vector Search with Google Gemini's latest embedding models to return sub-second search results.
  • Successfully deploying a polished, production-ready application to Google Cloud Run.

πŸ“š What we learned

We learned how incredibly powerful combining MongoDB Atlas Vector Search with Google Gemini can be for building agents. The ability to retrieve precise catalog data and feed it to a reasoning engine completely changes how we can build user experiences.

πŸš€ What's next for Retail Concierge

  • In-Store Navigation: Integrating indoor mapping so the agent can tell the user exactly which aisle the product is located in.
  • Voice Interface: Adding speech-to-text so users can talk to the concierge hands-free.

Built With

Share this project:

Updates

posted an update

Real-Time Retail Concierge is Live!

Excited to share that our submission for the Google Cloud Rapid Agent Hackathon is fully deployed and ready to try!

What we built:

  • Multimodal Search - upload any product photo and get instant semantic matches powered by Gemini Vision
  • Conversational AI - chat naturally to find products, check prices, and manage your cart
  • Sub-second results - MongoDB Atlas Vector Search with $vectorSearch aggregation
  • Production deployed - live on Google Cloud Run

Try it now:

https://retail-concierge-262035403755.us-central1.run.app

Watch the demo:

https://youtu.be/q0fNnq0IDWQ

Built with Google Gemini 2.5 Flash, MongoDB Atlas, Next.js, and Google Cloud Run.

GoogleCloud #Gemini #MongoDB #VectorSearch #AI #RetailTech

Log in or sign up for Devpost to join the conversation.