About the Project
Inspiration
Shopping can often be a daunting task, especially when faced with overwhelming options or limited assistance. Inspired by the need to simplify and personalize this experience, we envisioned an AI-powered shopping assistant that uses voice and video interaction to bridge the gap between customers and retail solutions.
What We Learned
- Technological Integration: Combining cutting-edge tools like video avatars, voice monitoring, and real-time APIs to create a seamless user experience.
- Conversational AI: Leveraging advanced AI models for natural, context-aware interactions.
- User Engagement: Understanding how multimedia interactions can enhance customer satisfaction.
How We Built It
Tech Stack & Tools
- Simli: Used for creating realistic video avatars, providing a friendly face to the AI assistant.
- Vapi: Powered voice interaction and monitoring to ensure smooth, real-time communication.
- Anthropic and ChatGPT Models: Enabled conversational AI with nuanced understanding and personalized responses.
- Real-Time API: Facilitated instant voice interactions for fluid communication.
- Function Calling: Enabled web interactions to seamlessly execute tasks like product searches, recommendations, and order placement.
- Simli: Used for creating realistic video avatars, providing a friendly face to the AI assistant.
Development Process
- Designed a system that integrates voice, video, and conversational AI into a unified experience.
- Developed robust APIs for real-time voice and video interactions, ensuring minimal latency.
- Incorporated monitoring systems for performance and user engagement analysis.
- Designed a system that integrates voice, video, and conversational AI into a unified experience.
User-Centric Approach
- Conducted usability testing to refine the assistant’s ability to understand diverse accents and preferences.
- Added dynamic function-calling capabilities for web-based tasks, ensuring the assistant is practical and functional for users.
- Conducted usability testing to refine the assistant’s ability to understand diverse accents and preferences.
Challenges Faced
- Integration of Diverse Technologies: Combining video avatars, voice monitoring, and real-time APIs into a cohesive system required extensive testing and fine-tuning.
- Performance Optimization: Maintaining low latency in voice and video interactions while handling real-time API calls.
- AI Training: Fine-tuning models to ensure accurate, personalized responses across various shopping scenarios.
- Scalability: Building a solution that can adapt to different retail environments and product catalogs.
The Outcome
The result is a revolutionary shopping assistant that offers:
- Personalized Recommendations: Tailored to individual preferences and shopping history.
- Engaging Interactions: Combining video avatars and voice AI for a natural, human-like experience.
- Seamless Functionality: Real-time voice interactions and web-based tasks create an intuitive shopping journey.
This project has not only advanced our technical expertise but also provided a scalable, innovative solution for enhancing the shopping experience through AI.
Built With
- coval
- lmnt
- senso.ai
- simili
- temporal
- vapi

Log in or sign up for Devpost to join the conversation.