Inspiration
Traditional food apps are built for screens, not for people. They require endless scrolling, rigid filtering, and dozens of taps just to find a meal. We wanted to bring back the human element of service.
Our inspiration came from a my father who was trying to order food but due to being a non-techy person he couldn't find the desired item. So I thought of a system which listens to the user and helps to find the desired item.
What It Does
VoiceBite is an elite AI-powered voice ordering concierge. Instead of navigating menus manually, users simply speak.
Semantic Understanding
Understands subjective requests like “I want something light” or “a treat for the kids.”The “Yes” Policy
If an item is unavailable, the AI suggests the closest alternatives automatically.Voice-to-Checkout
Add items, explore deals, and complete simulated payments—completely hands-free.Proactive Salesmanship
Naturally surfaces deals and offers during conversation to maximize user value.
How We Built It
The Brain
Integrated Google Gemini as the reasoning engine, using LLM-based intent recognition to convert natural speech into structured JSON actions.Frontend
A high-performance React (Vite) app with a glass-morphism UI, Lucide React icons, and a custom speech recognition hook for real-time transcription.Backend
Node.js & Express API with MongoDB for menu management, acting as a low-latency bridge between frontend and Gemini AI.Feedback Loop
Used React Hot Toast to instantly confirm AI actions like cart updates or payments.
Challenges We Ran Into
The biggest challenge was Signal vs. Noise. Natural speech includes filler words like “um,” “maybe,” and “actually.” Filtering noise while accurately extracting intent required heavy prompt engineering and iterative testing of action-parsing logic.
Accomplishments We’re Proud Of
Our Subjective Intelligence Layer. Watching the AI interpret “I’m thirsty” and jump to beverages—or “I’m on a diet” and show healthy options—proved we built more than a voice-controlled menu. We built something truly smart.
What We Learned
The future of UI is conversational, not just graphical. We gained deep experience in prompt engineering, enforcing valid JSON outputs from LLMs, and bridging asynchronous AI workflows with synchronous React interfaces.
What’s Next for VoiceBite
VoiceBite is just getting started. Our roadmap includes:
- Multilingual Support – Order in any language, seamlessly translated for the kitchen
- Real-Time Inventory Integration – Syncing the “Yes Policy” with live POS systems
- Voice Fingerprinting – Recognizing returning users and greeting them with “The usual?”

Log in or sign up for Devpost to join the conversation.