Inspiration
Most AI assistants feel transactional — they answer questions but lack presence, emotion, and voice-first interaction.
We wanted to explore how AI could feel more human, calm, and conversational by focusing on voice, tone, and visual feedback instead of text alone.
This led to EchoMind, a voice-first conversational AI experience.
What it does
EchoMind is a voice-driven AI interface that allows users to interact naturally through speech.
It features:
- Hold-to-speak voice interaction
- Emotion-aware responses
- Real-time audio-reactive visuals
- A 3D, reactive orb representing the AI’s presence
- Seamless light and dark themes
- A polished glassmorphism-based UI
The goal is to make AI interaction feel like a conversation, not a command.
How we built it
- Frontend: HTML, CSS, JavaScript with canvas-based audio visualizations and a layered 3D orb built using gradients and shading
- Backend: FastAPI handling conversation flow and response generation
- Voice: Text-to-speech with audio analysis to drive real-time UI reactions
Design and engineering were developed together to ensure cohesion and performance.
Challenges we ran into
- Synchronizing audio playback with real-time visuals
- Creating a 3D-feeling orb without heavy 3D libraries
- Designing a clean, readable light theme
- Managing scroll-based UI motion smoothly
Each challenge required balancing performance, design, and usability.
Accomplishments that we're proud of
- Fully voice-first interaction flow
- A visually rich 3D o
Built With
- and-javascript
- and-real-time-voice-generation.-the-frontend-is-developed-with-html
- css
- elevenlabstext-to-speech
- emotion-detection
- enabling-scalable
- expressive-voice-interactions-and-designed-the-system-to-run-on-google-cloud?ready-architecture-(vertex-ai-/-gemini-compatible)
- fastapi
- featuring-voice-reactive-visuals-powered-by-the-web-audio-api.-we-integrated-the-elevenlabs-text-to-speech-api-to-deliver-natural
- googlecloud-ready-architecture(gemini)
- html
- javascript
- memory
- production-grade-deployment-of-a-voice-first
- python
- webaudioapi
Log in or sign up for Devpost to join the conversation.