Smooth Talker: AI Accent Coach - Hackathon Submission
Inspiration
We've all been there: struggling to be understood, having to repeat ourselves, or feeling self-conscious about our accent. I faced this challenge myself and found that the biggest obstacle wasn't a lack of knowledge but a lack of natural practice. Traditional solutions fall short: tutoring is expensive, pronunciation apps rely on robotic word-by-word drills, and language classes treat adults like schoolchildren with rigid lesson plans.
After trying advanced AI voices such as Gemini Live and ElevenLabs, I saw an opportunity to build something better. We developed techniques that let LLMs not only recognize pronunciation and accent features, but also quantify and track improvement in real time during natural conversations.
What it does
Smooth Talker lets users practice pronunciation naturally through AI conversations that adapt to their specific needs. Here's what makes it special:
Smart Accent Analysis: Tracks 34 curated accent features, rating them in real time without forcing users to repeat artificial sentences
Natural Conversations: Users speak naturally while our dual-AI system provides conversation AND accent scoring simultaneously
Adaptive Learning: AI understands each user's pronunciation challenges and adapts conversations to focus on problem areas
Intelligent Feature Rotation: Keeps conversations engaging by rotating focus areas while ensuring users practice their weak spots
Cross-Platform: Full-featured mobile apps for iOS and Android, plus web version
Premium Audio: Integration with ElevenLabs for natural-sounding AI responses
Structured Learning: Optional pre-compiled lessons for users who want traditional practice alongside conversational learning
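The adaptive focus and feature rotation described above could be driven by a simple priority score. The sketch below is illustrative only: the `FeatureScore` shape, the `pickFocusFeatures` name, and the staleness weight are assumptions, not the app's actual logic. It ranks weak features first and adds a bonus for features that have not been in focus recently, so the mix keeps rotating.

```typescript
// Hypothetical shape of one tracked accent feature (names are illustrative).
interface FeatureScore {
  id: string;              // e.g. "th-voiced"
  score: number;           // latest rating, 0 (weak) to 1 (strong)
  lastFocusedTurn: number; // conversation turn when it was last a focus area
}

// Pick `count` focus features. Weak scores rank first, and features that
// have not been practiced recently earn a staleness bonus so focus rotates.
function pickFocusFeatures(
  features: FeatureScore[],
  currentTurn: number,
  count: number,
  stalenessWeight: number = 0.05, // assumed tuning value
): string[] {
  return features
    .map((f) => ({
      id: f.id,
      priority: (1 - f.score) + stalenessWeight * (currentTurn - f.lastFocusedTurn),
    }))
    .sort((a, b) => b.priority - a.priority)
    .slice(0, count)
    .map((f) => f.id);
}
```

With a small staleness weight, weak spots dominate the selection; raising it makes the rotation more aggressive.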
The magic happens through parallel processing: while users have natural conversations with our AI, a separate system rates their pronunciation in the background, feeding insights back to improve future conversations.
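As a rough illustration of this parallel setup, the sketch below forwards each microphone chunk to two sinks: one for the conversation and one for background rating. The `AudioSink` interface and `DualAudioRouter` class are hypothetical names; in practice each sink would wrap a live connection, and the real wiring may differ.

```typescript
// Minimal interface both connections would satisfy (hypothetical).
interface AudioSink {
  send(chunk: Uint8Array): void;
}

class DualAudioRouter {
  droppedRatingChunks = 0;
  private conversation: AudioSink; // drives the spoken reply
  private rater: AudioSink;        // scores pronunciation in the background

  constructor(conversation: AudioSink, rater: AudioSink) {
    this.conversation = conversation;
    this.rater = rater;
  }

  // Forward one microphone chunk to both sessions.
  push(chunk: Uint8Array): void {
    this.conversation.send(chunk);
    try {
      this.rater.send(chunk);
    } catch {
      // Rating is best-effort: a failing rating connection must not
      // interrupt the conversation, so the chunk is only counted.
      this.droppedRatingChunks++;
    }
  }
}
```

Treating the rating path as best-effort is a design assumption here: the conversation stays responsive even if scoring briefly falls behind or disconnects.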
How we built it
100% AI-Assisted Development: Built entirely through "vibe coding" - no traditional coding required! Used Bolt.new for rapid prototyping and Claude Code for complex implementations.
Technical Stack:
- Frontend: React Native (mobile) + React (web)
- AI Integration: Gemini Live API for real-time accent scoring, ElevenLabs for premium voice
- Backend: Supabase for data management and user tracking
- Hosting: Netlify for seamless deployment
- Real-time Communication: WebSocket connections for live audio streaming
Development Process:
- Spent countless hours perfecting Gemini Live API integration for accurate accent understanding
- Implemented sophisticated parallel WebSocket connections for simultaneous conversation and rating
- Created local-first storage architecture to minimize backend calls
- Designed custom UI with cohesive color palette and Airbnb-inspired iconography (generated via Gemini ImageGen)
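To give a feel for the local-first storage idea in the list above, here is a minimal sketch. The `ScoreRecord` shape and `LocalFirstScores` class are assumptions for illustration, and an in-memory `Map` stands in for on-device storage. Writes land locally first, and changed records are uploaded later as a single batch.

```typescript
// Hypothetical record for one accent feature's latest rating.
interface ScoreRecord {
  featureId: string;
  score: number;
  updatedAt: number; // ms timestamp of the local write
}

class LocalFirstScores {
  private local = new Map<string, ScoreRecord>();
  private dirty = new Set<string>();

  // Writes land locally first, so the UI never waits on the network.
  save(record: ScoreRecord): void {
    this.local.set(record.featureId, record);
    this.dirty.add(record.featureId);
  }

  get(featureId: string): ScoreRecord | undefined {
    return this.local.get(featureId);
  }

  // Everything changed since the last confirmed sync, as one batch.
  pendingBatch(): ScoreRecord[] {
    return Array.from(this.dirty).map((id) => this.local.get(id)!);
  }

  // Call once the backend confirms the batch. Records rewritten after
  // the batch was taken stay dirty and go out with the next batch.
  markSynced(batch: ScoreRecord[]): void {
    for (const sent of batch) {
      const current = this.local.get(sent.featureId);
      if (current && current.updatedAt === sent.updatedAt) {
        this.dirty.delete(sent.featureId);
      }
    }
  }
}
```

The caller would upload `pendingBatch()` to the backend at a convenient moment (for example, at the end of a session) and then call `markSynced`, which turns many small writes into one request.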
Challenges we ran into
Audio Engineering Nightmare: The biggest challenge was implementing Gemini Live on React Native with proper voice isolation. Issues included:
- Microphone capturing AI audio output (feedback loops)
- Audio overlap and timing synchronization
- Cross-platform audio handling differences between web and mobile
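One simple way to avoid the feedback loop listed above is a half-duplex gate: drop microphone frames while AI audio is playing, plus a short tail for room echo. The sketch below is an assumption about how such a gate could look, not a description of the fix used in the app; `MicGate` and the 200 ms tail are hypothetical.

```typescript
class MicGate {
  // Timestamp (ms) until which AI audio is playing; nothing scheduled yet.
  private playbackEndsAt = Number.NEGATIVE_INFINITY;
  private tailMs: number;

  constructor(tailMs: number = 200) {
    this.tailMs = tailMs; // extra silence after playback to let echo decay
  }

  // Call whenever an AI audio chunk is scheduled for playback.
  notePlayback(startMs: number, durationMs: number): void {
    this.playbackEndsAt = Math.max(this.playbackEndsAt, startMs + durationMs);
  }

  // True when a mic frame captured at `nowMs` should be sent upstream.
  shouldForward(nowMs: number): boolean {
    return nowMs >= this.playbackEndsAt + this.tailMs;
  }
}
```

The trade-off is that users cannot interrupt the AI mid-sentence; supporting interruptions would need echo cancellation rather than a plain gate.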
Backend Migration: Started with Convex but encountered integration issues that cost us a week. Successfully migrated to Supabase for smoother development.
Platform-Specific Hurdles:
- Apple App Store subscription integration (still in progress)
- Managing WebSocket connections across different mobile environments
- Implementing Voice over Data (VoD) architecture from scratch
UI/UX Consistency: Maintaining design coherence across web and mobile while ensuring accessibility and performance.
Accomplishments that we're proud of
Lightning Development: Built the entire web application in under 48 hours using AI-assisted development tools
Design Excellence:
- Created a stunning, cohesive color palette that feels premium and modern
- Generated custom iconography following current design trends
- Achieved Airbnb-level UI polish through AI-assisted design
AI Breakthrough: Successfully made Gemini Live understand and accurately rate pronunciation in real time - a significant technical achievement
Nearly Production Ready: Apps are polished and close to App Store/Play Store publication, with Apple subscription integration still in progress
Tool Mastery: Successfully leveraged cutting-edge development tools including Bolt.new for rapid prototyping and Claude Code for complex implementations
Audio Innovation: Solved complex audio engineering challenges including voice isolation and real-time processing
What we learned
AI-Assisted Development is Revolutionary: "Vibe coding" with AI tools can accomplish in hours what traditionally takes weeks. The future of development is collaborative human-AI creation.
LLMs Excel at Personalized Learning: AI provides unlimited patience, vast knowledge, and private practice environments that traditional methods can't match.
Technical Insights:
- WebSocket management for real-time applications requires careful architecture
- Local-first design principles significantly improve user experience
- Cross-platform audio handling demands platform-specific solutions
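One common ingredient of careful WebSocket management is reconnecting with exponential backoff, so a flaky mobile network does not trigger a storm of retries. The helper below is a generic sketch under that assumption; `backoffDelayMs`, its defaults, and the jitter scheme are illustrative and not taken from the app.

```typescript
// Delay before reconnect attempt number `attempt` (0-based).
// The delay doubles per attempt up to `maxMs`; half of it is fixed and
// half is randomized so many clients do not reconnect in lockstep.
function backoffDelayMs(
  attempt: number,
  baseMs: number = 500,
  maxMs: number = 15000,
  random: () => number = Math.random, // injectable for testing
): number {
  const capped = Math.min(maxMs, baseMs * Math.pow(2, attempt));
  return Math.floor(capped / 2 + random() * (capped / 2));
}
```

A reconnect loop would wait `backoffDelayMs(attempt)` before each retry and reset `attempt` to 0 once a connection stays open.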
Product Development: Rapid prototyping with AI tools allows for quick iteration and user feedback integration
What's next for Smooth Talker: AI Accent Coach
Launch Strategy:
- Apps ready for publication pending Apple approval and payment integration completion
- Comprehensive marketing strategy developed and ready to execute
- Beta testing with a small circle of early users yielded overwhelmingly positive feedback
Business Model:
- Freemium pricing strategy finalized with clear value proposition
- Subscription tiers designed to provide value at every level
- Revenue projections and market analysis completed
Growth Plans:
- Expanding language support beyond American English
- Corporate training partnerships for international businesses
- Integration with popular language learning platforms
- Advanced AI coaching features using latest voice AI models
Vision: Transform how people learn pronunciation by making it as natural as having a conversation with a friend - but with the intelligence and patience only AI can provide.
Built With
- asyncstorage
- audio
- bolt.new
- claude-code
- css3
- elevenlabs-api
- expo-av
- expo-cli
- expo.io
- git
- github
- google-cloud
- google-gemini-live-api
- google-vertex-ai
- html5
- javascript
- jwt
- localstorage
- netlify
- node.js
- postgresql
- react
- react-native
- rest-apis
- storekit-2-(in-app-purchases)
- kotlin-(android)
- supabase
- tailwind-css
- typescript
- web
- webrtc
- websocket


