Smooth Talker: AI Accent Coach - Hackathon Submission

Inspiration

We've all been there - struggling to be understood, having to repeat ourselves, or feeling self-conscious about our accent. I faced this challenge myself and noticed the biggest issue wasn't lack of knowledge, but lack of natural practice. Traditional solutions fall short: tutoring is expensive, pronunciation apps are boring with robotic word-by-word exercises, and language classes treat adults like school children with rigid lesson plans.

After trying advanced AI voices like Gemini Live and ElevenLabs, I realized they could power a far more natural way to practice. We developed techniques that let LLMs not just understand pronunciation and accent, but also quantify and track improvement in real time during natural conversations.

What it does

Smooth Talker lets users practice pronunciation naturally through AI conversations that adapt to their specific needs. Here's what makes it special:

🎯 Smart Accent Analysis: Tracks 34 curated accent features and rates them in real time, without forcing users to repeat artificial sentences

🗣️ Natural Conversations: Users speak naturally while our dual-AI system provides conversation AND accent scoring simultaneously

📊 Adaptive Learning: AI understands each user's pronunciation challenges and adapts conversations to focus on problem areas

🔄 Intelligent Feature Rotation: Keeps conversations engaging by rotating focus areas while ensuring users practice their weak spots

📱 Cross-Platform: Full-featured mobile apps for iOS and Android, plus a web version

🎧 Premium Audio: Integration with ElevenLabs for natural-sounding AI responses

📚 Structured Learning: Optional pre-compiled lessons for users who want traditional practice alongside conversational learning

The magic happens through parallel processing: while users have natural conversations with our AI, a separate system rates their pronunciation in the background, feeding insights back to improve future conversations.
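The feedback loop described above can be sketched roughly as follows. This is an illustration under stated assumptions, not the app's actual code: the `FeatureState` shape, the 0 to 100 scale, the smoothing factor, and the cooldown rule are all hypothetical. Each background rating is blended into a running score, and the next conversation targets the weakest features that were not just practiced.

```typescript
// Hypothetical shapes: the feature ids, the 0-100 scale, and the smoothing
// factor are illustrative assumptions, not Smooth Talker's actual values.
interface FeatureState {
  id: string;          // e.g. "th-voiced"
  score: number;       // running score, 0 (weak) to 100 (strong)
  lastFocused: number; // session index when last targeted, -1 if never
}

// Blend a new background rating into the running score
// (exponential moving average), clamping the rating to 0-100 first.
function applyRating(state: FeatureState, rating: number, alpha = 0.3): FeatureState {
  const clamped = Math.min(100, Math.max(0, rating));
  return { ...state, score: (1 - alpha) * state.score + alpha * clamped };
}

// Pick `count` features for the next conversation: weakest first, but skip
// anything focused within the last `cooldown` sessions so topics rotate.
function pickFocus(
  features: FeatureState[],
  session: number,
  count = 3,
  cooldown = 1,
): FeatureState[] {
  const rested = features.filter(
    (f) => f.lastFocused < 0 || session - f.lastFocused > cooldown,
  );
  // Fall back to the full list if too few features are rested.
  const pool = rested.length >= count ? rested : features;
  return [...pool].sort((a, b) => a.score - b.score).slice(0, count);
}
```

The cooldown is what keeps the weakest feature from dominating every session; a weighted random draw would be another reasonable way to get the same rotation effect.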

How we built it

🚀 100% AI-Assisted Development: Built entirely through "vibe coding" - no traditional coding required! Used Bolt.new for rapid prototyping and Claude Code for complex implementations.

🔧 Technical Stack:

  • Frontend: React Native (mobile) + React (web)
  • AI Integration: Gemini Live API for real-time accent scoring, ElevenLabs for premium voice
  • Backend: Supabase for data management and user tracking
  • Hosting: Netlify for seamless deployment
  • Real-time Communication: WebSocket connections for live audio streaming

⚡ Development Process:

  • Spent countless hours perfecting Gemini Live API integration for accurate accent understanding
  • Implemented sophisticated parallel WebSocket connections for simultaneous conversation and rating
  • Created local-first storage architecture to minimize backend calls
  • Designed custom UI with cohesive color palette and Airbnb-inspired iconography (generated via Gemini ImageGen)
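As a rough illustration of the parallel-connection idea, the sketch below forwards every microphone chunk to two independent sinks, one standing in for the conversation connection and one for the rating connection. The `ChunkSink` interface and `fanOut` helper are hypothetical names introduced here; the real app's connection handling is not shown in this write-up.

```typescript
// Anything with a `send` method can act as a sink. In this sketch the sinks
// stand in for the conversation and rating connections (assumed design).
interface ChunkSink {
  send(chunk: Uint8Array): void;
}

// Build a function that forwards each microphone chunk to every sink and
// returns how many sinks accepted it. A failure in one sink (for example a
// dropped rating connection) must not interrupt delivery to the others.
function fanOut(sinks: ChunkSink[]): (chunk: Uint8Array) => number {
  return (chunk) => {
    let delivered = 0;
    for (const sink of sinks) {
      try {
        sink.send(chunk);
        delivered++;
      } catch {
        // Swallow and continue; a real app would log and schedule a reconnect.
      }
    }
    return delivered;
  };
}
```

Keeping the two sinks isolated means the conversation can continue even when scoring is temporarily unavailable, which matters more to the user than a missed rating.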

Challenges we ran into

🎙️ Audio Engineering Nightmare: The biggest challenge was implementing Gemini Live on React Native with proper voice isolation. Issues included:

  • Microphone capturing AI audio output (feedback loops)
  • Audio overlap and timing synchronization
  • Cross-platform audio handling differences between web and mobile
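One common way to stop the microphone from picking up the AI's own voice is half-duplex gating: microphone audio is dropped while the AI is speaking and for a short tail afterwards. The sketch below shows that idea only; it is not necessarily how Smooth Talker solved the problem, and the `MicGate` class and the 250 ms default are assumptions made for illustration.

```typescript
// Half-duplex gate: drop microphone chunks while the AI voice is playing,
// plus a short tail so speaker echo can decay before capture resumes.
// The 250 ms default is a guess, not a measured value.
class MicGate {
  private readonly tailMs: number;
  private playing = false;
  private resumeAt = 0; // timestamp (ms) from which mic audio is accepted again

  constructor(tailMs = 250) {
    this.tailMs = tailMs;
  }

  // Call when AI audio playback begins.
  playbackStarted(): void {
    this.playing = true;
  }

  // Call when AI audio playback finishes; starts the tail window.
  playbackEnded(nowMs: number): void {
    this.playing = false;
    this.resumeAt = nowMs + this.tailMs;
  }

  // True if a mic chunk captured at `nowMs` should be forwarded upstream.
  shouldForward(nowMs: number): boolean {
    return !this.playing && nowMs >= this.resumeAt;
  }
}
```

The trade-off is that the user cannot interrupt the AI mid-sentence. Supporting interruptions would need acoustic echo cancellation instead of, or in addition to, a simple gate.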

🔄 Backend Migration: Started with Convex but encountered integration issues that cost us a week. Successfully migrated to Supabase for smoother development.

📱 Platform-Specific Hurdles:

  • Apple App Store subscription integration (still in progress)
  • Managing WebSocket connections across different mobile environments
  • Implementing Voice over Data (VoD) architecture from scratch

🎨 UI/UX Consistency: Maintaining design coherence across web and mobile while ensuring accessibility and performance.

Accomplishments that we're proud of

⚡ Lightning Development: Built the entire web application in under 48 hours using AI-assisted development tools

🎨 Design Excellence:

  • Created a stunning, cohesive color palette that feels premium and modern
  • Generated custom iconography following current design trends
  • Achieved Airbnb-level UI polish through AI-assisted design

🧠 AI Breakthrough: Successfully made Gemini Live understand and accurately rate pronunciation in real time - a significant technical achievement

📱 Production Ready: Apps are polished and ready for App Store/Play Store publication

🛠️ Tool Mastery: Successfully leveraged cutting-edge development tools including Bolt.new for rapid prototyping and Claude Code for complex implementations

🔊 Audio Innovation: Solved complex audio engineering challenges including voice isolation and real-time processing

What we learned

🤖 AI-Assisted Development is Revolutionary: "Vibe coding" with AI tools can accomplish in hours what traditionally takes weeks. The future of development is collaborative human-AI creation.

🎓 LLMs Excel at Personalized Learning: AI provides unlimited patience, vast knowledge, and private practice environments that traditional methods can't match.

🔧 Technical Insights:

  • WebSocket management for real-time applications requires careful architecture
  • Local-first design principles significantly improve user experience
  • Cross-platform audio handling demands platform-specific solutions
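To make the local-first point concrete, here is a minimal outbox sketch: writes are recorded locally at once and uploaded later in batches, so the UI never waits on the network. Everything in it is hypothetical (the `Outbox` class, the batch size, the in-memory array standing in for on-device storage), and no Supabase call is shown.

```typescript
// Local-first outbox: writes land locally (an in-memory array stands in for
// on-device storage) and are uploaded later in batches.
// All names here are hypothetical.
interface PendingWrite<T> {
  seq: number; // local sequence number, used to acknowledge uploads
  value: T;
}

class Outbox<T> {
  private pending: PendingWrite<T>[] = [];
  private nextSeq = 1;

  // Record a write locally; returns immediately with its sequence number.
  add(value: T): number {
    const seq = this.nextSeq++;
    this.pending.push({ seq, value });
    return seq;
  }

  // Up to `max` oldest unsynced writes, for one batched backend request.
  // Items stay queued until acknowledged, so a failed upload can be retried.
  nextBatch(max = 20): PendingWrite<T>[] {
    return this.pending.slice(0, max);
  }

  // Drop everything up to and including `seq` once the backend confirms it.
  acknowledge(seq: number): void {
    this.pending = this.pending.filter((w) => w.seq > seq);
  }

  // Number of writes still waiting to be synced.
  get size(): number {
    return this.pending.length;
  }
}
```

A caller would periodically take `nextBatch()`, send it in one request, and call `acknowledge` with the highest confirmed sequence number, which keeps backend calls to roughly one per batch rather than one per rating.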

📱 Product Development: Rapid prototyping with AI tools allows for quick iteration and user feedback integration

What's next for Smooth Talker: AI Accent Coach

📈 Launch Strategy:

  • Apps ready for publication pending Apple approval and payment integration completion
  • Comprehensive marketing strategy developed and ready to execute
  • Beta testing with close users yielded overwhelmingly positive feedback

💰 Business Model:

  • Freemium pricing strategy finalized with clear value proposition
  • Subscription tiers designed to provide value at every level
  • Revenue projections and market analysis completed

🚀 Growth Plans:

  • Expanding language support beyond American English
  • Corporate training partnerships for international businesses
  • Integration with popular language learning platforms
  • Advanced AI coaching features using latest voice AI models

🌟 Vision: Transform how people learn pronunciation by making it as natural as having a conversation with a friend - but with the intelligence and patience only AI can provide.
