Inspiration
Traditional language education lacks realistic speaking practice with native speakers, limiting exposure to natural accents, slang, and diverse vocabulary. We aimed to bridge this gap using AI-generated voices for immersive, real-world conversations.
What it does
Our platform lets students practice speaking with AI-generated native speaker voices, simulating natural conversations. Teachers can create customizable scenarios, and the AI adapts based on context. Sessions are recorded to support feedback on pronunciation and fluency and to track progress over time.
How we built it
We combined voice cloning, language models, and text-to-speech technology to create dynamic conversations. Teachers can design scenarios with custom grammar and vocabulary, while the AI adjusts to the conversation. The system records each session for both automated and teacher-led assessments.
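To illustrate how a teacher-designed scenario could steer the language model, here is a minimal sketch. It is not the project's actual code: the `Scenario` fields, the `build_system_prompt` helper, and the prompt wording are all hypothetical, chosen only to show custom grammar and vocabulary being turned into instructions for the conversational AI.

```python
from dataclasses import dataclass, field


@dataclass
class Scenario:
    """A teacher-defined practice scenario (hypothetical structure)."""
    title: str
    language: str
    setting: str
    target_vocabulary: list[str] = field(default_factory=list)
    target_grammar: list[str] = field(default_factory=list)


def build_system_prompt(scenario: Scenario) -> str:
    """Turn a scenario into plain-text instructions for a language model."""
    lines = [
        f"You are a native {scenario.language} speaker in this setting: {scenario.setting}.",
        "Stay in character and reply only in that language.",
        "Keep replies short so the student gets to speak often.",
    ]
    # Only mention vocabulary or grammar goals when the teacher supplied them.
    if scenario.target_vocabulary:
        lines.append(
            "Work these words in naturally: " + ", ".join(scenario.target_vocabulary) + "."
        )
    if scenario.target_grammar:
        lines.append(
            "Create openings for the student to use: " + ", ".join(scenario.target_grammar) + "."
        )
    return "\n".join(lines)


# Example scenario a teacher might create (illustrative values).
cafe = Scenario(
    title="Ordering at a cafe",
    language="Spanish",
    setting="a busy cafe in Madrid",
    target_vocabulary=["la cuenta", "para llevar"],
    target_grammar=["conditional tense for polite requests"],
)
prompt = build_system_prompt(cafe)
print(prompt)
```

Keeping the scenario as plain data means the same definition could be stored, edited by a teacher in the web interface, and reused across students, while the prompt text stays in one place.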
Challenges we ran into
Keeping conversations dynamic yet coherent across varied contexts was challenging. We also struggled to get accurate speech recognition and to record audio in real time without degrading performance.
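One common way to keep recording from slowing down a live conversation is to hand audio chunks to a background thread. The sketch below shows that general pattern only; it is an assumption, not a description of how this project solved the problem, and `BackgroundRecorder` is a hypothetical name.

```python
import io
import queue
import threading


class BackgroundRecorder:
    """Writes audio chunks on a worker thread so the caller never waits on I/O."""

    def __init__(self, sink):
        self._sink = sink  # any object with a write(bytes) method
        self._chunks = queue.Queue()
        self._worker = threading.Thread(target=self._drain, daemon=True)
        self._worker.start()

    def submit(self, chunk: bytes) -> None:
        """Queue a chunk and return immediately."""
        self._chunks.put(chunk)

    def close(self) -> None:
        """Flush everything queued so far, then stop the worker."""
        self._chunks.put(None)  # sentinel that tells the worker to exit
        self._worker.join()

    def _drain(self) -> None:
        while True:
            chunk = self._chunks.get()
            if chunk is None:
                break
            self._sink.write(chunk)


# Usage with an in-memory sink; a real session would write to a file instead.
buffer = io.BytesIO()
recorder = BackgroundRecorder(buffer)
for chunk in (b"\x00\x01", b"\x02\x03"):
    recorder.submit(chunk)
recorder.close()
recorded = buffer.getvalue()
```

Because a single worker drains the queue in order, chunks are written in the order they were submitted.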
Accomplishments that we're proud of
We created a platform for realistic, dynamic language practice with customizable scenarios. Teachers can track student progress and assess pronunciation and fluency objectively using automated tools.
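As a rough illustration of what an automated fluency measure could look like, the sketch below computes speaking rate and pause statistics from word-level timings. It assumes the timings have already been obtained from a forced aligner (such as the `mfa` tool listed under Built With) and parsed into simple records. The `WordInterval` type, the `fluency_metrics` function, and the 0.5-second pause threshold are hypothetical choices, not the project's actual scoring.

```python
from dataclasses import dataclass


@dataclass
class WordInterval:
    word: str
    start: float  # seconds from the start of the recording
    end: float    # seconds from the start of the recording


def fluency_metrics(words: list[WordInterval], pause_threshold: float = 0.5) -> dict[str, float]:
    """Compute simple fluency indicators from time-aligned words."""
    if not words:
        return {"words_per_minute": 0.0, "pause_count": 0.0, "mean_pause_seconds": 0.0}

    total_seconds = words[-1].end - words[0].start
    # Silence between the end of one word and the start of the next.
    gaps = [nxt.start - cur.end for cur, nxt in zip(words, words[1:])]
    pauses = [gap for gap in gaps if gap >= pause_threshold]

    return {
        "words_per_minute": len(words) * 60 / total_seconds if total_seconds > 0 else 0.0,
        "pause_count": float(len(pauses)),
        "mean_pause_seconds": sum(pauses) / len(pauses) if pauses else 0.0,
    }


# Illustrative timings, not real student data.
sample = [
    WordInterval("hola", 0.0, 0.5),
    WordInterval("me", 0.5, 0.75),
    WordInterval("gustaria", 1.5, 2.25),
    WordInterval("cafe", 2.5, 3.0),
]
metrics = fluency_metrics(sample)
print(metrics)
```

Numbers like these are only proxies for fluency, so they would most plausibly complement a teacher's review rather than replace it.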
What we learned
We learned the importance of context-aware conversations and the challenges of integrating voice cloning and transcription technologies in real time.
What's next for Immersive
We plan to expand scenarios and languages, improve the AI’s conversational abilities, and refine pronunciation assessments. We also aim to integrate more cultural nuances and idioms.
Built With
- flask
- mfa
- nextjs
- python
- typescript