Inspiration

Public speaking is a skill many people struggle with and find anxiety-inducing, yet meaningful feedback is hard to get without a coach or instructor. We wanted to make high-quality speech feedback accessible to anyone with a microphone.

What it does

SpeechMentor is a speech coaching web app that helps users speak more clearly and confidently. After a user records a speech in the browser, the app returns:

  • A full transcript
  • Words-per-minute (pace)
  • Filler words
  • Confidence and clarity scores
  • Actionable feedback on how to improve

How we built it

  • Frontend: HTML, Tailwind CSS, and JavaScript using the Web Audio API for in-browser recording
  • Backend: Node.js + Express
  • Speech-to-Text: ElevenLabs Speech-to-Text API
  • Analysis: Custom logic currently handles pace calculation, filler word detection, and scoring. We hope to integrate the Gemini API later to generate the feedback itself.
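
To make the analysis step concrete, here is a minimal sketch of how pace and filler-word counts could be derived from a transcript. It is not the project's actual code: the filler list, the function names, and the rounding are all assumptions chosen for illustration.

```javascript
// Hypothetical filler list; the project's real list is not given in this write-up.
const FILLERS = ["um", "uh", "like", "basically", "you know"];

// Split a transcript into lowercase words, dropping punctuation.
function tokenize(transcript) {
  return transcript.toLowerCase().match(/[a-z']+/g) ?? [];
}

// Words per minute, given the transcript and the recording length in seconds.
function wordsPerMinute(transcript, durationSeconds) {
  if (durationSeconds <= 0) return 0;
  return Math.round(tokenize(transcript).length / (durationSeconds / 60));
}

// Count each filler as a whole word or phrase, so "like" does not match "likely".
function countFillers(transcript, fillers = FILLERS) {
  const text = tokenize(transcript).join(" ");
  const counts = {};
  for (const filler of fillers) {
    const hits = text.match(new RegExp(`\\b${filler}\\b`, "g"));
    if (hits) counts[filler] = hits.length;
  }
  return counts;
}
```

Whole-word matching matters here: a plain substring search would count "likely" as a use of "like" and inflate the filler total.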

Challenges we ran into

  • Handling audio recording and encoding reliably across browsers using the MediaRecorder API
  • Correctly formatting multipart audio uploads for the speech-to-text API
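
The upload step above can be sketched as follows, assuming Node.js 18+ where `FormData`, `Blob`, and `fetch` are available globally. The field names `file` and `model_id`, the default model `scribe_v1`, and the `xi-api-key` header are assumptions that should be checked against the ElevenLabs documentation; `buildTranscriptionForm` and `transcribe` are hypothetical helper names, and the endpoint URL is left as a parameter.

```javascript
// Build the multipart body for a recorded audio buffer.
// Field names and the default model ID are assumptions, not confirmed by this write-up.
function buildTranscriptionForm(audioBuffer, mimeType = "audio/webm", modelId = "scribe_v1") {
  const form = new FormData();
  const blob = new Blob([audioBuffer], { type: mimeType });
  form.append("file", blob, "recording.webm"); // the filename gives the part a file disposition
  form.append("model_id", modelId);
  return form;
}

// Send the form to the speech-to-text endpoint supplied by the caller.
async function transcribe(audioBuffer, { url, apiKey }) {
  const response = await fetch(url, {
    method: "POST",
    // Content-Type is deliberately not set: fetch adds it along with the multipart boundary.
    headers: { "xi-api-key": apiKey },
    body: buildTranscriptionForm(audioBuffer),
  });
  if (!response.ok) throw new Error(`Transcription failed: ${response.status}`);
  return response.json();
}
```

A common cause of malformed multipart requests is setting `Content-Type: multipart/form-data` by hand, which omits the boundary the server needs to split the parts.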

Accomplishments that we're proud of

  • It sorta works!

What we learned

  • Audio handling in web apps is more complex than it looks
  • Some of our group learned version control during this hackathon

What's next for SpeechMentor

  • Session history and progress tracking
  • Tone, sentiment, and structure analysis
  • Real, generated feedback to replace the current hard-coded feedback!
