SpeechMentor

Inspiration

Public speaking is one of the most common and anxiety-inducing skills people struggle with, yet meaningful feedback is hard to get without a coach/instructor. We wanted to make high-quality speech feedback accessible to anyone with a microphone.

What it does

SpeechMentor is a speech coaching web app. The goal is to help users speak more clearly and confidently.

A full transcript
Words-per-minute (pace)
Filler words
Confidence and clarity scores
Actionable feedback on how to improve

How we built it

Frontend: HTML, Tailwind CSS, and JavaScript using the Web Audio API for in-browser recording
Backend: Node.js + Express
Speech-to-Text: ElevenLabs Speech-to-Text API
Analysis: Currently using custom logic for pace calculation, filler word detection, and scoring. We hope to later integrate with Gemini API for real feedback.

Challenges we ran into

Handling audio recording and encoding reliably across browsers using the MediaRecorder API
Correctly formatting multipart audio uploads for the speech-to-text API