Audeza

Landing
Dashboard
Project View
Pitch deck and script overview
Pitching with Audeza Coach
Session score
Architecture

What Inspired me to build Audeza?

I've always been someone full of ideas. Over the years, I've built several of them — shipping products, writing code, solving real problems. But every single time, I hit the same wall: the pitch. I could build it, but I couldn't sell it. I'd stumble over my words, rush through slides, freeze when investors pushed back with hard questions. I knew my ideas were good — I just couldn't make other people feel that too. Mentors helped when they were available, but I couldn't call someone at midnight the day before a demo day. I needed a practice partner that was always there, that could actually watch me present and tell me what I was doing wrong. So I built one.

Audeza is an AI pitch coach that sees you, hears you, and coaches you in real-time. You upload your deck, turn on your camera, and start presenting — and Audeza gives you live feedback on your delivery, pacing, body language, and slide content. It can also flip into investor simulator mode, roleplaying as a tough VC who throws curveballs so you're never caught off guard again. After each session, you get a scorecard with per-slide breakdowns and pacing charts — so you can actually track yourself getting better.

Audeza was built with:

Gemini Live API for real-time multimodal streaming — audio, webcam, and slide awareness all at once
TanStack Start with file-based routing and server functions
Firebase Auth, Firestore, and Cloud Storage for the backend
Backend deployed on Cloud Run
React + shadcn/ui + Tailwind for a clean presenter-focused UI
Bun runtime with a WebSocket proxy bridging the browser to Gemini's live API
Audio via AudioWorklet at 16kHz PCM, webcam at 1 FPS JPEG — all streamed in real-time

Challenges I ran into:

Managing Gemini's context window with continuous audio and video — I had to design smart compression to work within the 2-minute media limit
Making slide awareness feel seamless so feedback is about this slide, not generic advice
Building a teleprompter that auto-scrolls by tracking your actual speech — matching spoken words to scripted text in real-time without being distracting.

Accomplishments that I'm proud of:

The coaching is truly multimodal — it watches your body language, hears your tone, and reads your slides all at once, just like a real mentor sitting across the table
Investor simulator mode that actually makes you sweat — the kind of practice that makes the real meeting feel easy
A scorecard that gives you honest, specific, per-slide feedback — not "great job!" but "you spent 45 seconds on slide 3 and said 'um' seven times"

What I learned ?

The Gemini Live API is incredibly powerful but demands careful orchestration when combining multiple media streams
Context-aware feedback (knowing which slide you're on, what it says) is what separates useful coaching from generic AI commentary
The hardest design problem wasn't technical — it was making real-time feedback feel helpful instead of interrupting your flow

What's next for Audeza ?

Launch publicly and get it into the hands of founders preparing for realdemo days
Team practice mode — let co-founders rehearse together with AI moderation
Industry-specific coaching — different feedback for VC pitches, sales demos, grant panels
Generate the slide decks
The bigger vision: no one should lose an opportunity because they couldn't communicate their idea.

The product was always good — the pitch just needed practice.

Built With

bun
gemini
react
shadcn
tailwind
tanstack

Updates

Carlos Brotas started this project — Mar 16, 2026 07:11 PM EDT

Leave feedback in the comments!

Log in or sign up for Devpost to join the conversation.