About Pitchly

Inspiration

Interview preparation is broken. Most people practice alone—talking to mirrors or awkwardly role-playing with friends. We wanted to create a realistic AI interviewer that sounds human, asks dynamic questions based on your resume, and gives instant feedback.

What We Learned

Real-time AI Streaming: Implementing low-latency voice conversations with GPT-4 and ElevenLabs
Audio Pipeline Architecture: Building a queue system for seamless recording, transcription, and playback
Prompt Engineering: Crafting prompts that generate natural interview questions while maintaining context
Performance Analytics: Calculating WPM and filler word density in real-time

How We Built It

Tech Stack: Next.js + Tauri + Express + MongoDB Atlas + OpenAI + ElevenLabs

Built voice pipeline (Whisper STT + ElevenLabs TTS)
Implemented streaming GPT-4 conversations with personas
Migrated to MongoDB for persistent storage
Created performance analytics engine
Packaged as desktop app with Tauri

Key Challenges

Audio Queue Management

Problem: Audio clips overlapped when AI generated responses too quickly.
Solution: Built a queue system that tracks playback state and handles chunks sequentially.

AI Response Loop

Problem: AI responded to its own output, creating infinite loops.
Solution: Added strict message role validation to only respond to user messages.