NovaSonic AI Interviewer

Inspiration

We noticed how many candidates feel nervous, unprepared, or overwhelmed during interviews—especially in remote settings. Feedback is often delayed, superficial, or biased. What if there was a way to let people practise live interviews in a safe, repeatable environment, get immediate coaching, and for hiring teams to scale their interview process with consistency and data? That’s where NovaSonic AI Interviewer comes in: combining speech-to-speech, intelligent conversation, and AWS security to create an AI interviewer that helps both candidates and teams.

What it does

Hosts mock or real interviews via voice: candidate speaks, the AI interviewer asks questions (based on role, domain, difficulty), listens, responds, and optionally adapts follow-ups.
Provides live, natural speech interaction (no typing) with low latency using WebRTC + voice streaming.
Records interviews securely, stores audio (and optionally transcripts) for review / coaching.
Delivers feedback: immediate insights like pacing, clarity, filler-words, response length, gaps; later analysis or scores.
Allows hiring teams to standardize interview format, reuse question sets, compare across candidates, reduce interviewer bias.

How we built it

Backend using FastAPI on AWS (ECS Fargate) to orchestrate interviews, manage state, roles, credentials.
Speech model via Amazon NovaSonic (Bedrock) for converting text responses to voice, interpreting speech in real time.
Transport & streaming done via Daily + WebRTC: audio streaming, room management, secure token-based access.
Storage & infrastructure: S3 for recordings, IAM roles for access control; CloudFront and AWS CDK for deploying frontend; VPC setup for secure network; health/status endpoints.
Frontend: simple UI to join interviews, toggle audio/video, view transcripts, playback recordings.

Challenges we ran into

Latency vs Quality: balancing fast feedback vs clarity of speech (echoes, noisy inputs, variable bandwidth).
Context and follow-ups: deciding when the AI should branch or probe deeper vs sticking to a script, especially with free-form answers.
Feedback generation: what metrics to compute (e.g. pace, clarity, filler words) that are useful, fair, and interpretable.
User privacy & data retention: ensuring recordings are stored securely, access controlled, retention configurable, complying with GDPR / data protection best practices.
Voice/speech diversity: handling different accents, speech styles, background noise; ensuring the AI interviewer is fair and inclusive.

Accomplishments that we're proud of

Built an end-to-end working pipeline that supports real-time speech conversation with AI interviewer + voice responses.
Successfully recorded interviews and made playback possible, enabling review and coaching.
Modular architecture: easy to plug in new question sets, feedback modules, or adjust interview styles.
Security & infrastructure: deployed on AWS with proper IAM, roles, VPC, so it’s more than just a prototype but production-minded.

What we learned

Real users care not just about correctness but empathy: aspects like tone, encouragement, silence handling matter.
Even small delays or audio artifacts break immersion and make the candidate self-conscious.
There’s no one-size-fits-all in feedback: some prefer quantitative metrics, others qualitative tips. Customization is key.
Infrastructure and deployment concerns (security, scalability, cost) are just as important as model accuracy when building something used by real people.

What’s next for NovaSonic AI Interviewer

Improve feedback: more advanced metrics (sentiment, stress detection, content quality), dashboards for candidates and hiring managers.
Interviewer personality/styles: adapt tone (friendly, formal, technical), pacing, behavior depending on role.
Support multilingual interviews: candidate can speak in different languages, interviewer responds appropriately.
Bias detection & fairness: monitor for and mitigate any bias in follow-up questions or feedback based on accent, speech style, etc.
Mobile support & UX polish: better audio UIs in poor bandwidth, fallback options, more intuitive UX.
Experiment with real hiring process pilots: test with organizations, get live interviews, collect feedback data, improve.

Built With

Submitted to

AWS AI Agent Global Hackathon

Created by

As an AWS Ambassador at DoiT International, I elevate internal AWS proficiency by championing best practices, providing architectural guidance, and fostering certification growth to accelerate customer value delivery; amplify market presence by representing DoiT as a trusted AWS expert through thought leadership and global APN event participation; and accelerate customer success by applying deep AWS expertise to drive enhanced security, optimized costs, and faster deployment cycles for clients.

Richard Kang

Updates

Richard Kang started this project — Sep 29, 2025 11:33 PM EDT

Leave feedback in the comments!

Log in or sign up for Devpost to join the conversation.