Inspiration : I kept thinking of scenarios where a victim can’t call 911—kidnappers, domestic abuse, or someone right outside your door. With 30 % of wireless 911 calls silent or misdials and 28 million Americans facing language barriers, a covert AI “friend” that poses as a pizza-delivery bot can bridge that gap. It only needs one use to save a life.

What I Learned

  • Real-time audio streaming with WebSockets & RTC
  • Integrating Twilio Programmable Voice for inbound calls
  • Building a lightning-fast frontend with Vite
  • Chaining OpenAI APIs for robust STT → NLU → TTS flows

How I Built It

  1. Telephony layer: Twilio routes calls to my Node.js server.
  2. Audio pipeline: Browser-based WebSocket forwards audio to OpenAI for transcription; GPT slots fill location, emergency type, and codewords.
  3. Persona engine: Prompts shape “pizza-order” small talk, then stealthily switch to emergency mode.
  4. 911 relay: As soon as key data is captured—or a duress codeword is spoken—the backend silently POSTs a JSON payload to a mock 911 endpoint.
  5. Speaker mode: Users can put the phone on speaker; the threat never suspects it’s a real emergency call.

Challenges Faced
My initial plan was to use OpenAI’s speech-to-speech model for seamless, low-latency convo. UCSD’s firewall blocked our RTC requests, so I pivoted to a chained STT→GPT→TTS approach—spending hours researching workarounds and optimizing latency.

https://github.com/sam-siavoshian/CoverAi https://cover-seven-rho.vercel.app/

Built With

Share this project:

Updates