Inspiration

Workplace conflicts cost companies $359 billion annually; not because people are wrong, but because they're talking past each other. We've all been in conversations where emotions run high and nobody feels heard. What if AI could act as an impartial mirror, showing both sides what they're missing?

What it does

CultureLens is a real-time conversation analysis platform. Users speak naturally with our voice AI mediator, which:

  • Transcribes and analyzes communication patterns in real-time
  • Identifies cultural dynamics and emotional escalation points
  • Surfaces hidden signals both parties might be missing
  • Delivers neutral, actionable insights without taking sides

Think of it as a conversation mirror that helps transform "you always..." into "I understand..."

How we built it

  • Frontend: Next.js 15, React, TailwindCSS, shadcn/ui
  • Voice AI: ElevenLabs Conversational AI with WebSocket integration
  • Analysis Engine: Google Gemini 2.5 Flash for cultural pattern recognition
  • Auth & Database: Firebase Auth + Firestore
  • Deployment: Vercel

The architecture flows: voice input → real-time transcription → AI analysis → structured insights with cultural context.

Challenges we ran into

  • WebSocket state management: Syncing session IDs between React components and the ElevenLabs voice agent required careful useEffect orchestration
  • Real-time analysis latency: Balancing comprehensive Gemini analysis with user experience—solved with streaming responses and progressive loading states
  • Cultural sensitivity: Training prompts to identify patterns without stereotyping or making assumptions

Accomplishments that we're proud of

  • End-to-end voice-to-insights pipeline working in real-time
  • Clean, production-ready UI with smooth loading animations
  • Gemini integration that genuinely identifies communication patterns and cultural dynamics
  • Zero-judgment analysis that focuses on understanding, not blame

What we learned

  • ElevenLabs Conversational AI is incredibly powerful for building natural voice interfaces
  • Cultural analysis requires nuance.. AI can identify patterns but must avoid generalizations
  • The hardest part of mediation isn't technology, it's designing for neutrality

What's next for CultureLens

  • Multi-party support: Handle group conversations with 3+ participants
  • Longitudinal tracking: Show communication improvement over time
  • Integration APIs: Embed CultureLens into HR platforms and therapy tools
  • Multilingual analysis: Detect code-switching and cross-cultural communication patterns

Built With

  • elevenlabs-conversational-ai
  • firebase-auth
  • firestore
  • framer
  • google-gemini-2.5-flash
  • motion
  • next.js
  • openapi
  • react
  • shadcn/ui
  • swagger
  • tailwindcss
  • typescript
  • vercel
  • websockets
  • zod
Share this project:

Updates