Dashboard
Sessions Listing
Personas Listing
Context Builder
Multi-participant Configuration
Training Session, Prospect Avatar, Interruption
Live Coaching
Teammate Agent
Multimodal Analysis + Tool Usage
Training Report + AI Feedback
GCP Deployment
Architecture
Firestore

Proxima

Proxima turns static role-play into a live, adaptive rehearsal environment where BDRs can safely make mistakes, sharpen their instincts, and walk into real calls already “warmed up.”

Inspiration

Most sales coaching happens after the fact—through recordings, scorecards, and retroactive feedback—when the opportunity to adjust is already gone.

Proxima shifts training before the call, creating a space where reps can rehearse full conversations that feel realistic and context-driven.

Advances in real-time generative AI, multimodal reasoning, and voice models make it possible to simulate:

Natural, interruptible conversations
Context-aware prospect behavior
Tool-augmented reasoning grounded in real deal data

What Proxima Does

Proxima is a real-time, AI-driven sales rehearsal platform for high-velocity revenue teams.

It enables reps to prepare, practice, and perform through:

Lifelike sales call simulations using context-rich personas
Real-time, bidirectional voice conversations
Interruption handling and dynamic topic control
Live coaching overlays during conversations
Post-session analytics combining quantitative and qualitative insights
Multi-participant AI simulations with teammates and stakeholders
Multimodal grounding using files, images, and structured inputs

Core Screens

Dashboard

View upcoming rehearsals
Access recent sessions
Review coaching reports
Track performance trends over time

Dashboard

Context Builder

Structured environment for defining simulation inputs and shaping AI behavior.

Context Builder

Persona Definition

Prospect identity (job title, company, industry, location)
Funnel stage (Awareness, Consideration, Decision, Expansion)
Stage intent (specific goal of the conversation)

Behavioral Modeling

Objection archetypes (Skeptic, Visionary, Guardian)
Decision styles (Data-driven, Consensus-based, Intuitive, Top-down)
Tunable sliders:
- Skepticism level
- Negotiation toughness
- Initial trust score

Context Injection

Key-value inputs for attaching assets such as:
- Decks
- RFP summaries
- Pricing sheets

Persona Randomization

Automatically generates realistic variability across:
- Stage
- Archetype
- Decision style
- Behavioral sliders

AI Teammates

Toggle participation on/off
Assign roles or randomize behavior
Simulate internal dynamics (e.g., dominant AE, hesitant junior rep)

Live Session Room

Primary simulation environment for real-time interaction.

Video-style layout with:
- User
- AI prospect
- AI teammates
Live transcript and coaching sidebar
Real-time feedback signals:
- Missed discovery opportunities
- Weak responses
- Over-talking or lack of clarity
Controls for:
- Mute
- Screen sharing
- Session termination

Live Session

Coaching

Session Report

Structured post-session analysis.

Talk ratio and listening balance
Question quality and discovery depth
Objection handling effectiveness
Coverage of key conversation areas

Includes:

Timestamped transcript highlights
Actionable improvement suggestions
Reusable coaching insights

Session Report

Persona Library

Reusable catalog of prospect profiles. Enables standardized training scenarios

Persona Library

Sessions List

Historical view of all rehearsals.

Chronological session tracking
Quick access to:
- Replays
- Reports
- Performance trends

Sessions List

How Everything Connects

Proxima is built as a tightly integrated real-time system powered by Google Cloud Platform and Gemini models.

The Next.js client streams audio, transcripts, and events via WebSockets
The FastAPI backend manages session state and orchestration
AI models handle conversation, reasoning, and analysis
Firestore persists structured data across sessions

This creates a continuous loop:

Configure → Simulate → Coach → Analyze → Improve

Tech Stack and Architecture

Frontend

Next.js (React) application
WebSocket-based streaming for low-latency interaction

Backend

FastAPI (Python) services for:
- Session orchestration
- Persona generation
- Report generation
Integrated with Google Gen AI Python SDK

AI and Models (Gemini-Centric)

Gemini Live (via Vertex AI)
- Real-time, bidirectional voice conversations
- Handles interruption, turn-taking, and tool invocation
Gemini Multimodal
- Processes text, images, and files
- Powers persona grounding and report generation
Tooling Layer
- Coaching hint generation
- File summarization
- Knowledge retrieval
- UI event triggering

All model interactions are orchestrated through Google’s Gen AI SDK, ensuring tight integration with GCP services.

GCP Infrastructure (Core Highlight)

Vertex AI
- Managed runtime for Gemini models
- Scalable inference for both live and batch workloads
Cloud Run
- Containerized backend deployment
- Horizontal scaling for concurrent sessions
Firestore
- Source of truth for:
- Personas
- Sessions
- Reports
- Configuration

This architecture ensures:

Low latency
High concurrency
Production-ready scalability

Integration Layer

WebSockets for real-time streaming
REST APIs for structured operations
Extensible tool registry for adding new capabilities

Challenges

Real-time AI Orchestration

Managing simultaneous voice, text, and tool outputs required a suppression and arbitration layer

Multi-Participant Simulation

Coordinating multiple AI agents with shared context and overlapping audio streams

Interruption Handling

Supporting natural barge-in without breaking conversational coherence

Persona Consistency

Maintaining behavioral realism across turns and sessions

Hybrid Analytics

Combining deterministic metrics with qualitative AI evaluation

Multimodal Context Flow

Aligning file uploads, screen shares, and live context in real time

Accomplishments

Delivered low-latency, real-time conversational performance
Built a dynamic persona engine grounded in real inputs
Implemented live coaching without disrupting flow
Enabled multi-agent simulation environments
Integrated multimodal reasoning for richer context understanding

Roadmap

Playbook Integration

Embed company-specific sales playbooks into simulations
Reinforce real-world selling frameworks

Knowledge Graph Reasoning

Model relationships between:
- Prospects
- Companies
- Opportunities
Improve contextual awareness and follow-up quality

Competitive Intelligence

Simulate real-world objections and competitor narratives
Dynamically scale difficulty

Role Switching

Allow reps to play the prospect
Identify weaknesses in messaging

Advanced Analytics

Funnel-stage heatmaps
Individual and team performance tracking
Correlation with pipeline outcomes

Localization

Multi-language simulations
Region-specific tone and objection styles

Emotion Modeling

Expressive AI avatars with behavioral cues
Train reps on timing and emotional intelligence

Live Meeting Integration

Join real calls as an AI assistant
Provide real-time coaching and post-call analysis

Adaptive Training (Long-Term Vision)

Multi-session, branching simulations
Dynamic difficulty adjustment
Automated playbook optimization from aggregated data

Built With

fastapi
firestore
gcp
gcp-cloud-run
gemini-live
nextjs
python
typescript

Updates

Rahul Suresh started this project — Mar 16, 2026 07:41 PM EDT

Leave feedback in the comments!

Log in or sign up for Devpost to join the conversation.