Inspiration
The idea for VoxVersus came from imagining what a debate would sound like if multiple expert minds—each with a unique worldview—could argue the same topic. What if an AI philosopher questioned the ethics, while a scientist focused on empirical data, and a contrarian just poked holes in every argument? We were fascinated by the potential of AI-driven simulations of human reasoning. It felt like an opportunity to explore how diverse perspectives can coexist, conflict, and even collaborate—all within the same conversation.
What it does
VoxVersus is a backend platform that powers structured, multi-round debates between six AI personalities: The Philosopher, The Scientist, The Advocate, The Pragmatist, The Contrarian, and The Historian. Users can:
Create and start new debates on custom topics
Watch as each AI generates arguments in real time
Judge the winner via AI or human reasoning
Vote and view analytics post-debate
Track performance on a leaderboard
Everything is powered by Hugging Face models (DialoGPT-medium) and backed by MongoDB for persistent debate tracking and analysis.
How we built it
Backend Framework: Flask 2.3.3
AI Integration: Hugging Face Transformers with custom persona prompts
Database: MongoDB (via PyMongo) to store debates, personalities, stats
Architecture:
Modular services for debate, personality, and AI text generation
RESTful API endpoints in routes/
Real-time round progression with turn-by-turn argument handling
Challenges we ran into
Maintaining Personality Identity Some models initially gave generic replies. We had to iterate prompt engineering to keep consistent persona behavior and tone.
Orchestrating Debate Flow Ensuring proper turn rotation across multiple rounds required state synchronization and robust logic for edge cases (timeouts, skips, etc.).
Model Latency Inference with large models was slow, especially on CPU. We optimized by reducing context length and pre-loading models at startup.
Judging Logic We struggled to define what makes an argument "strong" in an objective way. Eventually, we added both AI-based and human-in-the-loop judging.
Accomplishments that we're proud of
Created a fully functional AI debate engine with turn-based logic
Simulated six distinct debate personalities using one language model
Developed a voting and judging system for community engagement
Implemented debate analytics and a leaderboard to track AI performance
Designed a scalable backend ready for frontend or public API integration
What we learned
Prompt engineering is critical for guiding LLM behavior
Managing multi-agent systems requires careful design to prevent collisions or looping
Even the best AI models need context control and persona anchoring
MongoDB is flexible and powerful for nested, evolving data like debates
Flask’s blueprint pattern keeps larger applications modular and clean
What's next for VoxVersus
Frontend UI: Build a rich, interactive web interface for real-time debates
🤝 User-Submitted Topics: Allow the community to suggest and vote on debate topics
🎤 Human vs AI Debates: Let users take on one of the six personalities
📊 Advanced Analytics: Add sentiment analysis, argument strength scoring, and keyword heatmaps
⚙️ Model Upgrade: Experiment with more powerful or fine-tuned models like LLaMA or Claude
🌐 Live Debate Rooms: Enable Twitch-like channels for ongoing AI vs AI or human-vs-AI debates
VoxVersus is just the beginning of what AI dialogue can become—educational, entertaining, and thought-provoking.
Log in or sign up for Devpost to join the conversation.