Inspiration
In a world where every app claims to use AI, we realized something was missing: emotion.
Financial assistants, chatbots, and virtual helpers usually talk at you, not with you. They give information, not connection.
We were inspired by a simple question:
“What if your bank could actually talk to you like a person?”
Traditional apps bombard users with financial data and notifications no one reads. We wanted to make finance feel human, quick, conversational, and intelligent.
We also wanted to push voice AI further than typical assistants, mixing Gemini’s reasoning with ElevenLabs’ emotional voice to create something that truly feels alive.
What it does
We built Capital One Guardian, a next-generation banking experience powered by AI with empathy, a voice-driven assistant that learns from you, remembers your behavior, and protects you in real time.
Guardian isn’t just a chatbot. It’s a companion. It predicts your future balance, detects suspicious activity, explains your spending, and talks to you naturally, with tone, emotion, and awareness.
It transforms financial interactions from static numbers into dynamic, emotional conversations that feel personal and alive.
How we built it
Our architecture combines several layers:
- Swift (Frontend): A clean, mobile-friendly banking interface where users can check balances, speak to the AI, and see real-time insights.
- Python FastAPI (Backend): Handles communication between the app and AI, storing transactions and user profiles in a database.
- Gemini API (Reasoning Engine): Processes user intent, generates contextual financial explanations, and produces human-like text responses.
- ElevenLabs API (Voice Layer): Converts Gemini’s response into a human, emotionally adaptive voice, choosing tone based on the user’s financial situation.
We also integrated a Memory Context Processor (MCP) that gives Guard continuity, it remembers past conversations and financial history to improve future responses.
Challenges we ran into
- Latency: Coordinating multiple APIs (Gemini + ElevenLabs + backend DB) while keeping response times low.
- Voice alignment: Making the AI sound natural and emotionally consistent with the user’s financial context.
- Context retention: Designing a memory system that lets Guard remember without leaking sensitive data.
- Time constraints: Integrating everything; UI, backend, and AI, within hackathon deadlines.
Accomplishments that we're proud of
We successfully merged AI reasoning, emotion, and real financial functionality into one cohesive prototype.
We created a system where the assistant not only analyzes spending but speaks with empathy and tone adapted to the user’s financial profile.
We’re proud of building a working voice-enabled financial guardian in such a short timeframe, one that feels truly human and redefines what a banking app can be.
What we learned
Building Guardian taught us that communication goes beyond words, tone and empathy are what make AI feel real.
We learned how to design multi-layered AI pipelines, integrate APIs efficiently, and fine-tune Gemini prompts to produce professional but human financial advice.
We also learned the importance of teamwork, compromise, and voice clarity, because sometimes, the smallest intonation can change an entire user experience.
What's next for Capital One Guardian
Next, we plan to expand Guardian’s memory system to enable deeper personalization and long-term learning.
We also want to integrate fraud prevention models that analyze transaction patterns across users, improving protection in real time.
Finally, we aim to release a public beta with broader voice customization, multilingual support, and a dashboard for visualizing spending and emotional feedback, taking Capital One Guardian closer to becoming the first truly empathetic AI in banking.

Log in or sign up for Devpost to join the conversation.