Inspiration

When I browse online, I sometimes find myself struggling to understand an article, or shopping online and torn between two choices (I mean, does the extra $10 really make a difference?). On one of those occasions, at the peak of my frustration, an idea came to me: an "online companion." Conversing and browsing are usually two separate activities. You scroll, type, and click, but you never really talk. I wanted to bridge that gap and create a browser that allows you to interact naturally through voice, combining browsing with conversational AI.


How I Built It

Since this is a desktop application and I was familiar with web technologies, Electron was the ideal choice. Its cross-platform compatibility and widespread use in enterprise-grade applications made it perfect for building a robust browser experience.

For the conversational voice interface, I used ElevenLabs’ text-to-speech and speech-to-text APIs, which provided a wide range of natural voices. For reasoning and AI-powered context awareness, I integrated Google Cloud Vertex AI, enabling the assistant to understand queries and provide intelligent responses.

The app features a modern interface with a standalone sidebar, designed to feel integrated and polished.


Challenges

Bringing together multiple AI services in a single application was an intricate task. One of the biggest challenges was designing a unique and cohesive user interface; the browser and AI sidebar needed to feel like a single, polished product rather than two separate components. Another challenge was reducing latency between different processes. I addressed this by implementing a logging system that tracked timestamps and durations, which allowed me to monitor and optimize communication between services. The codebase required iterative testing, and the logs provided a clear way to measure the impact of different parameters.


What I Learned

This project took me to the intersection of AI, desktop development, and user experience design. I gained experience integrating multiple AI services, understanding the nuances of Electron’s architecture, and deploying a cross-platform application. I also learned the importance of monitoring and optimizing interactions between APIs to maintain performance. Beyond the technical skills, I developed a stronger sense of product design, realizing how subtle UI choices can transform a tool into a companion-like experience.

Share this project:

Updates