Project Story
Inspiration
I've always been inspired by the opportunity to participate in Google's global challenge. Over the past two years, I've been actively using the company's ecosystem: from online tools like Gemini and NotebookLM to my Pixel 9 Pro. I wanted to test my own skills, challenge myself, and create a product that could compete at a high level.
What it does
Nexus is a multimodal AI agent that works in real time. It is capable of:
- Seeing the environment through the camera.
- Listening and understanding speech.
- Speaking with a natural voice.
- Creating content and solving tasks on the fly.
How I built it
I started with in-depth research of the project categories. First, I developed the "skeleton" of the architecture, and then gradually built up the functionality. The process involved constant testing, bug fixing, and optimizing the interaction between modules.
Challenges I ran into
The biggest problem was latency when receiving audio via the API. This was also my first serious experience with Google Cloud, which seemed difficult at first. With the help of the documentation and AI tools, I was able to set up hosting and the environment.
To reason about response time, I broke the total latency into three parts:
$$T_{total} = T_{network} + T_{inference} + T_{processing}$$
where $T_{inference}$ is the time for the model to generate a response.
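The breakdown above can be measured stage by stage. The following is a minimal Python sketch, not the code used in Nexus: the `LatencyBudget` class and the stage names are hypothetical, and the `time.sleep` calls are placeholders for the real network, inference, and processing steps.

```python
import time
from contextlib import contextmanager


class LatencyBudget:
    """Collects per-stage wall-clock timings for a single request."""

    def __init__(self):
        self.stages = {}

    @contextmanager
    def measure(self, stage):
        start = time.perf_counter()
        try:
            yield
        finally:
            # Accumulate, so one stage can be timed in several pieces.
            elapsed = time.perf_counter() - start
            self.stages[stage] = self.stages.get(stage, 0.0) + elapsed

    def total(self):
        # T_total is the sum of all measured stages.
        return sum(self.stages.values())


budget = LatencyBudget()

with budget.measure("network"):
    time.sleep(0.01)   # placeholder: API round trip
with budget.measure("inference"):
    time.sleep(0.02)   # placeholder: model generating a response
with budget.measure("processing"):
    time.sleep(0.005)  # placeholder: local audio handling

for stage, seconds in budget.stages.items():
    print(f"T_{stage}: {seconds * 1000:.1f} ms")
print(f"T_total: {budget.total() * 1000:.1f} ms")
```

Timing each stage separately shows which term dominates the total, which helps decide where to look first when chasing audio delays.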
Accomplishments that I'm proud of
I'm proud that I built a well-developed product: not just a prototype, but a working test version. It proved to me that I can create complex systems and gave me a huge boost for further learning.
What I learned
My main takeaway is hands-on experience with Google Cloud Platform. I learned how to deploy models, manage resources, and integrate complex APIs into a single application. Thanks to Google for the opportunity to use the trial period to implement this idea.
What's next for Nexus
Next up: fixing audio delays, adding new "personas" (agents), and growing the community. The project is open source, and I plan to actively improve it. The future belongs to voice, not text, and Nexus is my step in that direction.
Built With
- adk
- cloud
- gemini-api
- google-cloud
- node.js
- python
- vertex-ai
