Inspiration
We built EzSpeak because of how diverse the world we live in truly is! Living in Miami (one of the most diverse places in the WORLD), surrounded by a vibrant mix of cultures, ethnicities, and constant language switching, it's hard to connect with everyone when you're faced with language barriers. The EzSpeak team took that as a challenge to lower the barriers between people who don't share a native tongue. As globalization and digital interconnectedness accelerate, we believe everyone should be able to talk to anyone, anytime, anywhere!
What it does
- EzSpeak listens to the audio from your current browser tab.
- The audio is sent securely to Microsoft Azure Cognitive Services Speech, which powers:
  - Speech‑to‑text (captions)
  - Translation (into your chosen language)
  - Text‑to‑speech (optional AI voice)
- Results appear in Chrome’s Side Panel so you can read along and, if you want, hear the translated voice.
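The side panel has to blend two kinds of results: interim ("recognizing") hypotheses that keep changing, and final ("recognized") text that should stick. A minimal sketch of that caption buffer, with hypothetical names (`makeCaptionBuffer` and the event shape are illustrative, not EzSpeak's actual code):

```javascript
// Maintains the caption text shown in the side panel.
// Final results are appended permanently; interim results are shown
// provisionally and replaced as recognition refines them.
function makeCaptionBuffer() {
  const finals = []; // committed sentences
  let interim = "";  // current in-progress hypothesis
  return {
    push(event) {
      if (event.isFinal) {
        finals.push(event.text); // commit the final transcript
        interim = "";            // clear the provisional text
      } else {
        interim = event.text;    // overwrite the previous hypothesis
      }
    },
    render() {
      return [...finals, interim].filter(Boolean).join(" ");
    },
  };
}
```

Each time the recognizer fires an event, the extension would call `push` and re-render the panel with `render()`, so readers see live captions that settle into stable text.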
How we built it
We built EzSpeak mainly in JavaScript, which handles everything from capturing audio to updating the UI. It uses Chrome's tabCapture API to grab sound from the active tab, then processes it with the Web Audio API to downsample it into the format Azure expects. JavaScript streams these audio chunks to Azure's Speech SDK using async functions and listens for real-time responses. As transcripts, translations, and AI voice data come back, JS updates the Chrome side panel instantly and plays audio if enabled, while also saving user settings such as language in chrome.storage.
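The downsampling step above can be sketched as a pure function. This is a simplified illustration, not EzSpeak's actual implementation: the Web Audio API delivers Float32 samples at the AudioContext's rate (commonly 48 kHz), while Azure's Speech SDK expects 16 kHz, 16-bit PCM mono, and the function name is our own.

```javascript
// Convert Float32 audio at inputRate (e.g. 48000) to 16-bit PCM at 16 kHz.
function downsampleTo16kPcm(float32Samples, inputRate, targetRate = 16000) {
  const ratio = inputRate / targetRate;
  const outLength = Math.floor(float32Samples.length / ratio);
  const out = new Int16Array(outLength);
  for (let i = 0; i < outLength; i++) {
    // Nearest-sample decimation; a production build would low-pass
    // filter first to avoid aliasing.
    const sample = float32Samples[Math.floor(i * ratio)];
    const clamped = Math.max(-1, Math.min(1, sample));
    // Scale [-1, 1] floats to the signed 16-bit range.
    out[i] = clamped < 0 ? clamped * 0x8000 : clamped * 0x7fff;
  }
  return out;
}
```

The resulting `Int16Array` chunks can then be written into the SDK's push audio stream as they are produced.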
Challenges we ran into
A challenge we ran into was determining the right model to use. AI is a rapidly changing field, and real-time translation is a very fresh problem space. With limited resources and tools, live translation and AI voice generation proved difficult. We scrapped much of our original planning and had to come up with new solutions in limited time. However, with pure determination and relentless research and problem-solving, we decided to go with Azure AI Speech for the speech solutions it offers.
Accomplishments that we're proud of
We are proud of the whole project, as well as the idea itself. This wasn't just a hackathon project for us; it was a real idea we had a lot of passion for, long before we wrote a single line of code. This project allowed us, and others, to speak with people we naturally had a tough time communicating with, and to break those barriers for the first time.
What we learned
We learned so many new things, from new software to new tools. For most of us, though, this was our first hackathon, so the main thing we took away was how to work with a group of new people under a time crunch. That squeeze of pressure gave us the extra push to go the distance and finish this project.
What's next for EzSpeak
- Better AI voice timing for tighter synchronization with the original speaker.
- Automatic AI voice selection by analyzing speaker characteristics; optional manual voice selection per session.
- On-the-fly language detection: auto-switch translation target and AI voice when the spoken language changes, no extension restart needed.
- Multi-speaker, multi-language meeting support: speaker separation and per-listener language output.
Built With
- azure
- azurecognitiveservices
- chrome
- chromeextensionapi
- css
- html
- javascript
- webaudioapi



