Motion Voice

Project
Project

Inspiration

Communication shouldn't be a barrier. Millions of Deaf and hard-of-hearing individuals rely on American Sign Language (ASL), yet real-time communication with hearing individuals remains a challenge. We wanted to build a tool that uses vision and AI to enable seamless interaction — hands speak, and machines listen and speak back.

What it does

We developed a real-time ASL-to-speech translator using computer vision, deep learning, and hardware feedback. Our system captures hand gestures through a camera, classifies them using an LSTM model, and outputs both visual and audio feedback — making communication more intuitive and accessible.

How we built it

We built using Media Pipe

Challenges we ran into

Not enough time to both train the model and Find better datasets.
The datasets is too small and we have to take video to make our own.
Original big LCD screen is broken:(
want to integrate camera but we couldn't find a affordable embedded camera in person and it has to be shipped.
Originally we are planning to use single board computer but since we don't need to use embedded camera, LSTM and python libraries is relatively harder to been built on SBC. Therefore we chose PC and combine with MCU
Model Accuracy is not very high ## Accomplishments that we're proud of
Sucessfully integrated the core function of AI algorithms and embedded solutions.
Learned a lot of things!
Integration is not easy but we have managed to done several components together ## What we learned
Futher reinforce the AI skill
Explored the possibility between software and embedded systems.
To manage our time better ## What's next for Motion Voice

Built With

gemini
google-text-to-speech
hardware
lstm
mediapipe
opencv
python
pytorch
stm32

Updates

enthan3 Hu started this project — Mar 23, 2025 09:00 AM EDT

Leave feedback in the comments!

Log in or sign up for Devpost to join the conversation.