Inspiration
I like to bring AI tools to disadvantaged people. This spark the light when I see video classification capabilities of PyTorch.
What it does
It uses pre-trained PyTorch video classification model and speak out loud what it see on iPhones.
How we built it
I built it easily using PyTorch iOS sample projects on Github and add it text-to-speech capability of iPhone. When person touch and hold finger on screen, it starts to speak to prevent unnecessary sound.
Challenges we ran into
I need to understand how sample projects work, thanks to PyTorch's developers they are very easy to understand and have a clean code.
Accomplishments that we're proud of
I created this project under two hours.
What we learned
I learned that PyTorch offers great capabilities for on device machine learning.
What's next for Video Recognizer for Blind
I can add image classification model to recognize objects in images and guide user by informing where are the objects.
Log in or sign up for Devpost to join the conversation.