Inspiration
With the idea of "Inclusion" be created, many people are devoting to creating methods for those visual impaired to be able to surf the Internet. However, the biggest barrier for blind is the Images. They can not enjoy the image on any website. Some website add short descriptions, but tons of details were left out. How can we solve this to enable those blind to see "in detail", like a normal person with normal sight does?
What it does
This tool can cut a picture into serval sections by identifying "objects", and then it will make a dictionary, the keys are the name of the objects, and the values are the specific descriptions for the object. if the user wants more detail , just select that object, and the object would be cut into different sections to offer information in a deeper level. By a recursive process ,we successfully simulitate the process for a normal human to admire a picture.
How we built it
We use chatgpt4-vision to achieve image recognition. And I write the prompt and use FEW-SHOT and Chain of Thought to tell it the rules and boost its productivity.the programming language we use is python
Challenges we ran into
QUITE HARD TO THINK UP THIS
Accomplishments that we're proud of
we make our contribution to build a society of inclusion. and I am glad to use high-end tech to boost human's wellbeing
What we learned
how to use the chatgpt api
What's next for TEAM(119,VISUALVOX)
a REALLY APPLICATION ON WEB WITH AUDIO!
Built With
- chatgpt
- python
Log in or sign up for Devpost to join the conversation.