Inspiration
We built ThirdEye to help visually impaired individuals relive their memories in a meaningful way. Traditional photo albums rely on sight, leaving millions unable to independently access their cherished moments. We wanted to change that by making photos come alive through storytelling.
What it does
ThirdEye is a voice-enabled AI that turns photos into engaging, narrated stories. Just ask, “Tell me about my son’s trip to NYC,” and it retrieves relevant images, crafting a vivid story instead of just listing details.
How we built it
We combined speech recognition, AI-powered image analysis, and natural language generation to create a seamless, hands-free experience. Our cloud-based system integrates with Google Photos and Apple Photos, ensuring quick and scalable access.
Challenges we ran into
Making AI tell meaningful stories—not just robotic descriptions—was tough. We also worked hard to balance real-time processing, privacy concerns, and accessibility to create a smooth and reliable experience.
Accomplishments that we're proud of
A working prototype that truly brings memories to life.
A simple, accessible UI designed for ease of use.
Early testing with real users to refine the experience.
What we learned
Storytelling is powerful—people don’t just want descriptions, they want emotions. Also, user feedback is key, and balancing AI capabilities with real-world needs is a constant learning process.
What's next for ThirdEye
We’re working on:
Multi-language support to make ThirdEye globally accessible.
Offline AI processing for better privacy.
More personalization with facial recognition (user consent required).
Partnerships & outreach to get ThirdEye into the hands of those who need it most.
Built With
- fastapi
- javascript
- mysql
- python
- react
- whisperapi
Log in or sign up for Devpost to join the conversation.