🌟 Inspiration Millions of people with disabilities face daily barriers in communication, navigation, and understanding. We were inspired to build AccessAI to harness the power of Gemini’s multimodal AI to create a truly inclusive digital companion — one that empowers users to read, listen, speak, and understand the world around them, regardless of ability.

🚀 What it does AccessAI is a mobile app that uses Gemini’s vision, speech, and language capabilities to assist users with disabilities. It converts images to speech, transcribes and translates spoken language in real time, and simplifies complex documents into easy-to-understand summaries. Designed with accessibility-first principles, it helps users navigate menus, conversations, forms, and more — independently and confidently.

🛠️ How we built it We built AccessAI using React Native for cross-platform mobile support, integrating Gemini APIs for multimodal processing.

OCR + Vision API for image-to-text conversion

Text-to-Speech (TTS) for narration

Speech-to-Text + Translation for real-time conversation support

Simplification engine using Gemini’s language model to rephrase complex content We followed WCAG guidelines to ensure the UI is accessible, with large fonts, voice commands, and high-contrast visuals.

⚔️ Challenges we ran into Ensuring fast and accurate multimodal processing on mobile devices

Designing a UI that’s intuitive for users with diverse accessibility needs

Handling edge cases in OCR (e.g., blurry images, handwritten text)

Balancing performance with offline functionality for low-connectivity regions

🏆 Accomplishments that we're proud of Seamless integration of Gemini’s multimodal features into a real-time mobile experience

Building a working MVP that narrates images and translates speech in under 2 seconds

Creating a user-friendly interface that passed accessibility audits

Receiving positive feedback from test users with visual and hearing impairments

📚 What we learned Gemini’s multimodal capabilities are incredibly powerful when applied to real-world accessibility challenges

Designing for accessibility requires empathy, iteration, and constant user feedback

Simplicity and clarity in UX are just as important as technical innovation

🔮 What's next for AccessAI Expand support for more languages and dialects

Add gesture-based navigation and voice command control

Build a community-driven library of simplified guides and translated content

Partner with NGOs and accessibility advocates to deploy AccessAI globally

Launch on Play Store and App Store with open-source accessibility modules

Built With

Share this project:

Updates