A&D INNOVATORS – AI Braille Audio Converter

Our project, AI Braille Audio Converter, is an innovative solution designed to make education more inclusive for visually impaired students. The idea was born from observing the struggles that students face when accessing printed educational materials such as books, notes, and worksheets. Many assistive tools are either expensive or limited to certain languages, making it difficult for students in rural and underserved communities to learn independently.

We envisioned a smartphone-based tool that anyone could use easily—no extra hardware, no complicated setups—just a device they already have in their hands. By leveraging Optical Character Recognition (OCR), Text-to-Speech (TTS), and Braille conversion algorithms, our app transforms printed text into audio and tactile feedback, breaking barriers in education.

Inspiration

The idea came to us when we learned about the lack of affordable learning tools for students with visual impairments. Many families cannot afford expensive assistive devices, and available apps do not support regional languages, leaving students behind. We wanted to create something simple, practical, and scalable, where technology helps everyone learn, regardless of their abilities or location.

What We Learned

Throughout this project, we gained hands-on experience with: OCR technology using Tesseract.js to extract text from images. Text-to-Speech APIs, including the Web Speech API and Google TTS, to provide real-time audio feedback. Braille pattern generation, utilizing Unicode Braille characters and vibration feedback. Accessibility design principles, integrating screen readers and voice commands for seamless navigation. Collaboration and problem-solving in a team setting, especially while testing on real devices.

We also learned how technology can be aligned with empathy—designing with users in mind, not just building software.

Challenges We Faced

Handling varied handwriting and fonts while using OCR required constant tuning and dataset expansion. Supporting multiple languages for Text-to-Speech output was technically challenging, especially in regions with dialect variations. Ensuring the Braille output was intuitive for beginners without prior experience needed careful user testing. Designing an app that works smoothly on low-end smartphones with limited processing power. Balancing user privacy and offline processing to reduce reliance on network connectivity while maintaining performance.

Final Thought

This project is not just a tool—it’s a step toward making education accessible to all. We believe technology should empower, not exclude, and with AI, we are shaping a future where learning knows no boundaries.

Updates

ARCHANA GURUSAMY started this project — Sep 10, 2025 01:51 PM EDT

Leave feedback in the comments!

Log in or sign up for Devpost to join the conversation.