Inspiration
Over 2.2 billion people worldwide live with vision impairment. Most printed documents — letters, bills, medicine labels — remain inaccessible without help. Existing solutions like commercial braille displays cost $1,500+, require internet, or simply don't exist in Kazakhstan's assistive tech market. We built KOZAI to change that.
What it does
KOZAI is a portable offline document scanner for blind users. Point the camera at any printed document, and within seconds the text is read aloud via TTS and rendered in physical braille. No internet. No cloud. No assistance needed.
The user holds the camera over a document. The system detects the document automatically, waits for stillness via optical flow analysis, captures the sharpest frame, runs the full OCR pipeline, and outputs the result simultaneously to a speaker and a custom braille display.
How we built it
The server runs on Raspberry Pi 4B with a Flask REST API. The computer vision pipeline uses OpenCV for document detection (Canny edge detection + homography-based perspective correction), adaptive histogram equalization (CLAHE), Sauvola binarization, and Tesseract OCR with a multi-candidate evaluation system that picks the highest-confidence preprocessing output.
The braille display is custom-built using an ESP32-C3 SuperMini driving solenoids through logic gate transistors and resistors — one solenoid per braille dot. The display receives text from the RPi server via API and converts each character to its braille cell pattern in real time.
The full pipeline runs offline on consumer hardware costing under $100 total.
Challenges we ran into
Document detection in uncontrolled lighting was the hardest problem. We implemented an adaptive sharpness threshold that calibrates to ambient conditions in the first 15 frames, and a rectangularity filter using both minAreaRect ratio and cosine angle verification to reject trapezoids and partial views. OCR confidence on real-world photos plateaued around 71% — the main bottleneck is input image resolution, which we're addressing with closer capture distance and higher upscale factors.
Accomplishments we're proud of
- Full offline pipeline: camera → OCR → braille output, no internet required
- Custom braille display built from scratch with solenoids and ESP32
- Adaptive guided capture with vibration feedback for blind users
- 71% OCR confidence on real printed documents with a $35 computer
What we learned
Blind users cannot see if the camera is aligned — every piece of feedback must be tactile or audio. This shaped every design decision: vibration patterns for document detection, TTS segmentation for long texts, and automatic capture on stillness detection.
What's next
Upgrade to Orange Pi 5 with NPU for faster inference. Add Kazakh and Russian language support. Miniaturize into a wearable chest-mounted form factor. Submit to Kazakhstan's national assistive technology program.
Log in or sign up for Devpost to join the conversation.