Inspiration
The motivation behind this project was to create a seamless image-to-text extraction service that simplifies digitizing printed and handwritten text. Whether it is used for document scanning, data entry automation, or accessibility support, OCR technology improves productivity. Using the Azure Vision API, I set out to build an efficient tool that extracts text from images in multiple file formats.
What it does
Tech Stack
Frontend: React (Vite), TailwindCSS
Backend: Azure Functions for API calls
OCR Service: Azure Vision API for text extraction
Authentication (Optional): Azure Active Directory
Deployment: Azure App Service
How It Works
Upload Image → Users upload an image (JPG, PNG, etc.).
Send to Azure Vision API → The image is processed for text extraction.
Extracted Text Displayed → The application shows the extracted text in a structured format.
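A minimal sketch of the three steps above, not the project's actual code. It assumes the Image Analysis 4.0 REST operation (`imageanalysis:analyze` with `features=read`, api-version `2023-10-01`); the endpoint, the key, and the names `buildOcrRequest` and `extractLines` are hypothetical. The extractor also accepts the older Read 3.2 response shape and skips missing fields instead of failing on them.

```typescript
// Hypothetical helper names; the endpoint path and api-version are assumptions.
interface OcrRequest {
  url: string;
  method: "POST";
  headers: Record<string, string>;
  body: Uint8Array;
}

// Step 1 and 2: wrap the uploaded image bytes in a request for the OCR service.
function buildOcrRequest(endpoint: string, key: string, image: Uint8Array): OcrRequest {
  const base = endpoint.replace(/\/+$/, ""); // tolerate a trailing slash
  return {
    url: `${base}/computervision/imageanalysis:analyze?api-version=2023-10-01&features=read`,
    method: "POST",
    headers: {
      "Ocp-Apim-Subscription-Key": key,
      "Content-Type": "application/octet-stream",
    },
    body: image,
  };
}

interface OcrLine { text?: string }
interface OcrGroup { lines?: OcrLine[] }
interface OcrResponse {
  readResult?: { blocks?: OcrGroup[] };            // 4.0-style response
  analyzeResult?: { readResults?: OcrGroup[] };    // 3.2-style response
}

// Step 3: collect the recognized lines, ignoring absent or empty fields.
function extractLines(response: OcrResponse): string[] {
  const groups: OcrGroup[] =
    response.readResult?.blocks ?? response.analyzeResult?.readResults ?? [];
  const lines: string[] = [];
  for (const group of groups) {
    for (const line of group.lines ?? []) {
      const text = (line.text ?? "").trim();
      if (text.length > 0) lines.push(text);
    }
  }
  return lines;
}
```

In an Azure Function, the returned object could be passed to `fetch(request.url, request)` and the parsed JSON handed to `extractLines`; joining the result with newlines gives text the frontend can display.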
Challenges I ran into
Handling OCR Response Variability → Different image qualities produced different response structures, requiring careful text extraction logic.
File Size Optimization → Large images took longer to process, so compression techniques were explored.
API Rate Limits → Azure's free tier enforces rate limits, so retries and optimizations were necessary.
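Two of the mitigations above can be sketched as follows. This is an illustration under assumptions rather than the code used in the project: `fitWithin` computes the target size for downscaling a large image before upload (the resize itself would happen elsewhere, for example on a canvas), and `withRetry` resends a request that came back with HTTP 429, waiting with exponential backoff. All names, the 500 ms base delay, and the 8 s cap are hypothetical.

```typescript
// Scale (width, height) down so the longer side is at most maxSide; never upscale.
function fitWithin(width: number, height: number, maxSide: number): { width: number; height: number } {
  const longest = Math.max(width, height);
  if (longest <= maxSide) return { width, height };
  const scale = maxSide / longest;
  return {
    width: Math.max(1, Math.round(width * scale)),
    height: Math.max(1, Math.round(height * scale)),
  };
}

// Delay before the next retry: use the server's hint when one is given,
// otherwise double the base delay per attempt, up to a cap.
function backoffDelayMs(attempt: number, retryAfterSeconds?: number, baseMs = 500, capMs = 8000): number {
  if (retryAfterSeconds !== undefined && retryAfterSeconds >= 0) {
    return retryAfterSeconds * 1000;
  }
  return Math.min(capMs, baseMs * Math.pow(2, attempt));
}

// Minimal view of a response: the status code plus an optional retry hint.
interface HttpResult { status: number; retryAfterSeconds?: number }

// Call send() up to maxAttempts times, retrying only while the status is 429.
async function withRetry<T extends HttpResult>(
  send: () => Promise<T>,
  maxAttempts = 4,
  sleep: (ms: number) => Promise<void> = (ms) => new Promise<void>((resolve) => setTimeout(resolve, ms)),
): Promise<T> {
  let result = await send();
  for (let attempt = 0; result.status === 429 && attempt < maxAttempts - 1; attempt++) {
    await sleep(backoffDelayMs(attempt, result.retryAfterSeconds));
    result = await send();
  }
  return result;
}
```

For example, `fitWithin(4000, 3000, 2000)` yields a 2000 × 1500 target, and with the default settings the waits between retries are 500 ms, 1000 ms, and 2000 ms unless the service supplies its own retry hint.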
What's next for Azurebased OCR Application
I plan to integrate webcam support so the application can capture images in real time.
Built With
- azure
- react-vite
- tailwindcss

