Inspiration

The motivation behind this project was to create a seamless image-to-text extraction service that simplifies digitizing printed and handwritten text. Whether it's for document scanning, data entry automation, or accessibility support, OCR technology enhances productivity. By leveraging Azure Vision API, I aimed to build an efficient tool for extracting text from images in multiple formats.

What it does

Tech Stack Frontend: React (Vite), TailwindCSS

Backend: Azure Functions for API calls

OCR Service: Azure Vision API for text extraction

Authentication (Optional): Azure Active Directory

Deployment: Azure App Service

How It Works Upload Image → Users upload an image (JPG, PNG, etc.).

Send to Azure Vision API → The image is processed for text extraction.

Extracted Text Displayed → The application shows the extracted text in a structured format.

Challenges I ran into

Handling OCR Response Variability → Different image qualities produced different response structures, requiring careful text extraction logic.

File Size Optimization → Large images took longer to process, so compression techniques were explored.

API Rate Limits → Azure's free-tier had rate limits, so retries and optimizations were necessary.

What's next for Azurebased OCR Application

I will try to integrate webcamera and capture realtime images in it

Built With

  • azure
  • react-vite
  • tailwindcss
Share this project:

Updates