Native and Scanned PDF Segregation

Inspiration

The main inspiration behind this idea is "Most of the organisations avoid using PDF related automation because of OCR & its accuracy, and they miss to utilize wonderful functionality of UiPath which can extract the data with 100% accuracy from Native PDFs, with this snippet they will be able to segregate the PDFs based on its type(Native or Scanned)."

What it does

This snippet can segregate the scanned and native PDFs from a given set of PDFs in their respective folder so that other automation can process the further tasks like reading the PDF document with OCR from the Scanned folder & Reading data with the Native or Full-Text method from Native Folder.

How I built it

Built using Uipath activities

Accomplishments that I'm proud of

It segregates the PDFs files with 100% accuracy

What I learned

I learned almost every automation is possible using UiPath, You just need to identify the way of doing it.

What's next for Native and Scanned PDF Segregation

The files which have been moved under scanned folder can be utilized by another automation which has The OCR Engines activities Like Abbyy Flexi capture, Microsoft OCR or Google OCR to extract the information from PDFs, And the files which have been moved under Native Folder can be utilized by another automation which consists of Full text or Native methods to extract the data from PDFs. Or Anyone can use it or modify it evolve and develop further activities associated with PDFs.