OCR implementation for scanned documents on windows 10

cancel
Showing results for 
Search instead for 
Did you mean: 
sneha-lolge
Active Member

OCR implementation for scanned documents on windows 10

Hi all,

Currently,I am working on implementing OCR in alfresco for any scanned documents. e.g. scanned images, pdfs. To ellaborate more on this, any scanned file uploaded to a folder in alfresco should be searchable in the repository using the text provided.  I have referred this https://www.surevine.com/a-little-alfresco-tesseract-ocr-integration/, but this works only for image files. Can someone please suggest any addon/integration that can work for any type of scanned document for windows platform.

Configurations installed:

Alfresco 5.2 community edition

windows 10 

Tesseract ocr version 5.0

 

Thankyou..!