Alfresco integrate OCR

Showing results for 
Search instead for 
Did you mean: 
Member II

Alfresco integrate OCR

I try integrate OCR in Alfresco but no success. I instal


and after that i instal tesseract in C:\Program Files (x86)\Tesseract-OCR

Then I try to integrate tesseract in alfresco,

first i create in C:\alfresco-community bat file OCR.bat

REM to see what happens echo from %1 to %2 >>C:\tmp\ocrtransform.log  copy /Y %1 C:\TMP\%~n1%~x1  REM  call tesseract and redirect output to $TARGET "C:\Program Files (x86)\Tesseract-OCR\tesseract.exe" C:\TMP\%~n1%~x1 %~d2%~p2%~n2 -l eng del C:\TMP\%~n1%~x1

Then three transformation xml added in "C:\alfresco-community\tomcat\shared\classes\alfresco\extension"

ocrpng-transform-context.xml  ocrjpeg-transform-context.xml  ocrtiff-transform-context.xml

then create an empty file name with ocrtransform.log in "C:\TMP"

and Downolad this jar from this link( and place this jar in this path "C:\Alfresco\tomcat\lib".

I use this link

but not working. Where I make mistake? Do I need to add anything else?

Any suggestion, idea?

1 Reply

Re: Alfresco integrate OCR

The blog post you found is quite old and based on an older Alfresco version. The changes suggested there may not be compatible with the newest version of Alfresco that you are using. Also, the distribution method for that extension / customisation is quite unprofessional. You should probably stick to more professional solutions, like GitHub - keensoft/alfresco-simple-ocr: Simple OCR action for Alfresco  from the company ‌ works for.