Disable auto extract metadata from pdf

cancel
Showing results for 
Search instead for 
Did you mean: 
sanjaybandhniya
Intermediate

Disable auto extract metadata from pdf

I am exploring docker base alfresco version 7.2.0

I found link which will generate setup for alfresco.

https://github.com/Alfresco/alfresco-docker-installer.

I want to disable auto extract metadata from pdf. How Can I disable?

I tried below property but not working.

# pdfs
extracter.PDFBox.enabled=false
# office types
extracter.Office.doc.enabled=false
extracter.Office.xls.enabled=false 
extracter.Office.ppt.enabled=false
# images (no exif aspect)
extracter.TikaAuto.enabled=false
# docx,xlsx,pptx
extracter.Poi.enabled=false

I think extraction functionality is moved to transformation service.

2 Replies
signshah
Customer

Re: Disable auto extract metadata from pdf

Hi Sanjay,

Any luck on this?

I have a issue wiht my Alfresco 7.3.0 where the metadata extraction is not happening and also the thumbnails are not getting generated. 

openpj
Moderator
Moderator

Re: Disable auto extract metadata from pdf

I think that now you should override the engine_config.json inside the Transform Core AIO or in the specific transform service you are currently deploying in your Alfresco installation.

Here you have an example of how you can override the default configuration provided by the Alfresco Transform Service:

https://github.com/aborroy/alfresco-transform-service-3.0.0

Hope this helps Smiley Wink