Page Comparison

...

It is possible extract the text contents of PDFs and images using Optical Character Recognition (OCR). The contents can then be included in searches.

...

Parameter	Description
OcrKey	The key from the Computer Vision resource created above.
OcrServerUri	The URI from the Computer Vision resource created above. It is called “Endpoint” in the “Keys and endpoints” section of the Computer Vision resource.
OcrExtractFromPdf	If true, the text contents of PDF files are extracted when the PDF files are uploaded to the DAM.
OcrExtractFromImage	If true, the text contents of images are extracted when the images are uploaded to the DAM.
OcrLetAzureRequestFiles	If false, we explicitly upload files to the Computer Vision client. Otherwise, Azure will request the files from the DAM Center. Setting this to true is expected to be more efficient, but it requires that the DAM Center can be accessed by Azure. Thus, ensure that the DAM Center is not behind a strict firewall if this is set to true.
OcrTaskDelayLength	We regularly check the status of ongoing content extractions in the Computer Vision client. This gives the time interval between each check. The larger the time interval is, the less requests are made to Azure. However, it then also takes more time for the extracted contents of files to be available in the “Asset Content” metafield. You most likely don’t have to change this.

Once the information has been provided, and the “appsettings.json”-file has been saved, the contents of PDFs and/or images are extracted when PDFs/images are uploaded.

2. Including asset contents in searches

...

Versions Compared

Old Version 8

New Version 9

Key

2. Including asset contents in searches