...
Please be aware of the important information at the bottom of the page.
Content extraction with Microsoft Azure Cognitive Services
The content extraction of PDFs and images relies on Microsoft Azure Cognitive Services. Thus, a Computer Vision resource in Azure must be used.
...
Info |
---|
The metafield ‘Asset content' is automatically created when installing or upgrading to 5.5. The field is created in metagroup 'Content’, and it is very important that this exact field is used in the configuration as the GUID of the metadata field is used as a dependency in the system. GUID: 4A8ED71B-574A-43BB-A35E-8826598CF36F |
Including asset content in searches (when using Solr)
The extracted content of assets can be included in freetext searches by adding the metafield ‘Asset content' in the search DigiZuite_System_Framework_Search as a freetext input parameter. The 'Asset content’ metafield can be added as a freetext input parameter by following these steps:
...
Info |
---|
The content of existing assets can be extracted by republishing the assets. |
Including asset content in searches (when using ElasticSearch in MM 5.6+)
In Media Manager, go into ‘General settings → ‘Asset search’, and then add the field ‘Asset content’ to 'Freetext search fields’.
If the ‘Asset content’ field is not available, make sure it’s available from the metadata editor in Media Manager and readable for all the people that need to search for it. Otherwise, the Search Engine will not index it.
...
Important information
Please be aware of the following when using the Computer Vision resource:
...