[MNT-15909] CLONE - Keyword tags generated from metadata extraction are formed into a single string rather than split on delimiter
Created: 22-Mar-16  Updated: 20-Jul-16  Resolved: 24-Mar-16

Status: Closed
Project: Service Packs and Hot Fixes
Component/s: Tags and Categories, Tika, POI, and Metadata Extraction
Affects Version/s: 5.1
Fix Version/s: 5.1.1

Type: Service Pack Request
Reporter: Alex Strachan
Assignee: Closed Bugs (Inactive)
Resolution: Fixed
Votes: 0
Labels: rn511
Remaining Estimate: 0 minutes
Time Spent: 2 hours, 30 minutes
Original Estimate: Not Specified

Attachments: Bacon Ipsum.pdf, TikaAutoMetadataExtracter.properties, after_fix.png, error.png, metadata-context.xml
Issue Links:
is clone of MNT-15497 Keyword tags generated from metadata ... Closed
relates to MNT-13655 Just first keyword of the IPTC keywor... Closed
Bug Priority: Category 2
ACT Numbers:


Build Location: https://releases.alfresco.com/Enterprise-5.1/5.1.1/5.1.1/build-00138/ALL/


Multiple keyword tags are not split on the comma delimiter. Instead, they are added as one long tag that includes the commas.

In 5.0.1 the behaviour is different: only the first word is used as a tag and the rest are ignored.

Steps to reproduce
  1. Install Alfresco
  2. Place the attached 'metadata-context.xml' and 'TikaAutoMetadataExtracter.properties' into ~/shared/classes/alfresco/extension
  3. Upload various documents containing metadata to the repository (example PDF attached)
  4. Browse to the document library, select a document and view its tags

Expected behaviour

The keyword string is split on the delimiter and an individual tag is created for each keyword.

Observed behaviour

Metadata keywords are not split on the delimiter (comma). They appear as one long tag made up of all the keywords (see attached error.png).
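
The difference between the observed and expected behaviour can be sketched in a few lines of Python. This is an illustration only, not the Alfresco extractor code: the function name `split_keywords` and the sample keyword string are hypothetical, and trimming whitespace and dropping empty entries are assumptions about what a reasonable split would do.

```python
def split_keywords(raw, delimiter=","):
    """Split a raw keyword string into individual, trimmed, non-empty tags.

    Hypothetical helper for illustration; not part of Alfresco.
    """
    return [part.strip() for part in raw.split(delimiter) if part.strip()]


# Hypothetical value of a document's keywords metadata field.
raw_keywords = "bacon, ipsum, dolor"

# Observed: the whole string becomes a single tag, commas included.
observed = [raw_keywords]

# Expected: one tag per keyword.
expected = split_keywords(raw_keywords)

print(observed)
print(expected)
```

With the sample value, the observed list holds a single entry while the expected list holds three separate tags.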

Generated at Fri Apr 23 12:52:43 BST 2021 using Jira 7.13.15#713015-sha1:7c5ddd2c3e1709974ae9c48c17df8edd3919fe2c.