Comparison Profile

Compared Types

Text comparison

Decompose complex characters

Activate to decompose complex or special characters in into basic characters. Complex characters are for instance ligatures like 'fi' which will be decomposed into 'fi'. Furthermore special character like long or short hyphens will be normalized to their base character.

Property
Property NameDescription
TRANSFORM_OPERATIONSAdd REPALCE_IDENTICAL to the comma separated list to enable. The default value is enabled
FILTERSAdd TEXTTRANSFORM to the comma separated list to enable. The default value is enabled

Equalize character recognition mistakes

Activate to correct typical text recognition mistakes. An example for a common ambiguousness in text recognition is the character 'm' and the syllable 'rn' which appear very similar depending on print quality and font.

Property
Property NameDescription
TRANSFORM_OPERATIONSAdd REPLACE_CONFUSABLES to the comma separated list to enable. The default value is disabled
FILTERSAdd TEXTTRANSFORM to the comma separated list to enable. The default value is enabled