we should have the option to normalize text input through the input forms -- most of our services (such as tokenizing splitting) require precombined unicode chars to function properly and we should normalize on input if we can.
found by @vgorman1 -- her μήδε used a combining char (combining \u03b7\u0301) but the code looks for the precombined character (\u03ae).
we should have the option to normalize text input through the input forms -- most of our services (such as tokenizing splitting) require precombined unicode chars to function properly and we should normalize on input if we can.
found by @vgorman1 -- her μήδε used a combining char (combining \u03b7\u0301) but the code looks for the precombined character (\u03ae).