Ref T12819. Two minor improvements from live data:
- Tokenize in a UTF8-aware way.
- When one document fails to index, kill the transaction explicitly (rather than leaving it hanging) so we don't cause other failures later.
Differential D18487
Make Ferret indexing more robust (UTF8, exception handling) epriestley on Aug 28 2017, 10:41 PM. Authored by Tags None Referenced Files
Subscribers None
Details Ref T12819. Two minor improvements from live data:
Created some UTF8 documents locally, indexed them, got clean results.
Diff Detail
|