(Early Draft) Add a function to normalize UTF8 for search indexing
Abandoned
Public

Authored by epriestley on Apr 16 2020, 4:42 PM.
Tags
None
Referenced Files
F18053916: D21129.id.diff
Mon, Aug 4, 5:49 AM
F18047638: D21129.diff
Sun, Aug 3, 12:45 PM
F18037355: D21129.diff
Sun, Aug 3, 2:33 AM
Subscribers
None

Details

Reviewers
None
Summary

See T13501. This can't move forward right now because "LIKE" in MySQL does not appear to be combining-character aware. Even if we normalize the ngram indexes so that a candidate document is found, the final "LIKE" query still runs against the unnormalized stored text and misses the document.

Once a fix for that is ready, this might be useful.
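
As a rough illustration of the problem (not the code in this diff), the sketch below uses Python's standard `unicodedata` module. The helper names `normalize_for_index` and `ngrams` are hypothetical, and Python's codepoint-wise `in` operator is only a stand-in for a "LIKE" that is not combining-character aware; actual MySQL behavior may depend on the collation in use.

```python
import unicodedata


def normalize_for_index(text):
    """Hypothetical helper: compose combining sequences (NFC)."""
    return unicodedata.normalize("NFC", text)


def ngrams(text, n=3):
    """Hypothetical helper: the set of all n-character substrings."""
    return {text[i:i + n] for i in range(len(text) - n + 1)}


# The same visible word, encoded two ways.
query = "caf\u00e9"         # "é" as one precomposed code point
stored_doc = "cafe\u0301"   # "e" followed by COMBINING ACUTE ACCENT

# Step 1: the ngram index is built from normalized text, so the
# document is found as a candidate.
query_grams = ngrams(normalize_for_index(query))
doc_grams = ngrams(normalize_for_index(stored_doc))
is_candidate = query_grams <= doc_grams

# Step 2: the final check compares against the raw stored text,
# codepoint by codepoint, and the candidate is rejected.
final_match = query in stored_doc

# Normalizing both sides of the final check would make them agree.
final_match_normalized = (
    normalize_for_index(query) in normalize_for_index(stored_doc)
)
```

Under these assumptions the document survives the ngram step but fails the final comparison, which is why normalizing the index alone is not enough.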

Test Plan

N/A

Diff Detail

Repository
rARC Arcanist
Branch
normal1
Lint
Lint Passed
Unit
Tests Passed
Build Status
Buildable 24132
Build 33230: Run Core Tests
Build 33229: arc lint + arc unit