(Early Draft) Add a function to normalize UTF8 for search indexing
Abandoned
Public

Authored by epriestley on Apr 16 2020, 4:42 PM.
Tags
None
Subscribers
None

Details

Reviewers
None
Summary

See T13501. This can't move forward right now because "LIKE" in MySQL does not appear to be combining-character aware: even if we normalize the ngram indexes, the final "LIKE" query still misses the document.

Once a fix for that is ready, this might be useful.
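
To illustrate the failure mode described above, here is a minimal sketch in Python. It is not the code from this diff. The function names (`normalize_for_index`, `trigrams`) are hypothetical, the normalization shown (NFD decomposition, dropping combining marks, lowercasing) is only one plausible choice, and the plain substring test is a simplified stand-in for the final "LIKE" step rather than a model of MySQL's actual collation behavior.

```python
import unicodedata


def normalize_for_index(text):
    """Hypothetical normalizer: decompose, drop combining marks, lowercase."""
    decomposed = unicodedata.normalize("NFD", text)
    # unicodedata.combining() returns a nonzero class for combining marks.
    stripped = "".join(ch for ch in decomposed if not unicodedata.combining(ch))
    return stripped.lower()


def trigrams(text):
    """Return the set of 3-character ngrams in text."""
    return {text[i:i + 3] for i in range(len(text) - 2)}


# Document stores "e" followed by U+0301 COMBINING ACUTE ACCENT.
document = "Cafe\u0301 menu"
# Query uses the precomposed U+00E9 instead.
query = "caf\u00e9"

# Stage 1: the normalized ngram index finds the document as a candidate.
index = trigrams(normalize_for_index(document))
candidate = trigrams(normalize_for_index(query)) <= index

# Stage 2: an unnormalized substring check (stand-in for "LIKE") rejects it.
confirmed = query.lower() in document.lower()

print(candidate, confirmed)
```

Under these assumptions the normalized ngram lookup selects the document, but the unnormalized final comparison discards it, which is why normalizing the index alone would not be enough.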

Test Plan

N/A

Diff Detail

Repository
rARC Arcanist
Branch
normal1
Lint
Lint Passed
Unit
Tests Passed
Build Status
Buildable 24132
Build 33230: Run Core Tests
Build 33229: arc lint + arc unit