Page MenuHomePhabricator

(Early Draft) Add a function to normalize UTF8 for search indexing
AbandonedPublic

Authored by epriestley on Apr 16 2020, 4:42 PM.
Tags
None
Referenced Files
F15489405: D21129.id50314.diff
Fri, Apr 11, 8:42 AM
F15471162: D21129.id.diff
Sat, Apr 5, 3:42 AM
F15468860: D21129.diff
Fri, Apr 4, 6:58 AM
F15423498: D21129.diff
Mar 22 2025, 3:44 PM
F15420489: D21129.id50314.diff
Mar 21 2025, 2:23 PM
Unknown Object (File)
Feb 18 2025, 4:28 PM
Unknown Object (File)
Feb 13 2025, 6:02 PM
Unknown Object (File)
Feb 11 2025, 11:08 AM
Subscribers
None

Details

Reviewers
None
Summary

See T13501. This can't move forward right now because "LIKE" in MySQL appears to not be combining-character aware, so even if we normalize the ngram indexes we miss the document on the final "LIKE" query.

Once a fix for that is ready, this might be useful.

Test Plan

N/A

Diff Detail

Repository
rARC Arcanist
Branch
normal1
Lint
Lint Passed
Unit
Tests Passed
Build Status
Buildable 24132
Build 33230: Run Core Tests
Build 33229: arc lint + arc unit