Page MenuHomePhabricator

Combine the two different ngram-splitting algorithms into a single engine
ClosedPublic

Authored by epriestley on Apr 16 2020, 4:38 PM.
Tags
None
Referenced Files
F13377525: D21128.diff
Sat, Jun 29, 3:58 AM
F13366951: D21128.id50317.diff
Wed, Jun 26, 10:39 PM
F13350774: D21128.diff
Sun, Jun 23, 7:24 AM
F13337365: D21128.diff
Wed, Jun 19, 9:30 AM
F13324838: D21128.diff
Sat, Jun 15, 1:02 AM
F13312206: D21128.diff
Tue, Jun 11, 6:16 AM
F13305959: D21128.id.diff
Sun, Jun 9, 5:27 AM
F13304388: D21128.id50317.diff
Sat, Jun 8, 9:08 AM
Subscribers
None

Details

Summary

Ref T13501. Depends on D21127. With the "prefix" behavior removed in D21127, we now have two virtually identical copies of the same code.

The newer one in Ferret is better: it slices utf8 correctly and is slightly more efficient on large inputs. Pull it out and make all callers call into it.

Test Plan
  • Grepped for all affected symbols.
  • Ran bin/search index --force ... to reindex various objects (tasks, files).
  • Searched for things in the UI.

Diff Detail

Repository
rP Phabricator
Branch
search3
Lint
Lint Passed
Unit
Tests Passed
Build Status
Buildable 24131
Build 33228: Run Core Tests
Build 33227: arc lint + arc unit

Event Timeline

This revision was not accepted when it landed; it landed in state Needs Review.Apr 16 2020, 4:45 PM
This revision was automatically updated to reflect the committed changes.