Page MenuHomePhabricator

Combine the two different ngram-splitting algorithms into a single engine
ClosedPublic

Authored by epriestley on Apr 16 2020, 4:38 PM.
Tags
None
Referenced Files
F15197245: D21128.diff
Sat, Feb 22, 10:18 PM
Unknown Object (File)
Sun, Feb 9, 8:59 AM
Unknown Object (File)
Sun, Feb 9, 8:59 AM
Unknown Object (File)
Sun, Feb 9, 8:59 AM
Unknown Object (File)
Sun, Feb 9, 8:59 AM
Unknown Object (File)
Tue, Jan 28, 2:46 AM
Unknown Object (File)
Sun, Jan 26, 10:02 PM
Unknown Object (File)
Sat, Jan 25, 7:08 AM
Subscribers
None

Details

Summary

Ref T13501. Depends on D21127. With the "prefix" behavior removed in D21127, we now have two virtually identical copies of the same code.

The newer one in Ferret is better: it slices utf8 correctly and is slightly more efficient on large inputs. Pull it out and make all callers call into it.

Test Plan
  • Grepped for all affected symbols.
  • Ran bin/search index --force ... to reindex various objects (tasks, files).
  • Searched for things in the UI.

Diff Detail

Repository
rP Phabricator
Branch
search3
Lint
Lint Passed
Unit
Tests Passed
Build Status
Buildable 24131
Build 33228: Run Core Tests
Build 33227: arc lint + arc unit

Event Timeline

This revision was not accepted when it landed; it landed in state Needs Review.Apr 16 2020, 4:45 PM
This revision was automatically updated to reflect the committed changes.