Page MenuHomePhabricator

Support using the Porter stemmer when compiling search queries
ClosedPublic

Authored by epriestley on Nov 25 2016, 8:35 PM.
Tags
None
Referenced Files
Unknown Object (File)
Tue, Apr 9, 1:06 PM
Unknown Object (File)
Tue, Apr 9, 12:32 PM
Unknown Object (File)
Tue, Apr 2, 1:03 AM
Unknown Object (File)
Mon, Apr 1, 6:47 AM
Unknown Object (File)
Sun, Mar 31, 12:04 PM
Unknown Object (File)
Sun, Mar 31, 12:07 AM
Unknown Object (File)
Sat, Mar 30, 6:48 PM
Unknown Object (File)
Thu, Mar 28, 6:02 PM
Subscribers
None

Details

Summary

Ref T6740. Stemming reduces words to roots, so "deleting", "deletion", and "delete" all index as the root "delet". It's fine that roots aren't really words.

When users search for "delete", we'll search for "delet" intead, in an index which has already indexed all variants of "delete" as "delet".

(This gets a bit tricky when users mix quoted and unquoted terms, so we may need to search literal and stemmed indexes separately.)

Test Plan

Added unit tests; ran unit tests.

Diff Detail

Repository
rPHU libphutil
Lint
Lint Not Applicable
Unit
Tests Not Applicable

Event Timeline

epriestley retitled this revision from to Support using the Porter stemmer when compiling search queries.
epriestley updated this object.
epriestley edited the test plan for this revision. (Show Details)
epriestley added a reviewer: chad.
chad edited edge metadata.
This revision is now accepted and ready to land.Nov 25 2016, 9:23 PM
This revision was automatically updated to reflect the committed changes.