Page MenuHomePhabricator

Support using the Porter stemmer when compiling search queries
ClosedPublic

Authored by epriestley on Nov 25 2016, 8:35 PM.
Tags
None
Referenced Files
Unknown Object (File)
Sun, Dec 15, 5:34 AM
Unknown Object (File)
Sat, Dec 7, 5:01 AM
Unknown Object (File)
Nov 26 2024, 1:15 PM
Unknown Object (File)
Nov 12 2024, 6:20 PM
Unknown Object (File)
Oct 22 2024, 11:41 PM
Unknown Object (File)
Oct 18 2024, 7:19 AM
Unknown Object (File)
Oct 12 2024, 3:19 PM
Unknown Object (File)
Oct 11 2024, 9:41 PM
Subscribers
None

Details

Summary

Ref T6740. Stemming reduces words to roots, so "deleting", "deletion", and "delete" all index as the root "delet". It's fine that roots aren't really words.

When users search for "delete", we'll search for "delet" intead, in an index which has already indexed all variants of "delete" as "delet".

(This gets a bit tricky when users mix quoted and unquoted terms, so we may need to search literal and stemmed indexes separately.)

Test Plan

Added unit tests; ran unit tests.

Diff Detail

Repository
rPHU libphutil
Lint
Lint Not Applicable
Unit
Tests Not Applicable

Event Timeline

epriestley retitled this revision from to Support using the Porter stemmer when compiling search queries.
epriestley updated this object.
epriestley edited the test plan for this revision. (Show Details)
epriestley added a reviewer: chad.
chad edited edge metadata.
This revision is now accepted and ready to land.Nov 25 2016, 9:23 PM
This revision was automatically updated to reflect the committed changes.