Page MenuHomePhabricator

Support using the Porter stemmer when compiling search queries
ClosedPublic

Authored by epriestley on Nov 25 2016, 8:35 PM.
Tags
None
Referenced Files
F19908090: D16943.diff
Wed, Apr 1, 4:07 PM
F19701002: D16943.id40777.diff
Feb 10 2026, 10:01 PM
F19701000: D16943.id40777.diff
Feb 10 2026, 10:01 PM
F19674062: D16943.id.diff
Feb 8 2026, 7:02 AM
F19466065: D16943.diff
Jan 3 2026, 7:58 AM
F19466056: D16943.diff
Jan 3 2026, 7:55 AM
F19313412: D16943.id40783.diff
Dec 25 2025, 1:33 AM
F18939730: D16943.id40777.diff
Nov 11 2025, 7:15 AM
Subscribers
None

Details

Summary

Ref T6740. Stemming reduces words to roots, so "deleting", "deletion", and "delete" all index as the root "delet". It's fine that roots aren't really words.

When users search for "delete", we'll search for "delet" intead, in an index which has already indexed all variants of "delete" as "delet".

(This gets a bit tricky when users mix quoted and unquoted terms, so we may need to search literal and stemmed indexes separately.)

Test Plan

Added unit tests; ran unit tests.

Diff Detail

Repository
rPHU libphutil
Lint
Lint Not Applicable
Unit
Tests Not Applicable

Event Timeline

epriestley retitled this revision from to Support using the Porter stemmer when compiling search queries.
epriestley updated this object.
epriestley edited the test plan for this revision. (Show Details)
epriestley added a reviewer: chad.
chad edited edge metadata.
This revision is now accepted and ready to land.Nov 25 2016, 9:23 PM
This revision was automatically updated to reflect the committed changes.