Improve search stemmer performance for large inputs
Closed, Public

Authored by epriestley on Sep 26 2017, 2:18 AM.

Details

Summary

Ref T12974. See PHI87. As in D18647, we can improve the performance of some UTF8 operations here.

Instead of calling phutil_utf8_strtolower() on each token separately, call it once on the entire input up front. The resulting tokens are the same, but the UTF8 lowercasing runs once rather than once per token.
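
As a rough illustration of the change described above, here is a minimal Python sketch. It is not the libphutil code: `str.lower()` stands in for phutil_utf8_strtolower(), whitespace splitting stands in for the stemmer's real tokenizer, and both function names are hypothetical.

```python
def lowercase_per_token(text):
    # Old approach (sketch): split first, then lowercase every token
    # separately, so the lowercasing routine runs once per token.
    return [token.lower() for token in text.split()]


def lowercase_up_front(text):
    # New approach (sketch): lowercase the entire input once, then split.
    # Lowercasing does not add or remove whitespace, so the token
    # boundaries are unchanged.
    return text.lower().split()


if __name__ == "__main__":
    sample = "Improve SEARCH Stemmer Performance"
    # Both strategies yield the same list of tokens.
    print(lowercase_per_token(sample) == lowercase_up_front(sample))
```

The saving comes from making a single call over the whole input rather than one call per token, which matters most when the input is large and contains many tokens.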

Test Plan

Diff Detail

Repository
rPHU libphutil
Lint
Lint Not Applicable
Unit
Tests Not Applicable

Event Timeline

This revision is now accepted and ready to land. Sep 27 2017, 5:15 PM
This revision was automatically updated to reflect the committed changes.