
Improve search stemmer performance for large inputs
ClosedPublic

Authored by epriestley on Sep 26 2017, 2:18 AM.

Details

Summary

Ref T12974. See PHI87. As in D18647, we can improve the performance of some UTF8 operations here.

Instead of calling phutil_utf8_strtolower() on each token separately, call it once on the entire input up front. This has the same effect.
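
As a rough illustration of why the two approaches agree, here is a minimal Python sketch. It is not the libphutil code: `tokenize()` is a hypothetical whitespace tokenizer, and Python's `str.lower()` stands in for `phutil_utf8_strtolower()`. The assumption is that lowercasing never changes where token boundaries fall, so the order of the two steps does not matter.

```python
import re


def tokenize(text):
    # Hypothetical tokenizer (an assumption, not the real stemmer):
    # split on runs of whitespace and drop empty pieces.
    return [token for token in re.split(r"\s+", text) if token]


def normalize_per_token(text):
    # Old shape: one lowercase call per token.
    return [token.lower() for token in tokenize(text)]


def normalize_up_front(text):
    # New shape: lowercase the whole input once, then tokenize.
    return tokenize(text.lower())


sample = "Searching LARGE Inputs  Quickly"
assert normalize_per_token(sample) == normalize_up_front(sample)
```

For an input with many tokens, the second form makes one call over the whole string instead of one call per token, which is where the saving would come from if each call carries a fixed overhead.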

Test Plan

Diff Detail

Repository
rPHU libphutil
Lint
Lint Not Applicable
Unit
Tests Not Applicable

Event Timeline

This revision is now accepted and ready to land.
Sep 27 2017, 5:15 PM
This revision was automatically updated to reflect the committed changes.