Page MenuHomePhabricator

Improve search stemmer performance for large inputs
ClosedPublic

Authored by epriestley on Sep 26 2017, 2:18 AM.
Tags
None
Referenced Files
F13228977: D18648.diff
Mon, May 20, 1:45 PM
F13206054: D18648.diff
Wed, May 15, 4:53 AM
F13190086: D18648.diff
Sat, May 11, 7:37 AM
Unknown Object (File)
Sun, May 5, 1:29 PM
Unknown Object (File)
Thu, May 2, 11:39 PM
Unknown Object (File)
Fri, Apr 26, 3:17 PM
Unknown Object (File)
Apr 21 2024, 9:31 PM
Unknown Object (File)
Apr 19 2024, 6:51 PM
Subscribers
None

Details

Summary

Ref T12974. See PHI87. As in D18647, we can improve the performance of some UTF8 operations here.

Instead of calling phutil_utf8_strtolower() on each token separately, call it once on the entire input up front. This has the same effect.

Test Plan

Diff Detail

Repository
rPHU libphutil
Lint
Lint Not Applicable
Unit
Tests Not Applicable

Event Timeline

This revision is now accepted and ready to land.Sep 27 2017, 5:15 PM
This revision was automatically updated to reflect the committed changes.