Paths

Table of Contentst

Diffusion Phabricator 9288cad0edc8

(stable) Improve Ferret engine indexing performance for large blocks of text
9288cad0edc8
Actions

Tags

None

Referenced Files

None

Subscribers

None

Description

(stable) Improve Ferret engine indexing performance for large blocks of text

Summary:
See PHI87. Ref T12974. Currently, we do a lot more work here than we need to: we call phutil_utf8_strtolower() on each token, but can do it once at the beginning on the whole block.

Additionally, since ngrams don't care about order, we only need to convert unique tokens into ngrams. This saves us some phutil_utf8v(). These calls can be slow for large inputs.

Test Plan:

Created a ~4MB task description.
Ran bin/search index Txxx --profile ... to profile indexing performance before and after the change.
Saw total runtime drop form 38s to 9s.
Before: https://secure.phabricator.com/xhprof/profile/PHID-FILE-wiht5d7lkyazaywwxovw/
After: https://secure.phabricator.com/xhprof/profile/PHID-FILE-efxv56q2hulr6kjrxbx6/

Reviewers: amckinley

Reviewed By: amckinley

Maniphest Tasks: T12974

Differential Revision: https://secure.phabricator.com/D18647

Details

Provenance

epriestley	Authored on Sep 26 2017, 2:11 AM
epriestley	Pushed on Sep 27 2017, 5:25 PM

Reviewer

Differential Revision

D18647: Improve Ferret engine indexing performance for large blocks of text

Parents

rP7ae4d93043c8: (stable) Promote 2017 Week 38

Branches

Unknown

Tags

Unknown

Tasks

T12974: Upgrading: "Ferret" Fulltext Engine

Build Status

Buildable 18565
Build 25009: Run Core Tests

Event Timeline

epriestley committed rP9288cad0edc8: (stable) Improve Ferret engine indexing performance for large blocks of text (authored by epriestley).Sep 27 2017, 5:25 PM

epriestley added a task: T12974: Upgrading: "Ferret" Fulltext Engine.

Harbormaster completed building B18565: rP9288cad0edc8: (stable) Improve Ferret engine indexing performance for large blocks of text.Sep 27 2017, 5:26 PM

Changes (1)

Path

Size

src/

applications/

search/

ferret/

PhabricatorFerretEngine.php

rP9288cad0edc8

src/applications/search/ferret/PhabricatorFerretEngine.php

Loading...