Improve performance of Ferret engine ngram extraction, particularly for large input strings
Summary:
See PHI87. Ref T12974. The array_slice() approach to splitting the string apart can perform poorly for large input strings. I believe this is mostly due to the sheer number of calls, plus the nontrivial cost of building and returning a new array for each slice.
We can use substr() instead, as long as we're careful to track byte offsets correctly when the string contains multibyte UTF-8 characters.
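As a rough sketch of the idea (not the exact code in this change): `extract_trigrams()` is a hypothetical name, and this assumes `phutil_utf8v()` is available to split the token into characters so we know each character's byte length.

```lang=php
<?php
// Hypothetical sketch of the substr()-based approach; not the exact patch.
function extract_trigrams($token) {
  $ngrams = array();

  // Each element is one UTF-8 character; multibyte characters span
  // several bytes, so we track a byte cursor separately.
  $characters = phutil_utf8v($token);
  $count = count($characters);

  $byte_cursor = 0;
  for ($ii = 0; $ii + 3 <= $count; $ii++) {
    // Byte length of the three characters starting at the cursor.
    $ngram_bytes =
      strlen($characters[$ii]) +
      strlen($characters[$ii + 1]) +
      strlen($characters[$ii + 2]);

    // Slice the original string directly instead of building a new
    // array with array_slice() and imploding it.
    $ngram = substr($token, $byte_cursor, $ngram_bytes);
    $ngrams[$ngram] = true;

    // Advance the cursor past the character we just consumed.
    $byte_cursor += strlen($characters[$ii]);
  }

  return array_keys($ngrams);
}
```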
Test Plan:
- Created a task with a single, unbroken blob of base64-encoded data as the description, roughly 100KB long.
- Saw indexing performance improve from ~6s to ~1.5s with the patch applied.
- Before: https://secure.phabricator.com/xhprof/profile/PHID-FILE-nrxs4lwdvupbve5lhl6u/
- After: https://secure.phabricator.com/xhprof/profile/PHID-FILE-6vs2akgjj5nbqt7yo7ul/
Reviewers: amckinley
Reviewed By: amckinley
Maniphest Tasks: T12974
Differential Revision: https://secure.phabricator.com/D18649