Paths

Table of Contentst

Differential D14846

Implement basic ngram search for Owners Package names
ClosedPublic
Actions

Authored by epriestley on Dec 21 2015, 9:07 PM.

Tags

None

Referenced Files

	Unknown Object (File)
	Wed, Jul 1, 11:41 AM

	Unknown Object (File)
	Jun 25 2026, 11:58 AM

	Unknown Object (File)
	Jun 22 2026, 9:14 PM

	Unknown Object (File)
	Jun 22 2026, 3:41 PM

	Unknown Object (File)
	Jun 19 2026, 7:44 PM

	Unknown Object (File)
	May 29 2026, 9:03 PM

	Unknown Object (File)
	May 29 2026, 9:02 PM

	Unknown Object (File)
	Mar 21 2026, 6:36 PM

Subscribers

None

Details

Reviewers

Maniphest Tasks

T9979: Build support for ngram indexes for substring searches (e.g., file, paste, package, task titles)

Commits

Restricted Diffusion Commit
rP96fe8c0b83cf: Implement basic ngram search for Owners Package names

Summary

Ref T9979. This uses ngrams (specifically, trigrams) to build a reasonably efficient index for substring matching. Specifically, for a package like "Example", with ID 123, we store rows like this:

< ex, 123>
<exa, 123>
<xam, 123>
<amp, 123>
<mpl, 123>
<ple, 123>
<le , 123>

When the user searches for exam, we join this table for packages with tokens exa and xam. MySQL can do this a lot more efficiently than it can process a LIKE "%exam%" query against a huge table.

When the user searches for a one-letter or two-letter string, we only search the beginnings of words. This is probably what they want, the only thing we can do quickly, and a reasonable/expected behavior for typeaheads.

Test Plan

Ran storage upgrades and search indexer.
Searched for stuff with "name contains".
Used typehaead and got sensible results.
Searched for aabbccddeeffgghhiijjkkllmmnnooppqqrrssttuuvvwwxxyyzz and saw only 16 joins.

Diff Detail

Repository

Lint

Lint Not Applicable

Unit

Tests Not Applicable

Event Timeline

epriestley updated this revision to Diff 35881.Dec 21 2015, 9:07 PM

epriestley retitled this revision from to Implement basic ngram search for Owners Package names.

epriestley updated this object.

epriestley edited the test plan for this revision. (Show Details)

epriestley added a reviewer: chad.

epriestley added a task: T9979: Build support for ngram indexes for substring searches (e.g., file, paste, package, task titles).

epriestley mentioned this in T9979: Build support for ngram indexes for substring searches (e.g., file, paste, package, task titles).Dec 21 2015, 9:08 PM

epriestley edited the test plan for this revision. (Show Details)Dec 21 2015, 9:11 PM

noidea

This revision is now accepted and ready to land.Dec 21 2015, 9:39 PM

Closed by commit rP96fe8c0b83cf: Implement basic ngram search for Owners Package names (authored by epriestley, committed by epriestley). · Explain WhyDec 22 2015, 4:00 PM

This revision was automatically updated to reflect the committed changes.

epriestley mentioned this in D14411: Allow querying for files by name.Dec 22 2015, 4:14 PM

amckinley mentioned this in D17702: Implement ngram search for File objects.Apr 17 2017, 7:24 PM

Revision Contents
Changeset List

Path

Size

resources/

sql/

autopatches/

20151221.search.2.ownersngrams.sql

7 lines

20151221.search.3.reindex.php

11 lines

src/

__phutil_library_map__.php

13 lines

applications/

config/

schema/

PhabricatorConfigSchemaSpec.php

8 lines

owners/

editor/

PhabricatorOwnersPackageTransactionEditor.php

4 lines

query/

PhabricatorOwnersPackageFulltextEngine.php

26 lines

PhabricatorOwnersPackageQuery.php

17 lines

PhabricatorOwnersPackageSearchEngine.php

8 lines

storage/

PhabricatorOwnersPackage.php

36 lines

PhabricatorOwnersPackageNameNgrams.php

18 lines

PhabricatorOwnersPackageTransaction.php

8 lines

typeahead/

PhabricatorOwnersPackageDatasource.php

2 lines

search/

engineextension/

PhabricatorFulltextIndexEngineExtension.php

3 lines

PhabricatorNgramsIndexEngineExtension.php

34 lines

PhabricatorSearchNgramsDestructionEngineExtension.php

31 lines

interface/

PhabricatorNgramsInterface.php

7 lines

ngrams/

PhabricatorSearchNgrams.php

113 lines

infrastructure/

query/

policy/

PhabricatorCursorPagedPolicyAwareQuery.php

139 lines

Diff 35903

resources/sql/autopatches/20151221.search.2.ownersngrams.sql

Loading...

resources/sql/autopatches/20151221.search.3.reindex.php

Loading...

src/__phutil_library_map__.php

Loading...

src/applications/config/schema/PhabricatorConfigSchemaSpec.php

Loading...

src/applications/owners/editor/PhabricatorOwnersPackageTransactionEditor.php

Loading...

src/applications/owners/query/PhabricatorOwnersPackageFulltextEngine.php

Loading...

src/applications/owners/query/PhabricatorOwnersPackageQuery.php

Loading...

src/applications/owners/query/PhabricatorOwnersPackageSearchEngine.php

Loading...

src/applications/owners/storage/PhabricatorOwnersPackage.php

Loading...

src/applications/owners/storage/PhabricatorOwnersPackageNameNgrams.php

Loading...

src/applications/owners/storage/PhabricatorOwnersPackageTransaction.php

Loading...

src/applications/owners/typeahead/PhabricatorOwnersPackageDatasource.php

Loading...

src/applications/search/engineextension/PhabricatorFulltextIndexEngineExtension.php

Loading...

src/applications/search/engineextension/PhabricatorNgramsIndexEngineExtension.php

Loading...

src/applications/search/engineextension/PhabricatorSearchNgramsDestructionEngineExtension.php

Loading...

src/applications/search/interface/PhabricatorNgramsInterface.php

Loading...

src/applications/search/ngrams/PhabricatorSearchNgrams.php

Loading...

src/infrastructure/query/policy/PhabricatorCursorPagedPolicyAwareQuery.php

Loading...