
For backup persistence, mark the "common ngrams" table as a data table, not an index table
Closed, Public

Authored by epriestley on Oct 10 2017, 12:04 AM.
Tags
None
Subscribers
None

Details

Summary

Ref T13000. Garbage collecting common ngrams is slow because MySQL isn't all that great at deleting rows quickly. See PHI96, where it looks like it's going to take a week to GC ngrams for roughly a million objects at a relatively conservative 0.15 threshold.

In the event of a restore, we can reduce the impact by persisting this table, so that ngrams already known to be common just don't get built when the reindex happens.
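
To illustrate why the classification matters for backups, here is a minimal Python sketch. It is not Phabricator's actual code: the table names, the `Persistence` enum, and `tables_to_dump` are all hypothetical. It assumes a backup tool that dumps only tables marked as data and leaves index tables to be rebuilt by a reindex after a restore.

```python
from enum import Enum


class Persistence(Enum):
    DATA = "data"    # included in backups; survives a restore
    INDEX = "index"  # skipped by backups; rebuilt by a reindex


# Hypothetical table names and classifications, for illustration only.
TABLES = {
    "example_document": Persistence.DATA,
    "example_document_fngrams": Persistence.INDEX,
    # Marked as data so the set of common ngrams survives a restore.
    "example_document_fngrams_common": Persistence.DATA,
}


def tables_to_dump(tables):
    """Return the sorted names of tables a backup should include."""
    return sorted(
        name for name, kind in tables.items() if kind is Persistence.DATA
    )


print(tables_to_dump(TABLES))
```

Under these assumptions, the common ngrams table is part of the dump while the main ngrams index is not, so a post-restore reindex can consult the persisted common ngrams instead of rebuilding them and garbage collecting them again.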

Test Plan

Viewed schema in Config, saw common ngrams tables marked as "Data" instead of "Index".

Diff Detail

Repository
rP Phabricator
Lint
Lint Not Applicable
Unit
Tests Not Applicable