Page MenuHomePhabricator

For backup persitsence, mark the "common ngrams" table as a data table, not an index table
ClosedPublic

Authored by epriestley on Oct 10 2017, 12:04 AM.
Tags
None
Referenced Files
F14755615: D18696.id44895.diff
Tue, Jan 21, 4:06 PM
F14755134: D18696.id44886.diff
Tue, Jan 21, 3:56 PM
F14751103: D18696.id.diff
Tue, Jan 21, 12:06 PM
F14749245: D18696.diff
Tue, Jan 21, 11:13 AM
Unknown Object (File)
Sat, Jan 18, 1:22 AM
Unknown Object (File)
Sun, Jan 5, 9:05 AM
Unknown Object (File)
Fri, Jan 3, 2:34 PM
Unknown Object (File)
Fri, Jan 3, 11:16 AM
Subscribers
None

Details

Summary

Ref T13000. Garbage collecting common ngrams is slow because MySQL isn't all that great at deleting rows quickly. See PHI96, where it looks like it's going to take a week to GC ngrams for a ~million objects at a relatively conservative 0.15 threshold.

In the event of a restore, we can reduce the impact by persisting this table so the ngrams just don't get built when the reindex happens.

Test Plan

Viewed schema in Config, saw common ngrams tables marked as "Data" instead of "Index".

Diff Detail

Repository
rP Phabricator
Lint
Lint Not Applicable
Unit
Tests Not Applicable