Page MenuHomePhabricator

Pull Git filesize logic into a separate LowLevel query and use more Iterators
ClosedPublic

Authored by epriestley on Nov 22 2018, 12:22 AM.
Tags
None
Referenced Files
F15466986: D19830.id47355.diff
Thu, Apr 3, 11:30 AM
F15460925: D19830.id47355.diff
Tue, Apr 1, 3:26 AM
F15445835: D19830.id47355.diff
Thu, Mar 27, 4:11 PM
F15422701: D19830.id47355.diff
Sat, Mar 22, 10:15 AM
F15411146: D19830.diff
Wed, Mar 19, 8:52 AM
F15380059: D19830.diff
Fri, Mar 14, 1:12 AM
F15364021: D19830.id47355.diff
Tue, Mar 11, 12:57 PM
F15338358: D19830.diff
Sun, Mar 9, 10:35 AM
Subscribers
None

Details

Summary

Depends on D19829. Ref T13216. See PHI908. The current implementation is kind of a lot to live in CommitHookEngine and will likely fail if git diff-tree produces more than 2GB of output.

Pull it out and make it slightly more robust against enormous commits. It's probably limited by this, now:

implode("\n", $every_path)

We could replace that with some PhutilReverseRopeSource primitive or something but since we don't have one of those and it seems unlikely that we'll hit this case in practice, I left it here for now with just the easy stuff converted to be stream-oriented.

Test Plan

Used this script to test the query against various commits, got good results:

<?php

require_once 'scripts/init/init-script.php';

$viewer = PhabricatorUser::getOmnipotentUser();

$repository = id(new PhabricatorRepositoryQuery())
  ->setViewer($viewer)
  ->withCallsigns(array('P'))
  ->executeOne();

var_dump(
  id(new DiffusionLowLevelFilesizeQuery())
    ->setRepository($repository)
    ->withIdentifier($argv[1])
    ->execute());

Used this to find large commits in history and pull filesizes (worked great, although our largest commit only touches a couple thousand paths):

for hash in `git log --format=%H`; do echo -n $hash; echo -n ' '; git diff-tree -r --no-commit-id $hash | wc -l | awk '{print $1}'; done | awk '{print $2 " " $1}' | sort -n

Diff Detail

Repository
rP Phabricator
Branch
rconfig5
Lint
Lint Passed
Unit
Tests Passed
Build Status
Buildable 21204
Build 28835: Run Core Tests
Build 28834: arc lint + arc unit