Page MenuHomePhabricator

Speed up diffusion_browsequery for mercurial repositories
ClosedPublic

Authored by richardvanvelzen on Feb 7 2014, 11:44 AM.
Tags
None
Referenced Files
F18734605: D8163.diff
Tue, Sep 30, 10:57 PM
F18640906: D8163.diff
Sep 18 2025, 7:01 PM
F18611002: D8163.id.diff
Sep 14 2025, 5:55 AM
F18598568: D8163.diff
Sep 13 2025, 5:19 AM
F18048909: D8163.id.diff
Aug 3 2025, 2:30 PM
F18039473: D8163.id18466.diff
Aug 3 2025, 3:41 AM
F18037214: D8163.id18466.diff
Aug 3 2025, 2:30 AM
F18000298: D8163.diff
Aug 2 2025, 7:14 AM

Details

Summary

Ref T4387. By using hg locate to attempt to only list files in the given path
browsing diffusion is a bit faster. In a repo of about 600M it shaves a rough 100ms
off viewing the root of the project.

Test Plan

Looked around in diffusion and saw it showed everything including .files, which was nice

Diff Detail

Lint
Lint Skipped
Unit
Tests Skipped

Event Timeline

This is great! I think I caught one edge case inline -- does that make sense? Am I crazy? I didn't actually import my silly deeply-nested repository so I'm not 100% sure that this is actually an issue, but it seemed to have the wrong behavior when I poked it on the command line.

src/applications/diffusion/conduit/ConduitAPI_diffusion_browsequery_Method.php
204

I think we can't include "-X", because Mercurial won't print directories unless we enumerate files in them, so this will incorrectly hide directories which nest several levels deep without any files next to them. For example, I made a repository with just one file:

a/b/c/d/e/.../l/m

When browsing a/, we should see that b/ is a subdirectory. However, with the -X flag, it's not visible:

>>> orbital ~/repos/hg-deep $ hg status
A a/b/c/d/e/f/g/h/i/j/k/l/m
>>> orbital ~/repos/hg-deep $ hg locate -I ./a -X ./a/*/*/*
>>> orbital ~/repos/hg-deep $

Omitting the -X flag, we can see it:

>>> orbital ~/repos/hg-deep $ hg locate -I ./a
a/b/c/d/e/f/g/h/i/j/k/l/m

This is unfortunate because it means we have to do a lot of extra work at the root, but we could conceivably add caching to handle that case pretty easily.

Ah yes, I hadn't taken that case into account and you're absolutely right.

There is no efficient solution to doing this, but having the -I will at least make it faster for nested structures. I'll take another look at this next week to see if I can find a cleaner solution.

richardvanvelzen updated this revision to Unknown Object (????).Feb 7 2014, 5:39 PM

Remove the exclusion to include deeper nested structures

Closed by commit rP5771d1395203 (authored by Richard van Velzen <rvanvelzen@expert-shops.com>, committed by @epriestley).