Evaluate virtualizing Git refs by proxying the protocol
Open, NormalPublic
Actions

Assigned To

None

Authored By

	epriestley
	May 6 2015, 3:17 PM

Description

We may be able to implement T8092 by proxying the protocol, without needing to embed an implementation of Git. We do this to some degree in Mercurial and SVN already, with success. Although this is complex, it's potentially much less complex than embedding a Git implementation.

Revisions and Commits

rP Phabricator
	D20436	rP904dbf0db666 Make the "git upload-pack" proxy more robust
	D20381	rPe08ba99dd3db Proxy the "git upload-pack" wire protocol
	D20380	rP35539a019ce1 Add an optional protocol log to `git` SSH workflows

Related Objects
Search...

Status	Assigned	Task
Open	None	T8090 Allow Harbormaster to perform change handoff in a defensible way
Open	None	T8092 Evaluate the viability of virtualizing Git refs in hosted repositories
Open	None	T8093 Evaluate virtualizing Git refs by proxying the protocol
Open	None	T4369 Phabricator HTTP repository hosting has fairly severe scalability limits

Event Timeline

epriestley created this task.May 6 2015, 3:17 PM

epriestley raised the priority of this task from to Normal.

epriestley updated the task description. (Show Details)

epriestley added a project: Harbormaster.

epriestley added a subscriber: epriestley.

epriestley moved this task from Backlog to v1 on the Harbormaster board.May 6 2015, 3:28 PM

cburroughs added a subscriber: cburroughs.May 6 2015, 6:54 PM

joshuaspence added a subscriber: joshuaspence.May 6 2015, 8:39 PM

After tinkering a bit, I think this is viable. The Git wire protocol is relatively straightforward to proxy and rewrite at the ref level. However, we'll need to proxy both SSH and HTTP traffic, so we need to fix T4369 at a minimum before we can pursue this.

epriestley added a subtask: T4369: Phabricator HTTP repository hosting has fairly severe scalability limits.May 11 2015, 4:33 PM

epriestley mentioned this in T8089: Unprototype Harbormaster (v1).Jun 4 2015, 4:24 PM

epriestley moved this task from v1 to Future on the Harbormaster board.Aug 10 2015, 7:12 PM

stwalkerster added a subscriber: stwalkerster.Dec 5 2015, 10:55 PM

epriestley mentioned this in T10691: Support GitHub-like forking of repositories.Mar 30 2016, 9:40 PM

20after4 added a subscriber: 20after4.May 19 2016, 2:24 AM

Herald added a subscriber: eadler. · View Herald TranscriptMay 19 2016, 2:24 AM

eadler added a project: Restricted Project.Aug 5 2016, 4:44 PM

Very soon now, git is getting an exciting new wire protocol. Highlights are improving performance on repos with unholy amount of refs, and being "easier to expand".

epriestley mentioned this in T13277: In repositories, realign "Track Only", "Autoclose", and "Publish/Notify" toward "Permanent Refs".Apr 6 2019, 5:10 PM

epriestley mentioned this in T13278: Improve repository Staging Areas.Apr 6 2019, 5:14 PM

exciting new wire protocol

My plan for now is to do v1 support only, since: (a) we'll need v1 for 15 years anyway for everyone running Ubuntu 3 on original Xbox hardware in their corporate enterprise cluster; and (b) I can't immediately tricky my git into v2 anyway; and (c) it looks easier.

The v1 protocol looks like it's pretty one-shot and straightforward: whether we're running upload-pack or receive-pack, the server immediately sends a complete list of refs to the client when the client connects. This is sort of a weird way for the protocol to work for 10+ years (?), also considering that this is the "smart" protocol, but it makes our job easier, since it looks like we can (as a starting point, at least) just parse the first few frames of the protocol, delete/rewrite some refs, and then drop into passthru mode.

This will just hide the refs from the client. A "malicious" client could still use want commands to fetch the underlying commits. However, this is fine: we aren't planning to treat different views of the same repository as having different permissions.

The want/need stuff seems ref-independent, so editing the initial list of refs looks like it fixes the whole read pathway with no other changes.

The "push" part is a little messier since the client sends what it's pushing, then sends PACK data, then the server acknowledges what was written. We need to parse all of that so we can rewrite refs in the first part (client thinks it's pushing A, tell the server it's pushing secret/A) and the last part (server acknowledges a write to secret/A, we tell the client the server acknowledge a write to A).

jbrownEP added a subscriber: jbrownEP.Apr 7 2019, 1:22 AM

hskiba added a subscriber: hskiba.Apr 8 2019, 7:25 AM

epriestley added a revision: D20380: Add an optional protocol log to `git` SSH workflows.Apr 8 2019, 2:22 PM

epriestley added a revision: D20381: Proxy the "git upload-pack" wire protocol.Apr 8 2019, 4:01 PM

When there are no refs in a repository, the server does not appear to send a capabilities frame:

! git-upload-pack -- '/Users/epriestley/dev/core/repo/local/12/'

< Write [4 bytes]
<  30303030                                                                                                     0000

> Read [4 bytes]
>  30303030                                                                                                     0000

_ <End of Session>

This makes our job a lot easier but also is absolutely bananas?

Herald added a subscriber: amckinley. · View Herald TranscriptApr 16 2019, 3:58 PM

epriestley added a revision: D20436: Make the "git upload-pack" proxy more robust.Apr 16 2019, 4:23 PM

epriestley added a commit: rP35539a019ce1: Add an optional protocol log to `git` SSH workflows.Apr 18 2019, 11:57 AM

epriestley added a commit: rPe08ba99dd3db: Proxy the "git upload-pack" wire protocol.

epriestley added a commit: rP904dbf0db666: Make the "git upload-pack" proxy more robust.Apr 18 2019, 12:04 PM

• pasik added a subscriber: • pasik.Jun 1 2019, 10:46 AM

epriestley mentioned this in T13584: Shallow Git clones fail under recent versions of Git.Nov 3 2020, 7:04 PM

Evaluate virtualizing Git refs by proxying the protocolOpen, NormalPublicActions

Description

Revisions and Commits

Related ObjectsSearch...

Event Timeline

Evaluate virtualizing Git refs by proxying the protocol
Open, NormalPublic
Actions

Related Objects
Search...