⚓ T13546 Modernize the "arc land" workflow

...and you arc land C, we'll land A, B, and C. However, if A, B, or C are not the most up-to-date version of those changes, we might land older code than the user expects.

In the case where the user explicitly typed arc land C and older code lands, I think this is not terribly bad: they at least very clearly told us to do something and we did exactly what they asked. Still, in all cases we can compare the set of changes we are planning to land to the last diffed set of changes and raise some basic warnings (notably, if the revision has a different set of changes and was updated more recently than the local changes were committed). This is straightforward.
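A minimal sketch of that basic warning, assuming we already have the local commit hashes and the hashes from the last diff for a revision. The `LocalChange` and `RemoteRevision` shapes and the `stale_warnings` name are hypothetical, chosen for illustration only; they are not part of arc.

```python
from dataclasses import dataclass
from datetime import datetime
from typing import FrozenSet, List


@dataclass(frozen=True)
class LocalChange:
    """What we plan to land for one revision (hypothetical shape)."""
    revision_id: str
    commit_hashes: FrozenSet[str]
    committed_at: datetime  # commit date of the newest local commit


@dataclass(frozen=True)
class RemoteRevision:
    """What the revision last recorded (hypothetical shape)."""
    revision_id: str
    diffed_hashes: FrozenSet[str]
    updated_at: datetime


def stale_warnings(local: LocalChange, remote: RemoteRevision) -> List[str]:
    """Warn if the revision has different changes AND was updated more recently."""
    if local.commit_hashes == remote.diffed_hashes:
        return []
    if remote.updated_at <= local.committed_at:
        return []
    return [
        "%s was updated after your local changes were committed; "
        "you may be landing older code." % local.revision_id
    ]
```

Both conditions must hold before warning: a different set of changes alone is normal when the user has local updates they have not sent for review.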

The less straightforward case is when A or B is out of date, but a newer version exists somewhere else in the working copy. We'd like to find this A[2] or B[2] and land that instead. Often, this looks like starting here:

o master
 \
  \
   o A
    \
     \
      o B

...then you git checkout A and commit a fix:

o master
 \
  \
   o --- * A
    \
     \
      o B

...or git checkout A and amend a fix:

o master
 \---\
  \   \
   x   * A
    \
     \
      o B

In either case, you arc land B. In the diagrams above, the desired behavior is to include the commits marked * and exclude the commits marked x.

Finding these commits is challenging in the general case. We can only really expect that:

Reachable: They're reachable from some repository marker.

Although it's possible to update A on a detached head and save the hash on a sticky note, I think it's unreasonable for users to expect we'll find it. We can reasonably say we'll only look for commits that are ancestors of some marker (branch or bookmark).

Authored after A: Any commits we find should have an author date later than the earliest author date of any commit attached to the revision.

I don't think we can make any stronger claims than this in the general case, because users may have mutated or rebased their working copy, may push anything to any remote at any time, and so on. Signals like commit date, author date, presence in remotes, Mercurial draft phase, git upstreams, etc., cannot be used to exclude markers or commits in the general case.

But this means we have to search back from every marker in the repository, and can't stop our search until either (a) we find a single GCA or (b) all active cursors have author dates older than the oldest author date of A.
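A rough sketch of stop condition (b) only, assuming the commit graph is available in memory as a mapping from hash to a hypothetical `Commit` record with an integer author timestamp. The GCA check in (a) is deliberately left out, and none of these names come from arc.

```python
from dataclasses import dataclass
from typing import Dict, Iterable, Set, Tuple


@dataclass(frozen=True)
class Commit:
    author_date: int          # epoch seconds (assumed representation)
    parents: Tuple[str, ...]  # parent commit hashes


def search_from_markers(
    commits: Dict[str, Commit],
    marker_heads: Iterable[str],
    oldest_author_date: int,
) -> Set[str]:
    """Walk back from every marker until all cursors predate the revision."""
    seen: Set[str] = set()
    frontier: Set[str] = set(marker_heads)
    while frontier:
        # Stop only when EVERY active cursor is older than the cutoff.
        if all(commits[h].author_date < oldest_author_date for h in frontier):
            break
        seen |= frontier
        frontier = {
            parent
            for h in frontier
            for parent in commits[h].parents
            if parent not in seen
        }
    return seen
```

The walk cannot prune a single old cursor on its own, because author dates are not guaranteed to decrease along history; it only stops once the whole frontier is older than the cutoff.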

In the general case, we can't even cache this work. If we examine the history of commit X and find it has no revision at time T, then the user runs arc land again later, there is no guarantee that X isn't mapped to a revision at time T + 1: the user may run arc diff between the two invocations of arc land.

Very modern Mercurial can trace the evolution of change A -> A', but I believe evolve is not widely deployed. Git can trace this evolution only through the reflog, which is unreliable and transient. Arc could trace this evolution, but only by wrapping every VCS command and forbidding users from using git or hg.

After we find A in history, we can ask Phabricator if it knows of a newer A, then look for that newer A. But this is not reliable in the general case because the user may have local updates to A that they haven't sent for review yet.

We can let users configure marker patterns for markers which never contain newer changes. I dislike this because it adds more configuration, the configuration is complex, there's a "right" and "wrong" value for it, and I suspect users may believe that arc should be able to figure it out even though I think it cannot.

So I think the logic looks like this, mostly similar to today:

Log from the symbols to the GCA with the into commit.
Partition the graph into sets.
Identify "collateral damage" revisions which we're also landing.

The new piece is:

For each collateral revision:
- Log backward from every head.
- Stop if we reach a single GCA.
- Stop if all cursors have an author date before the first known date of the revision.
- Map commits in this graph to revisions (this is potentially very slow, since the graph may be very large).
- If the graph contains commits associated with A which are not the original commits:
  - Partition the graph.
  - For each "A" partition:
    - Find all heads within the partition.
  - If there are multiple heads across all the "A" partitions:
    - If exactly one head has a marker, ask the user to confirm that?
    - Ask the user to confirm the newest head?
    - Ask the user to confirm the other head, if there are only two heads?
    - Give up and suggest --pick?

This is fairly awful, but probably not impossible.
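The head-selection part of the outline above can be sketched as follows. This is one possible resolution of the open questions in the list, not a settled design: it uses a lone head directly, proposes a lone marked head for confirmation, and otherwise falls back to suggesting --pick. The function names and the returned action strings are hypothetical.

```python
from typing import Dict, List, Optional, Set, Tuple


def find_heads(parents_of: Dict[str, List[str]], partition: Set[str]) -> Set[str]:
    """Heads are commits in the partition with no child inside the partition."""
    has_child = {
        parent
        for commit in partition
        for parent in parents_of.get(commit, [])
        if parent in partition
    }
    return partition - has_child


def choose_head(heads: Set[str], marked: Set[str]) -> Tuple[str, Optional[str]]:
    """Return (action, commit); action is 'use', 'confirm', or 'pick'."""
    if len(heads) == 1:
        return ("use", next(iter(heads)))
    marked_heads = heads & marked
    if len(marked_heads) == 1:
        # Exactly one head has a branch or bookmark: ask the user to confirm it.
        return ("confirm", next(iter(marked_heads)))
    # Ambiguous: give up and suggest "--pick".
    return ("pick", None)
```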

epriestley added a commit: rARC63f2e667b9a4: Update "arc land" display of build failures, and rename "DisplayRef" to…. Jun 30 2020, 1:28 PM

epriestley added a commit: rARC33bb0acf97b3: Collect scattered implementations of "getDisplayHash()" into RepositoryAPI.

epriestley added a commit: rARCb0a9ef8351f1: In "arc land" under Git, confirm branch creation.

epriestley added a commit: rARCf52222ad19a0: Add more "RepositoryRef" legacy status mappings. Jun 30 2020, 1:30 PM

epriestley added a commit: rARC33484b43c9c8: Introduce "GridView", an updated version of "ConsoleTableView".

epriestley added a revision: D21371: Introduce "phutil_partition()" and natural case sorting for "msortv(...)". Jun 30 2020, 1:41 PM

epriestley added a commit: rARCc53c05e5b2a6: Introduce "phutil_partition()" and natural case sorting for "msortv(...)". Jun 30 2020, 1:45 PM

A related issue is that it is difficult to identify the set of "published" hashes in the general case. We would like arc branches to show all unpublished history, but stop at published history.

This is difficult because users routinely push temporary changes into remotes. Git upstreams are not reliable (many users set upstream branches to temporary branches to make git push easier to use) and hg outgoing is likewise not reliable.

When we recognize a remote as a Phabricator repository, we can safely use any refs in that repository which are configured as permanent refs: the install has told us they are permanent. But things become difficult beyond this. Without a way to do this, it's hard to make the arc branches tree work even plausibly well in many cases, although perhaps I can just degrade into skipping revision resolution beyond some depth.
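As an illustration of the safe case only, here is a sketch that treats commits as published when they are reachable from a permanent ref on a recognized remote. The `Remote` and `RemoteRef` shapes, the `is_known_repository` flag, and both function names are assumptions made for this example; they do not describe arc's actual data model.

```python
from dataclasses import dataclass
from typing import Dict, Iterable, List, Set, Tuple


@dataclass(frozen=True)
class RemoteRef:
    name: str
    commit_hash: str
    permanent: bool  # the install configured this ref as permanent


@dataclass(frozen=True)
class Remote:
    uri: str
    is_known_repository: bool  # we matched the URI to a Phabricator repository
    refs: Tuple[RemoteRef, ...]


def published_heads(remotes: Iterable[Remote]) -> Set[str]:
    """Collect heads of permanent refs, but only from recognized remotes."""
    return {
        ref.commit_hash
        for remote in remotes
        if remote.is_known_repository
        for ref in remote.refs
        if ref.permanent
    }


def published_commits(parents_of: Dict[str, List[str]], heads: Set[str]) -> Set[str]:
    """Everything reachable from a published head is published."""
    published: Set[str] = set()
    stack = list(heads)
    while stack:
        commit = stack.pop()
        if commit in published:
            continue
        published.add(commit)
        stack.extend(parents_of.get(commit, []))
    return published
```

Refs on unrecognized remotes, and non-permanent refs on recognized ones, contribute nothing here, which matches the point that temporary pushes cannot be trusted as a publishing signal.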

epriestley added a revision: D21372: Copy repository URI normalization code from Phabricator to Arcanist. Jun 30 2020, 5:47 PM

epriestley added a revision: D21373: Collapse repository URI normalization code into Arcanist. Jun 30 2020, 5:55 PM

epriestley added a revision: D21374: Support inspection of remote refs with "arc inspect remote(...)". Jun 30 2020, 6:58 PM

epriestley added a revision: D21375: Support generating remote refs in Git. Jun 30 2020, 7:54 PM

epriestley added a revision: D21376: Provide "arc look", a user-facing inspection command. Jun 30 2020, 7:57 PM

epriestley added a commit: rARCb19985a4bde6: Copy repository URI normalization code from Phabricator to Arcanist. Jun 30 2020, 8:07 PM

epriestley added a commit: rARC89f9eb66a74b: Support inspection of remote refs with "arc inspect remote(...)".

epriestley added a commit: rARCffb027e85ccf: Support generating remote refs in Git.

epriestley added a commit: rARC6bf7a40358f8: Provide "arc look", a user-facing inspection command.

epriestley added a revision: D21377: Load and map repository objects for remote URIs. Jun 30 2020, 8:20 PM

epriestley added a commit: rARC50f7a853b5cf: Load and map repository objects for remote URIs. Jun 30 2020, 8:43 PM

epriestley added a revision: D21378: Identify published commits in working copies by using remote configuration. Jun 30 2020, 9:55 PM

epriestley added a commit: rARC80f5166b701d: Identify published commits in working copies by using remote configuration. Jun 30 2020, 9:56 PM

epriestley added a revision: D21379: Support date-range commit graph queries, and multiple disjoint commits in Git. Jun 30 2020, 10:01 PM

epriestley added a revision: D21380: Give Mercurial more plausible marker behavior. Jun 30 2020, 10:24 PM

epriestley added a commit: rARCcd19216ea28f: Render "arc markers" workflows as a tree, not a list. Jun 30 2020, 10:50 PM

epriestley added a commit: rARC10c4a551ae9d: Remove implicit sorting from "MarkerRefQuery".

epriestley added a commit: rARC0ad3222d5966: Improve grid layout in "arc branches" at various terminal widths.

epriestley added a commit: rARC5d305909eb91: When a commit graph set has many commits, summarize them.

epriestley added a commit: rARCc7093a2e5796: In "arc branches", group linear sequences of published revisions together.

epriestley added a commit: rARC8c95dc0d295e: Support date-range commit graph queries, and multiple disjoint commits in Git.

epriestley added a commit: rARC4b8a32ee0273: Give Mercurial more plausible marker behavior.

epriestley added a commit: rP7d496f2c6d7b: Collapse repository URI normalization code into Arcanist. Jun 30 2020, 10:54 PM

epriestley added a revision: D21390: Render the state tree in "arc branches" slightly more cleanly. Jul 3 2020, 6:39 PM

epriestley added a commit: rARCa5480609f870: Render the state tree in "arc branches" slightly more cleanly. Jul 3 2020, 7:43 PM

epriestley added a commit: rARC8795282286a7: (stable) Promote 2020 Week 26. Jul 3 2020, 8:10 PM

See PHI1807. At time of writing, arc land can delete the local master if you land it onto itself. This isn't a big deal (it gives you the command to get it back), but it is not intended and is undesirable. Although it isn't recommended, arc land is supposed to support working in master and landing master into itself.

arc has three phases of local branch updates. The first two may run incrementally (with --incremental); the third is a finalization step which runs at the end.

(incremental) "cascade" updates, rebasing branches which descend from the range which was just pushed;
(incremental) "prune" updates, deleting local branches which are no longer relevant; and
(final) a "reconcile" update, where we figure out what state we should leave the user in.

Currently, the "cascade" step skips branches which point directly at the heads of ranges which landed, with the expectation that we will "update or delete them later".

The "prune" step then deletes all branches which point at landed heads.

The "prune" behavior is likely fine if the "cascade" step updates these branches.

The "reconcile" behavior is likely fine too, although it may need a messaging change and a short-circuit in this case.

So when do we retain branches?

I think we can't know for sure in all cases. When a user runs arc land X, they sometimes would like X to be deleted and sometimes would like X to be retained, and I think we cannot deduce intent purely from repository state in the general case.

A small subset of users use git checkout -b feature1 origin/release-1.2.3 (so their local feature branch may have no local ancestors) and then possibly set the branch upstream to point somewhere else. This looks like a "master" and quacks like a "master", but is not a "master".

A practical rule for retaining a local branch is probably just:

retain local branch X (which would otherwise be obsoleted) if it has the same name as the into branch.

This is straightforward, at least.
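A minimal sketch of the "prune" step with that retention rule applied, assuming local branches are given as a name-to-hash mapping. The `branches_to_prune` name and its parameters are hypothetical and only illustrate the rule.

```python
from typing import Dict, List, Set


def branches_to_prune(
    local_branches: Dict[str, str],  # branch name -> commit hash it points at
    landed_heads: Set[str],          # heads of the ranges which just landed
    into_branch: str,                # name of the branch we landed into
) -> List[str]:
    """Prune branches at landed heads, except one named like the into branch."""
    return sorted(
        name
        for name, commit_hash in local_branches.items()
        if commit_hash in landed_heads and name != into_branch
    )
```

For example, landing a local master into master leaves master alone, while a feature branch pointing at the same landed head is still pruned.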

epriestley added a revision: D21392: Allow "hg arc-ls-markers" to run under Python 2 or Python 3. Jul 6 2020, 10:29 PM

epriestley added a commit: rARCa28e76b7b3df: Allow "hg arc-ls-markers" to run under Python 2 or Python 3. Jul 6 2020, 10:29 PM

epriestley added a commit: rARC1a54e1103c23: (stable) Allow "hg arc-ls-markers" to run under Python 2 or Python 3. Jul 6 2020, 10:34 PM

hskiba added a subscriber: hskiba. Jul 20 2020, 1:56 AM

epriestley mentioned this in T13566: Improve fallback behavior for "arc branches/bookmarks" when unpublished local state appears to have >1K commits. Aug 11 2020, 6:34 PM

cspeckmim added a subscriber: cspeckmim. Jul 6 2021, 9:22 PM

cspeckmim added a revision: D21680: An assortment of fixes and updates to using arc-land with mercurial. Jul 7 2021, 8:53 PM

cspeckmim added a revision: D21682: Add a prompt to allow pruning merged branches when using --hold. Jul 11 2021, 5:57 AM

cspeckmim added a commit: rARCa43a3a9aabe2: An assortment of fixes and updates to using arc-land with mercurial. Jul 12 2021, 3:41 AM

rARC Arcanist
	Abandoned		D21682 Add a prompt to allow pruning merged branches when using --hold
		D21680	rARCa43a3a9aabe2 An assortment of fixes and updates to using arc-land with mercurial
		D21392	rARC1a54e1103c23 (stable) Allow "hg arc-ls-markers" to run under Python 2 or Python 3
		D21392	rARCa28e76b7b3df Allow "hg arc-ls-markers" to run under Python 2 or Python 3
		D21390	rARC8795282286a7 (stable) Promote 2020 Week 26
		D21390	rARCa5480609f870 Render the state tree in "arc branches" slightly more cleanly
		D21380	rARC4b8a32ee0273 Give Mercurial more plausible marker behavior
		D21379	rARC8c95dc0d295e Support date-range commit graph queries, and multiple disjoint commits in Git
		D21367	rARCc7093a2e5796 In "arc branches", group linear sequences of published revisions together
		D21366	rARC5d305909eb91 When a commit graph set has many commits, summarize them
		D21365	rARC0ad3222d5966 Improve grid layout in "arc branches" at various terminal widths
		D21364	rARC10c4a551ae9d Remove implicit sorting from "MarkerRefQuery"
		D21363	rARCcd19216ea28f Render "arc markers" workflows as a tree, not a list
		D21378	rARC80f5166b701d Identify published commits in working copies by using remote configuration
		D21377	rARC50f7a853b5cf Load and map repository objects for remote URIs
		D21376	rARC6bf7a40358f8 Provide "arc look", a user-facing inspection command
		D21375	rARCffb027e85ccf Support generating remote refs in Git
		D21374	rARC89f9eb66a74b Support inspection of remote refs with "arc inspect remote(...)"
		D21372	rARCb19985a4bde6 Copy repository URI normalization code from Phabricator to Arcanist
		D21371	rARCc53c05e5b2a6 Introduce "phutil_partition()" and natural case sorting for "msortv(...)"
		D21360	rARC33484b43c9c8 Introduce "GridView", an updated version of "ConsoleTableView"
		D21355	rARCf52222ad19a0 Add more "RepositoryRef" legacy status mappings
		D21354	rARCb0a9ef8351f1 In "arc land" under Git, confirm branch creation
		D21353	rARC33bb0acf97b3 Collect scattered implementations of "getDisplayHash()" into RepositoryAPI
		D21352	rARC63f2e667b9a4 Update "arc land" display of build failures, and rename "DisplayRef" to…
		D21350	rARC50c534b5911a Correct some minor "arc land" workflow issues in Mercurial
		D21349	rARC86951ad0678f Use a "branchmap" call to identify remote branches in "arc-hg"
		D21348	rARC488a24c40a26 In "arc land" in Mercurial, inch closer to making complex branch/bookmark…
		D21351	rARC92f860ae9b2f Improve "--hold", save/restore state, bookmark creation, and some warnings for…
		D21347	rARC727d73fec937 In "arc land", fix some coarse issues with build warnings
		D21343	rARCb1f807f7ca93 Disambiguate various types of Mercurial remote markers with "hg arc-ls-remote"
		D21342	rARC1bb054ef47a1 Verify remotes ("paths") in Mercurial during "arc land"
		D21345	rARC705c48effcb5 Realign "arc land" closed/published warning around more modern language
		D21344	rARC3cad824e3872 In "arc land" in Mercurial, show a tidier "ls-remote" command
		D21341	rARC091aebe0149a Refine "arc land" behavior when pushing "onto" a new branch
		D21340	rARCab70626c1226 Support "arc land --pick" to pick specific changes out of a sequence
	Concern Raised	D21339	rARC7ddaed9aba1a Improve "arc land" behavior in the presence of merge conflicts and change…
		D21337	rARCb003cf93102c Remove "arc feature", "arc branch", "arc bookmark", and significant chunks of…
		D21336	rARC3d64140ff31c Implement "arc work", to replace "arc feature"
		D21335	rARC5abf0b96c8d9 Use MarkerRefs to resolve landing symbols in Mercurial
		D21333	rARC599ba0f999fd Provide a more powerful query mechanism for "markers" (branches/bookmarks)
		D21338	rARCe8c3cc32897e Allow "arc" to accept any prefix of a command as that command
		D21334	rARC31d08f9a8faf Remove old Mercurial code testing for rebase and phase support
		D21332	rARC78e9cc9c0129 Add a check for ambiguous merge strategies after the "history.immutable"…
		D21331	rARCc5192bde3445 Allow users to save prompt responses in "arc" workflows
		D21330	rARCf3f31155b761 Format "arc land" passthru commands more nicely, and execute them from CWD
		D21329	rARC0bf4da60f6d6 Make Mercurial use "hg shelve" and "hg unshelve" in dirty working copies in…
		D21328	rARC4d61c005310e Improve final messages under "arc land --hold"
		D21325	rARC709c9cb6fbe8 Improve the logic for identifying ambiguous commits and applying "--revision"…
		D21326	rARCa30378a34ab1 Update "arc help land"
		D21324	rARC8a53b5a4517d When landing changes in an empty repository, merge cleanly in Git
		D21319	rARC57d0d690cc76 Modernize output when pruning branches in Git during "arc land"
		D21318	rARC94f78cf87c78 Provide more information about merge progress in "arc land" under Git
		D21322	rARC1552397c8695 Sometimes discard already-closed revisions in "arc land"
		D21320	rARC25afb93f7ad4 In "arc land", rebase branches in natural order
		D21321	rARC6fb84e5164cc Add a synopsis and example for "arc help land"
		D21315	rARC68f28a171888 Substantially modernize the "arc land" workflow
		D21316	rARC7d615a97e240 In "arc branch" output, sort branches updated in the same second by name
		D21317	rARC3ed81d35a23a When "arc" receives SIGWINCH or other signals during display of a prompt…
		D21313	rARC7ac3b791b05a Provide modern config options for "arc land" configuration
		D21314	rARC0da395ffe4c9 Introduce "RepositoryLocalState", a modern version of "requireCleanWorkingCopy…
		D21312	rARCde607e9fbc3f Add modern refs and hardpoints for buildables, builds, and build plans
		D21311	rARCc1a4bee4a178 Add "Author" and "Parent Revision" hardpoints to RevisionRefs
		D21310	rARC6af46f289a14 Support short aliases and repeatable arguments in Arcanist Workflow arguments
		D21308	rARC0e8247400713 Support appending arbitrary lines to DisplayRef output
		D21309	rARC7c80a9006d2a Add a "%?" ("hint") conversion to "tsprintf()"
		D21307	rARCfc3974ed70c6 Impose a HardpointEngine future parallelism limit
rP Phabricator
		D21373	rP7d496f2c6d7b Collapse repository URI normalization code into Arcanist
		D21346	rP5b1dd96e40e8 Add an explicit "uri" to the "harbormaster.buildable.search" results

	F7555488: Screen Shot 2020-06-08 at 4.23.04 PM.png
	Jun 8 2020, 11:24 PM

	epriestley
	May 30 2020, 11:58 PM
