Support diffs between abstract block lists in the UI
Closed, Resolved · Public

Assigned To

Authored By

	epriestley
	Sep 25 2019, 4:19 PM

Description

See PHI1453. See PHI1459. See PHI456. See PHI452. See T13105. See T13414.

Various interests are served by abstracting diffs and diff presentation so that the unit of change is a "block" rather than a "line of text". An immediate application is sensible UI diffing of Jupyter notebooks. A related application is more scalable prose diffs.

This is not conceptually hard because we can transform a "list of blocks" into a "list of lines of text" by hashing each block (and this is what diff does internally anyway, more or less). There's just a lot of mapping and separation of concepts ("block content" vs "block hash" vs "text line"; "line number" vs "block identifier") that are currently conflated.
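As a rough illustration of the hash-and-diff idea (not the actual Phabricator implementation), the sketch below hashes each block and then runs an ordinary sequence diff over the hashes. The names `block_hash` and `diff_blocks` are hypothetical, and Python's `difflib` stands in for `diff`.

```python
import hashlib
from difflib import SequenceMatcher


def block_hash(content: str) -> str:
    """Reduce a block's content to a fixed-size token (hypothetical helper)."""
    return hashlib.sha256(content.encode("utf-8")).hexdigest()


def diff_blocks(old_blocks, new_blocks):
    """Diff two block lists by diffing their hash lists.

    Returns rows of (op, old_index, new_index), where op is "=", "-" or "+".
    Block content and block position stay separate from the hash that is
    actually compared.
    """
    old_hashes = [block_hash(b) for b in old_blocks]
    new_hashes = [block_hash(b) for b in new_blocks]
    matcher = SequenceMatcher(a=old_hashes, b=new_hashes, autojunk=False)

    layout = []
    for tag, i1, i2, j1, j2 in matcher.get_opcodes():
        if tag == "equal":
            for k in range(i2 - i1):
                layout.append(("=", i1 + k, j1 + k))
        else:
            # "replace", "delete" and "insert" all reduce to removals
            # followed by additions.
            for i in range(i1, i2):
                layout.append(("-", i, None))
            for j in range(j1, j2):
                layout.append(("+", None, j))
    return layout
```

A changed block shows up as a removal followed by an addition; deciding whether two such blocks are "the same block, edited" is a separate step.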

There may be some technical limits later -- for example, inlines are associated with a numeric "line number", not a freeform "block identifier". There may be a path out of this with clever mapping. In the case where the block is a 3D model and we want to attach an inline to a vertex or something we probably can't just do clever mapping, but this is a different flavor of abstraction; for now, we're only going as far as block lists with a well-defined one-dimensional order.

There's a support issue somewhere about inlines at specific offsets within a line, which might arise here, but hopefully that can be scoped separately. The ideal abstraction is that inlines have a "block identifier" (normally, a "line number") and then a separate "location within the block", which might be "characters 9-13" for a line of text or "<12, 34, 29>" for a 3D model.
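One possible shape for that abstraction, as a sketch only: `InlineAnchor` and `anchor_for_line` are hypothetical names, not existing Phabricator types. The point is that a line number is just one kind of block identifier, and the in-block location is optional and engine-specific.

```python
from dataclasses import dataclass
from typing import Optional, Tuple, Union

# Hypothetical location types: a character range inside a line of text,
# or a point in a 3D model.
TextRange = Tuple[int, int]
Point3D = Tuple[float, float, float]


@dataclass(frozen=True)
class InlineAnchor:
    """Where an inline comment attaches (illustrative, not a real schema)."""
    block_id: str
    location: Optional[Union[TextRange, Point3D]] = None


def anchor_for_line(line_number: int, chars: Optional[TextRange] = None) -> InlineAnchor:
    """Express a classic "line number" inline in block terms."""
    return InlineAnchor(block_id=str(line_number), location=chars)
```

For example, `anchor_for_line(42, (9, 13))` would describe characters 9-13 of line 42, while `InlineAnchor("mesh-7", (12, 34, 29))` would describe a point on a model block.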

My planned path is:

Allow DocumentEngine to elect into diff rendering.
Elect the image engine into diff rendering.
Make the image engine emit a generic block list. Make the ChangesetParser render a block list through the Renderer.
- The "image stage" code moves to the image engine (or the block list if reuse is interesting).
- The "scaffolding" code moves to "renderer->renderBlockList()".

Then, in whatever order is easiest -- these steps don't have obvious dependencies:

Between emission of the block list and rendering, diff the block list.
Make inlines work on blocks emitted as part of a block list.
Elect the Jupyter engine into diff rendering and emit Jupyter notebooks as a block list.
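The "elect into diff rendering" step in the plan above could look roughly like this. Everything here is a hypothetical sketch: the class and method names (`can_diff`, `emit_blocks`, `blocks_for`) are invented for illustration and are not the real DocumentEngine API. The only external assumption is the standard notebook JSON layout, where a top-level `cells` list holds entries whose `source` is a string or a list of strings.

```python
import json
from typing import List, Optional, Sequence


class DocumentEngine:
    """Hypothetical base class: engines do not diff unless they elect to."""

    def can_diff(self, filename: str) -> bool:
        return False

    def emit_blocks(self, content: str) -> List[str]:
        raise NotImplementedError


class JupyterEngine(DocumentEngine):
    def can_diff(self, filename: str) -> bool:
        return filename.endswith(".ipynb")

    def emit_blocks(self, content: str) -> List[str]:
        # One block per notebook cell, in document order.
        blocks = []
        for cell in json.loads(content).get("cells", []):
            source = cell.get("source", "")
            if isinstance(source, list):
                source = "".join(source)
            blocks.append(source)
        return blocks


def select_engine(engines: Sequence[DocumentEngine], filename: str) -> Optional[DocumentEngine]:
    """Return the first engine that elects to diff this file, if any."""
    for engine in engines:
        if engine.can_diff(filename):
            return engine
    return None


def blocks_for(engines: Sequence[DocumentEngine], filename: str, content: str) -> List[str]:
    """Emit a block list, falling back to plain lines of text."""
    engine = select_engine(engines, filename)
    if engine is None:
        return content.splitlines()
    return engine.emit_blocks(content)
```

Falling back to lines of text keeps ordinary source diffs on the same path: a text file is just a block list where every block is one line.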

Revisions and Commits

rPHU libphutil
	D20837	rPHUf39a03df2bc5 Move PhutilProseDiff out of "libphutil/"
rP Phabricator
	D20959	rPccf28a81121e Fix an issue where the last line of block-based diffs could be incorrectly…
	D20897	rPd4491ddc225e Fix an issue with 1up diff block rendering for added or removed blocks
	D20866	rPf5b3528a2670 (stable) Fix an issue where added or removed source files could incorrectly…
	D20866	rP292f8fc612bd Fix an issue where added or removed source files could incorrectly select a…
	D20865	rP8ff0e3ab351f Support rich diff rendering with DocumentEngine for added/removed files
	D20851	rP344a2e39bed7 In Jupyter notebooks, apply intraline diffing to source code lines
	D20848	rP76d9912932bc Force unified abstract block diffs into roughly usable shape
	D20845	rPd9515e82a3ec Perform basic block interdiffs when diffing abstract blocks, and interdiff…
	D20844	rP5afdc620db59 Make basic Juypter notebook rendering improvements and roughly support folding…
	D20843	rPa7f3316aa3df Improve sequencing of various content/header checks in abstract block diffs
	D20840	rP2c06815edb0f When rendering Jupyter notebook diffs, split code inputs into individual blocks
	D20838	rP9d884f144f01 Add "PhutilProseDiff" classes to "phabricator/"
	D20836	rP281598d65cfc Use a hash-and-diff strategy to produce a diff layout for block-based documents
	D20835	rP932d829af339 Improve behavior of inline comment highlight reticle for block diffs
	D20834	rPa09b298d85d7 Correct DOM node metadata to let inline comments work against block-based diffs
	D20833	rP1c4450d39f9b Allow the Jupyter engine to elect to emit diffs, and emit Jupyter documents as…
	D20832	rP7ae711ed3e0b Add a "View as..." option to diff dropdowns for selecting between document…
	D20831	rPbb71ef6ad6cd Render image diffs as abstract blocks diffs via DocumentEngine
	D20830	rPa73f592d7d08 Allow DocumentEngine to elect into diff construction

Related Objects

Mentioned Here: T13642: Inline comment line numbers on DocumentEngine block diffs may be clamped to raw lines in the source file
T13105: Plans: Rich presentation and diff rendering pipelines for various file types
T13414: Raise the effective corpus size limit in "PhutilProseDifferenceEngine" by using "diff" for coarse passes

Event Timeline

epriestley triaged this task as Normal priority. Sep 25 2019, 4:19 PM

epriestley created this task.

Herald added a subscriber: amckinley. Sep 25 2019, 4:19 PM

epriestley added a revision: D20830: Allow DocumentEngine to elect into diff construction. Sep 25 2019, 4:26 PM

TwoUp image comments aren't triggering (also in master).
OneUp image comments need some cell span adjustments.

Screen Shot 2019-09-25 at 10.08.10 AM.png (671×272 px, 27 KB)

epriestley added a revision: D20831: Render image diffs as abstract blocks diffs via DocumentEngine. Sep 25 2019, 5:14 PM

In an effort to "do no harm", I'm planning to add a "Render with Document Engine..." option to the View Options dropdown next. This will let you (for example) view a Jupyter notebook as a raw source diff if you want, if the "fancy" diff is broken or unhelpful for some reason, so you always have an escape hatch back to a lower level representation.

Porting inline comments across representations seems hopeless. In the future, the preferred representation may be a 3D model viewer, and the lower level representation may be a hexdump. Porting a comment at <13, 14, 15> in three-dimensional space into an offset in the hexdump of a model file is meaningless.

For now, I'm just going to let the comments do whatever they want (at least, as long as it isn't "fatal"). In the future, I'll probably hide comments on the non-preferred representation with a note ("6 inline comments on the "Jupyter Notebook" view of this file are hidden because you're viewing it as "Source".") or maybe stick them at the bottom or in a popup or something. In the far future, we probably need to store which representation an inline appears on, in case the preferred representation changes because of configuration changes. We could then make the non-preferred configurations read-only, or, I guess, let users hide comments on different representations to create a fun puzzle for reviewers.
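A sketch of the "store which representation an inline appears on" idea, assuming a hypothetical `Inline` record with a `representation` field (nothing like this exists in the schema today). It splits inlines into the ones that belong to the current view and a set of notes about the ones that do not.

```python
from dataclasses import dataclass
from typing import Dict, List, Tuple


@dataclass
class Inline:
    """Hypothetical inline comment tagged with the view it was made on."""
    text: str
    representation: str  # e.g. "Jupyter Notebook" or "Source"


def partition_inlines(inlines: List[Inline], viewing: str) -> Tuple[List[Inline], List[str]]:
    """Return (visible inlines, notes describing hidden inlines)."""
    visible = [i for i in inlines if i.representation == viewing]

    # Count hidden inlines per representation, preserving first-seen order.
    hidden: Dict[str, int] = {}
    for inline in inlines:
        if inline.representation != viewing:
            hidden[inline.representation] = hidden.get(inline.representation, 0) + 1

    notes = []
    for representation, count in hidden.items():
        noun = "inline comment" if count == 1 else "inline comments"
        verb = "is" if count == 1 else "are"
        notes.append(
            f'{count} {noun} on the "{representation}" view of this file '
            f'{verb} hidden because you\'re viewing it as "{viewing}".'
        )
    return visible, notes
```

Nothing here tries to port a comment between representations; it only decides which comments to show and what to say about the rest.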

The options in the "View As..." dropdown are exhaustive, and most do not work, because they aren't based on the changeset being rendered (so we'll give you an option to render a Jupyter notebook as audio, for example). This isn't trivial to fix and it isn't terribly important for this to function as an escape hatch back to old behavior.
Since we expect most documents to have a relatively small number of options here, a list of clickable options might be better than a <select /> dropdown.

There's also no "render as native source" option, but there is a "View as Source" option, which doesn't work. Gotcha!

epriestley added a revision: D20832: Add a "View as..." option to diff dropdowns for selecting between document engines. Sep 25 2019, 6:21 PM

Differential shows a "this file is big, so syntax highlighting is disabled by default" warning even when a document engine which does not use syntax highlighting renders the document.

Jupyter as blocks, no diffing or inlines yet:

Screen Shot 2019-09-25 at 12.02.00 PM.png (1×1 px, 435 KB)

epriestley added a revision: D20833: Allow the Jupyter engine to elect to emit diffs, and emit Jupyter documents as blocks. Sep 25 2019, 7:04 PM

epriestley added a revision: D20834: Correct DOM node metadata to let inline comments work against block-based diffs. Sep 25 2019, 7:48 PM

??? it just works ???

Screen Shot 2019-09-25 at 12.49.52 PM.png (898×1 px, 597 KB)

epriestley added a revision: D20835: Improve behavior of inline comment highlight reticle for block diffs. Sep 25 2019, 8:25 PM

epriestley added a revision: D20836: Use a hash-and-diff strategy to produce a diff layout for block-based documents. Sep 25 2019, 9:05 PM

Screen Shot 2019-09-25 at 1.57.05 PM.png (892×1 px, 149 KB)

epriestley added a revision: D20837: Move PhutilProseDiff out of "libphutil/". Sep 25 2019, 9:36 PM

epriestley added a revision: D20838: Add "PhutilProseDiff" classes to "phabricator/". Sep 25 2019, 9:40 PM

epriestley added a commit: rPa73f592d7d08: Allow DocumentEngine to elect into diff construction. Sep 25 2019, 11:23 PM

epriestley added a commit: rPbb71ef6ad6cd: Render image diffs as abstract blocks diffs via DocumentEngine. Sep 25 2019, 11:25 PM

epriestley added a commit: rP7ae711ed3e0b: Add a "View as..." option to diff dropdowns for selecting between document…. Sep 25 2019, 11:29 PM

epriestley added a commit: rP1c4450d39f9b: Allow the Jupyter engine to elect to emit diffs, and emit Jupyter documents as…. Sep 25 2019, 11:32 PM

epriestley added a commit: rPa09b298d85d7: Correct DOM node metadata to let inline comments work against block-based diffs. Sep 25 2019, 11:38 PM

epriestley added a commit: rP932d829af339: Improve behavior of inline comment highlight reticle for block diffs.

epriestley added a commit: rP281598d65cfc: Use a hash-and-diff strategy to produce a diff layout for block-based documents. Sep 25 2019, 11:41 PM

epriestley added a commit: rPHUf39a03df2bc5: Move PhutilProseDiff out of "libphutil/". Sep 25 2019, 11:47 PM

epriestley added a commit: rP9d884f144f01: Add "PhutilProseDiff" classes to "phabricator/". Sep 25 2019, 11:50 PM

epriestley added a revision: D20840: When rendering Jupyter notebook diffs, split code inputs into individual blocks. Sep 26 2019, 2:51 AM

epriestley added a commit: rP2c06815edb0f: When rendering Jupyter notebook diffs, split code inputs into individual blocks. Sep 26 2019, 4:05 AM

epriestley added a revision: D20843: Improve sequencing of various content/header checks in abstract block diffs. Sep 27 2019, 8:18 PM

epriestley added a revision: D20844: Make basic Juypter notebook rendering improvements and roughly support folding unchanged context. Sep 27 2019, 9:31 PM

epriestley added a revision: D20845: Perform basic block interdiffs when diffing abstract blocks, and interdiff markdown in Jupyter notebooks. Sep 28 2019, 12:25 AM

I think the major remaining issues are:

OneUp does not perform block layout properly (we want to try to group sequences of "-" and "+" lines together to make things more readable).
OneUp does not render blocks properly.
I'm not sure what inline comments do in mail. I'm aiming for "not broken" but not sure if we're hitting that bar yet.
When lines of context are folded, you can't expand them. This is currently "not broken" but would be nice to fix.
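For the OneUp layout issue, the grouping might be sketched like this. It assumes rows arrive as tuples whose first element is "-", "+" or "=", which is an assumption for illustration and not the renderer's real data model. `group_for_one_up` is a hypothetical name.

```python
def group_for_one_up(rows):
    """Reorder each run of changed rows so removals come before additions.

    Rows are tuples whose first element is "-", "+" or "="; any other
    elements are carried through untouched. Unchanged rows ("=") act as
    boundaries between runs.
    """
    out = []
    removed = []
    added = []

    def flush():
        # Emit the pending run: all "-" rows first, then all "+" rows.
        out.extend(removed)
        out.extend(added)
        removed.clear()
        added.clear()

    for row in rows:
        if row[0] == "-":
            removed.append(row)
        elif row[0] == "+":
            added.append(row)
        else:
            flush()
            out.append(row)
    flush()
    return out
```

An interleaved run like `-a +A -b +B` becomes `-a -b +A +B`, which tends to read better in a unified view.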

Oh, and maybe lack of source code intraline diffs, although that's likely not too much work.

Context isn't automatically unfolding around inlines, but should.
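A minimal sketch of what "unfold around inlines" could mean, assuming blocks are addressed by zero-based index and a fixed context radius. The function name and parameters are hypothetical. Blocks with inlines are treated like changed blocks when deciding what stays visible.

```python
def visible_blocks(block_count, changed, inline_blocks, context=3):
    """Return sorted indexes of blocks that should not be folded.

    A block stays visible if it is within `context` blocks of a changed
    block or of a block that carries an inline comment.
    """
    keep = set()
    for center in set(changed) | set(inline_blocks):
        lo = max(0, center - context)
        hi = min(block_count - 1, center + context)
        keep.update(range(lo, hi + 1))
    return sorted(keep)
```

Everything not returned here is a candidate for folding.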

epriestley added a revision: D20848: Force unified abstract block diffs into roughly usable shape. Sep 30 2019, 5:34 PM

epriestley added a commit: rPa7f3316aa3df: Improve sequencing of various content/header checks in abstract block diffs. Sep 30 2019, 5:40 PM

epriestley added a commit: rP5afdc620db59: Make basic Juypter notebook rendering improvements and roughly support folding….

epriestley added a commit: rPd9515e82a3ec: Perform basic block interdiffs when diffing abstract blocks, and interdiff…. Sep 30 2019, 5:43 PM

epriestley added a commit: rP76d9912932bc: Force unified abstract block diffs into roughly usable shape.

epriestley added a revision: D20851: In Jupyter notebooks, apply intraline diffing to source code lines. Oct 2 2019, 3:50 PM

epriestley added a commit: rP344a2e39bed7: In Jupyter notebooks, apply intraline diffing to source code lines. Oct 2 2019, 7:35 PM

epriestley added a revision: D20865: Support rich diff rendering with DocumentEngine for added/removed files. Oct 25 2019, 6:47 PM

epriestley added a commit: rP8ff0e3ab351f: Support rich diff rendering with DocumentEngine for added/removed files. Oct 26 2019, 3:21 PM

epriestley added a revision: D20866: Fix an issue where added or removed source files could incorrectly select a DocumentEngine. Oct 26 2019, 7:09 PM

epriestley added a commit: rP292f8fc612bd: Fix an issue where added or removed source files could incorrectly select a…. Oct 26 2019, 7:16 PM

epriestley added a commit: rPf5b3528a2670: (stable) Fix an issue where added or removed source files could incorrectly….

epriestley added a revision: D20897: Fix an issue with 1up diff block rendering for added or removed blocks. Nov 8 2019, 3:35 PM

epriestley added a commit: rPd4491ddc225e: Fix an issue with 1up diff block rendering for added or removed blocks. Nov 8 2019, 3:37 PM

epriestley added a revision: D20959: Fix an issue where the last line of block-based diffs could be incorrectly hidden. Jan 30 2020, 6:45 PM

epriestley added a commit: rPccf28a81121e: Fix an issue where the last line of block-based diffs could be incorrectly….

epriestley moved this task from Backlog to Next on the Differential board. Mar 15 2021, 5:10 PM

The bulk of this work is done and I think there's nothing unique and actionable left here. This is survived by T13642 and other issues.

	F6888182: Screen Shot 2019-09-25 at 1.57.05 PM.png
	Sep 25 2019, 9:18 PM

	F6888097: Screen Shot 2019-09-25 at 12.49.52 PM.png
	Sep 25 2019, 7:50 PM

	F6888054: Screen Shot 2019-09-25 at 12.02.00 PM.png
	Sep 25 2019, 7:03 PM

	F6887940: Screen Shot 2019-09-25 at 10.08.10 AM.png
	Sep 25 2019, 5:10 PM
