The "affected paths" index, which allows open revisions to be queried by which paths they affect, has been updated:
- The frozen API method `differential.query` no longer supports the `paths` parameter (and will raise an error if it is provided).
- The modern API method `differential.revision.search` now supports an `affectedPaths` constraint. Provide a list of paths:
{F8539350}
It may be useful to use the `repositoryPHIDs` constraint alongside the `affectedPaths` constraint. If you do not, queries like `/example.txt` will find revisions affecting `example.txt` in **any** repository. With both constraints, querying will be able to select a more efficient query plan, although the `affectedPaths` constraint will also work on its own.
**Affected Path Index**
With `phabricator.developer-mode` enabled, you can now inspect the affected path index directly:
{F8539356}
{F8539358}
These changes fixed a couple of bugs and limitations with the index. In particular:
- If you edited the "Repository" of a revision, the index wasn't rebuilt properly (it still recorded the revision as affecting paths in the old repository).
- If you created a revision without a "Repository", the index wasn't built at all (but there was also no way to query it).
The index is now always built, and always rebuilt when the repository is changed. This index update is not retroactive. You can optionally run this script if you'd like to fix existing indexes:
```name=update-affected-path-index.php
<?php
require_once 'scripts/init/init-script.php';
$viewer = PhabricatorUser::getOmnipotentUser();
$query = id(new DifferentialRevisionQuery())
->setViewer($viewer)
->needActiveDiffs(true)
->setOrderVector(array('-id'));
$iterator = new PhabricatorQueryIterator($query);
foreach ($iterator as $revision) {
try {
$active_diff = $revision->getActiveDiff();
$diff = id(new DifferentialDiffQuery())
->withPHIDs(array($active_diff->getPHID()))
->setViewer($viewer)
->needChangesets(true)
->executeOne();
id(new DifferentialAffectedPathEngine())
->setRevision($revision)
->setDiff($diff)
->updateAffectedPaths();
echo tsprintf(
"%s\n",
pht(
'Reindexed %s.',
$revision->getMonogram()));
} catch (Exception $ex) {
echo tsprintf(
"%s\n",
pht(
'Exception while indexing %s: %s',
$revision->getMonogram(),
$ex->getMessage()));
}
}
echo tsprintf(
"%s\n",
pht('Done.'));
```
```
phabricator/ $ php -f update-affected-path-index.php
```
This script is safe to run while Phabricator is online.
**Limitations**
There is currently no way to search for revisions that affect `/example.txt` but do not have a repository. This probably isn't terribly useful. (As a workaround, you could query for revisions affecting `/example.txt`, then filter them on the client.)
---
//Original Request//
See PHI2025. An install would like to query revisions by affected path, as Diffusion does to generate the "Recently Open Revisions" panel (which is a bit oddly named?).
`DifferentialRevisionQuery` currently has `withPath(repository_id, path_id)`, which may be called multiple times. It is called by:
- Diffusion, to populate "Recently Open Revisions".
- Differential, to populate the "Recent Similar Revisions" panel.
- `differential.query` supports an explicit "paths" constraint.
The `DifferentialAffectedPath` table stores `repositoryID`, `pathID`, `revisionID`, and has a key on `<repositoryID, pathID, epoch>`.
`epoch` seems like it's useless; nothing ever queries it.
The `<repositoryID, ...>` would almost certainly be better as `<pathID, ...>`. This table is never queried with a repository ID and no path ID.
This would be easier to query with `repositoryPHID` than `repositoryID`, but that's probably not worth the effort of migration.
Upshot:
- Rename "Recently Open Revisions" to "Recent Open Revisions".
- Change the `->withPath(repository_id, path_id)` API to `->withPaths(array $list_of_path_strings)`.
- Drop the `<repositoryID, pathID, epoch>` key.
- Drop the `epoch` column.
- Add a `<pathID, repositoryID>` key.
- Expose a constraint.
The `withRepositoryPHIDs(...)` + `withPaths(...)` API is technically less powerful than `withPath(repository_id, path_id)`. For example, it can not, in a single query, select:
> (revisions which affect `/a/b/c.txt` in repository A) unioned with (revisions which affect `/x/y/z.txt` in repository B)
There's likely no use case for this and any reasonable caller can almost certainly execute two queries.