Page MenuHomePhabricator

Make Drydock reclaim unused resources when it reaches a resource limit
ClosedPublic

Authored by epriestley on Dec 18 2015, 5:59 PM.
Tags
None
Referenced Files
F14783545: D14819.id35833.diff
Fri, Jan 24, 10:41 AM
F14783544: D14819.id35829.diff
Fri, Jan 24, 10:41 AM
F14783543: D14819.id.diff
Fri, Jan 24, 10:41 AM
Unknown Object (File)
Sun, Jan 12, 9:25 PM
Unknown Object (File)
Fri, Jan 10, 4:35 AM
Unknown Object (File)
Thu, Jan 9, 7:51 PM
Unknown Object (File)
Thu, Jan 2, 7:32 PM
Unknown Object (File)
Dec 13 2024, 7:54 PM
Subscribers
None

Details

Summary

Fixes T9994. Currently, when Drydock can't allocate a new resource because some limit has been reached, it waits patiently for a resource to become available.

It is possible that no resource will ever become available. Particularly with "Working Copy" resources, the new lease may want a copy of rB, but the resource may already be maxed out on rA.

Right now, no process exists to automatically reclaim the unused rA.

When we encounter this situation, try to reclaim one of the other resources if it is just sitting there unused.

Specifically:

  • Add a "reclaim" command which means "release this resource if it is completely unused".
  • Add a bin/drydock reclaim to send this command to every active resource.
  • When we try to acquire a resource and can't, but only because of some kind of limit / utilization problem, try to release an unused resource to free up some room.
Test Plan
  • Set "Working Copy" resource limit to 1.
  • Ran "Test Configuration" in rA, which worked.
  • Ran "Test Configuration" in rB, which hung forever.
  • Applied patch.
  • Ran "Test Configuration" in rB, saw it reclaim the rA resource, use the slot, then succeed.
  • Ran "Test Configuration" in rA again, saw it grab the slot back.
  • Ran bin/drydock reclaim and saw it reclaim a bunch of old orphaned resources.

Diff Detail

Repository
rP Phabricator
Branch
reclaim1
Lint
Lint Passed
Unit
Tests Passed
Build Status
Buildable 9647
Build 11553: Run Core Tests
Build 11552: arc lint + arc unit

Event Timeline

The build failed because of conflicts between my local working copy and resources on this host (I'm using the same server for both), I'll straighten that out in a sec.

chad edited edge metadata.
This revision is now accepted and ready to land.Dec 18 2015, 7:25 PM
This revision was automatically updated to reflect the committed changes.