⚓ T7352 Improve daemon scalability in the cluster

Get daemonID into the storage table so I can clean up phd status.
Change taskmasters to use autoscale pools when started with phd start.
Merge the GarbageCollector and Trigger daemons.
Double check that phd stop will still stop old daemons before landing all this stuff (it should, and users "shouldn't" hit this, but...)

I consider auto-anything to be generally scary because things can easily autopilot out of control. I've seen more than a couple of postmortems where a minor failure cascaded into a total failure because of a misbehaving automated recovery, automated scaling, etc.

These autoscaling pools have a dangerous, cascading failure mode: when a pool scales up, it consumes more resources and tends to exert pressure on other pools to scale up, since they'll have fewer resources and take longer to complete work. In an extreme case, several pools can scale up together and push a box into swap, and that could impose a huge performance penalty on other pools and make them all scale up, too. Then all the pools max out and the box thrashes itself to death and can probably never complete work fast enough to recover.

We're particularly susceptible to this failure immediately following a problem with the task queue. If the daemons restart into an existing task backlog, that will create a scaling pressure across all the pools (more work to do than usual), which will reinforce itself and push the box toward a thrashing death spiral where all pools scale up at once and starve each other out. This is an especially bad time to be susceptible to failures, since it could mean that one failure with the queue cascades into a second, larger and more complex failure if we restart daemons to try to fix it. This problem also gets harder to resolve over time, because the backlog will exert more upward pressure on pool sizes and scale pools up more quickly after a restart.

I think the cleanest way to prevent this is to put a hard free memory limit on pool scale-up: say, pools never autoscale up if the machine has less than 20% of its RAM free. This prevents the box from swapping itself to death. Resource allocation may not be totally fair (the first pools to grow get to stay large, and later pools don't get to expand) but system will self-heal over time as long as the work completion rate exceeds the rate at which new work is being generated.

btrahan awarded a token.Feb 23 2015, 6:02 PM

epriestley added a revision: D11863: Add a utility class for getting system memory information.Feb 23 2015, 6:08 PM

epriestley added a revision: D11864: Implement memory reserves in autoscale pools.Feb 23 2015, 6:30 PM

epriestley added a revision: D11865: Track daemon unique IDs in Phabricator daemon logs.Feb 23 2015, 7:28 PM

epriestley added a revision: D11866: Emit exit event from daemon handle when daemon is not running.Feb 23 2015, 7:31 PM

epriestley claimed this task.Feb 23 2015, 8:59 PM

epriestley added a revision: D11871: Convert taskmasters to use an autoscale pool.Feb 24 2015, 1:59 AM

epriestley added a revision: D11872: Merge GC daemon into Trigger daemon.Feb 24 2015, 2:41 AM

epriestley added a revision: D11873: Allow modern `phd stop` to stop old daemons cleanly.Feb 24 2015, 2:58 AM

epriestley added a commit: rPHU77f0eda5b427: Add a utility class for getting system memory information.Feb 24 2015, 3:52 PM

joshuaspence added a subscriber: joshuaspence.Feb 24 2015, 9:54 PM

epriestley closed this task as Resolved by committing rPa3518e19a565: Merge GC daemon into Trigger daemon.Feb 24 2015, 10:51 PM

epriestley added a commit: rPHUa219ac2a3635: Pass most daemon configuration over stdin.

epriestley added a commit: rPHU55861bcbd6a5: Use stdio, not signals, to heartbeat from the daemons.

epriestley added a commit: rPHU0e5b0f293436: Receive most overseer configuration over stdin.

epriestley added a commit: rPHUbd7d8e9fca98: Separate individual daemon process logic into PhutilDaemonHandle.

epriestley added a commit: rPHUe6cc2aaa36f7: Implement memory reserves in autoscale pools.

epriestley added a commit: rPHU4f2da5719488: Support daemon autoscaling in libphutil.

epriestley added a commit: rPHU46764a249766: Emit exit event from daemon handle when daemon is not running.

epriestley added a commit: rP6771a70499e5: Update Phabricator for DaemonOverseer vs DaemonHandle split.

epriestley added a commit: rPf0f2b2cbeb1d: Start all daemons under a single overseer.

epriestley added a commit: rP09f3d0bb7ec0: Pass overseer configuration over stdin.

epriestley added a commit: rPc2d66f29cd80: Make `phd` more aware of multiple daemons under a single overseer.

epriestley added a commit: rP48fc3126a124: Support autoscaling daemons in phd.

epriestley added a commit: rPef22fe1e743c: Add a --force command to `phd start`.

epriestley added a commit: rPa354e5fa6b94: Track daemon unique IDs in Phabricator daemon logs.

epriestley added a commit: rPaf303f458b9c: Convert taskmasters to use an autoscale pool.

epriestley added a commit: rP38636a39cf2d: Allow modern `phd stop` to stop old daemons cleanly.

epriestley added a commit: rPa3518e19a565: Merge GC daemon into Trigger daemon.

This stuff seems to be working (daemons restarted cleanly; autoscale worked; queue flushed; no errors) so I'm going to roll it to the cluster.

Seems to be working in the cluster, too.

epriestley mentioned this in Blog Post: Development Notes (2015 Week 47).Nov 22 2015, 9:55 PM

epriestley mentioned this in T12298: Allow daemon pools to autoscale down to 0 processes.Feb 20 2017, 4:30 PM

		Restricted Diffusion Commit
rPHU libphutil
	D11866	rPHU46764a249766 Emit exit event from daemon handle when daemon is not running
	D11864	rPHUe6cc2aaa36f7 Implement memory reserves in autoscale pools
	D11859	rPHU4f2da5719488 Support daemon autoscaling in libphutil
	D11851	rPHUbd7d8e9fca98 Separate individual daemon process logic into PhutilDaemonHandle
	D11854	rPHU0e5b0f293436 Receive most overseer configuration over stdin
	D11853	rPHUa219ac2a3635 Pass most daemon configuration over stdin
	D11850	rPHU55861bcbd6a5 Use stdio, not signals, to heartbeat from the daemons
	D11863	rPHU77f0eda5b427 Add a utility class for getting system memory information
rP Phabricator
	D11872	rPa3518e19a565 Merge GC daemon into Trigger daemon
	D11873	rP38636a39cf2d Allow modern `phd stop` to stop old daemons cleanly
	D11871	rPaf303f458b9c Convert taskmasters to use an autoscale pool
	D11865	rPa354e5fa6b94 Track daemon unique IDs in Phabricator daemon logs
	D11861	rPef22fe1e743c Add a --force command to `phd start`
	D11860	rP48fc3126a124 Support autoscaling daemons in phd
	D11855	rP09f3d0bb7ec0 Pass overseer configuration over stdin
	D11856	rPc2d66f29cd80 Make `phd` more aware of multiple daemons under a single overseer
	D11857	rPf0f2b2cbeb1d Start all daemons under a single overseer
	D11852	rP6771a70499e5 Update Phabricator for DaemonOverseer vs DaemonHandle split

Improve daemon scalability in the cluster
Closed, ResolvedPublic
Actions

Description

Revisions and Commits

Related Objects
Search...

Event Timeline

Improve daemon scalability in the clusterClosed, ResolvedPublicActions

Description

Revisions and Commits

Related ObjectsSearch...

Event Timeline

Improve daemon scalability in the cluster
Closed, ResolvedPublic
Actions

Related Objects
Search...