We don't know how to reproduce this, so we can't move forward.
Mar 18 2017
The setsid() change appears to have resolved the issue, in the sense that daemons now start cleanly every time without receiving any inexplicable signals.
The MySQL stuff seems to be in the old daemons (them trying to do logging while MySQL restarts), not the new daemons, and not the cause of the issue.
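As a sketch of the mechanism (in Python rather than Phabricator's PHP, and with a hypothetical helper name), detaching a child into its own session with setsid() means signals delivered to the launcher's session, such as a SIGHUP when the deploy shell exits, no longer reach the daemon:

```python
import os


def start_detached(target):
    """Fork a child, detach it into its own session with setsid(),
    then run target() in the child.

    Hypothetical sketch of the fix described above, not the actual
    overseer code. After os.setsid() the child is a session leader
    in a new session with no controlling terminal, so signals aimed
    at the parent's old session stop arriving.
    """
    pid = os.fork()
    if pid > 0:
        return pid  # parent: child pid of the now-detached process
    os.setsid()      # child: new session, no controlling tty
    try:
        target()
    finally:
        os._exit(0)  # never return into the parent's code path
```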
Mar 16 2017
Do this man
Mar 13 2017
Mar 8 2017
In our case these are commits from landed Differential diffs. Permissions for the web user are there and correct, but I'm not sure I understand all the details around the daemon vs. the web user, and the daemon is doing the work for this git commit parser.
mkdir: cannot create directory ‘/var/storage/3f/49’: Permission denied
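As a quick diagnostic (a hypothetical helper, not part of Phabricator), one could probe whether the user the daemons run as can actually write to the storage root; the default path below is taken from the error message above and may differ per install:

```python
import os
import pwd


def check_storage_writable(path="/var/storage"):
    """Report whether the current user can create entries under path.

    Local-disk file storage creates sharded subdirectories (the
    '3f/49' in the error above), so the daemon user needs write and
    execute permission on the storage root.
    """
    user = pwd.getpwuid(os.geteuid()).pw_name
    if os.access(path, os.W_OK | os.X_OK):
        return "user '%s' can write to %s" % (user, path)
    return "user '%s' cannot write to %s" % (user, path)
```

Running this as the daemon user (rather than the web user) would distinguish a daemon-side permission problem from a web-side one.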
I haven't been able to reproduce this locally by artificially making services unavailable or connections fail. Debugging this in production will require momentary service interruptions on admin until I can narrow things down so I'm going to wait until off-peak to poke at it.
Mar 7 2017
Occasionally, even after running the re-parse command, the basics of the commit will import but the content itself will not. A phd restart will almost immediately allow the commit to then import.
More info: the commit contained binary files (executables actually).
We now have one that's failing even when forcing a reparse:
Is there a way to retrieve the logs from a failing task? Is there any value in providing the logs for a successful task? I'm not really sure what else to say here besides that it's a consistent yet seemingly random issue on our repositories. When we run the trace, it somehow "bumps" the task and things succeed, so it would be difficult to provide detail on an error.
There isn't enough information here for us to reproduce this, or even begin.
Mar 5 2017
In terms of application/code stability, it would be good to handle this properly.
Specifically, I suspect it won't affect repo because db restarts separately, so the database won't be unavailable when daemons on repo restart.
It looks like this might be an issue with MySQL not being available yet when the daemons start. The deployment script restarts MySQL, then immediately restarts the daemons, and a bunch of this stuff is ending up in the log in the 1-2 seconds after the restart:
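One way a deployment script could avoid this race (a hypothetical helper, not the actual deployment code) is to poll the database port between restarting MySQL and restarting the daemons, so the daemons never come up against a database that is still starting; host and port here are assumptions:

```python
import socket
import time


def wait_for_mysql(host="127.0.0.1", port=3306, timeout=30.0, interval=0.5):
    """Poll a TCP port until it accepts connections or timeout expires.

    Returns True once a connection succeeds, False if the deadline
    passes first. Called between 'restart mysql' and 'restart daemons',
    this closes the 1-2 second window described above.
    """
    deadline = time.monotonic() + timeout
    while time.monotonic() < deadline:
        try:
            with socket.create_connection((host, port), timeout=interval):
                return True
        except OSError:
            time.sleep(interval)
    return False
```

Having the daemons themselves retry their initial connection with backoff would be the more robust fix, since it also covers restarts that the deploy script doesn't know about.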
Feb 28 2017
I've deployed everything so I believe this is now resolved. I'll keep an eye on it going forward.
The pool appears to be scaling up and down properly now. I'm going to deploy the fix to the secure and repo tiers since it may affect all instances.
I'm picking D17433 now and sending it to admin.
I'm picking those to stable and sending them to production now.
Feb 24 2017
Hibernating daemons currently show as "Waiting" in the Daemon console, but I'm not going to worry about that for now.
Feb 22 2017
No time like the present.
It is expected that taskmasters will exit and restart after an unexpected failure, but this should not trigger setup warnings. I'll see if I can reproduce this.
Feb 21 2017
I don't believe we've seen this pop up again. I can't be certain, because right now all the log files are empty, but presumably that means the errors went away.
T11708 has almost nothing to do with this, but the fix for this will rewrite the code that's running into issues and probably moot them.
I'm just going to merge this into T12115, which isn't really related, but will rewrite this code and probably "fix" this, since a reproduction case seems elusive.
This could probably be built with RRULEs now, but we don't currently have use cases / plans around a general-purpose cron-like tool.
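For reference, RFC 5545 recurrence rules (RRULEs) can express cron-like schedules declaratively; a hypothetical "every Monday at 09:00" rule would look like:

```
RRULE:FREQ=WEEKLY;BYDAY=MO;BYHOUR=9;BYMINUTE=0
```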
Use cases seem to be fairly well covered now.
This is 2+ years old and autoscaling probably covers it now. T5401 is probably a more tangible attack on this.
This got a little more work when clustering was written but it's essentially a log of two-year-old ghost sightings at this point and not actionable.
I guess this one can live for now since that's reproducible/actionable.
Probably a dupe of T11708? I'm just going to kill this one since it's old as dirt.
I think it is likely that this was resolved by D17123. If it wasn't, we don't have a reproduction case anyway so we can't move forward. I'm going to call this one dead until more information turns up.
We were never able to reproduce this convincingly and as far as we know the reporting install no longer uses Phabricator, so I'm going to close this out.
I believe I have the first part of this (restructuring the code into a more sensible Overseer > Pool > Daemon sort of thing) working, but it could use more testing. I'm going to see if we have anything else in Daemons that I can fix while I'm here to help me kick the tires a bit.
Feb 20 2017
Feb 14 2017
closed per user request
@epriestley Please close this task (I can't).
Feb 2 2017
This is from a million years ago and I now think we should only respect commit messages. If "Fixes Txxx" works or ever worked in comments, I'd say we should actually undo that.
Jan 31 2017
@epriestley That's fair, I'll see what I can do.
Because of the complexity of building a reproduction case and high chance that this is a wild goose chase, we'll move forward with this after a community member confirms it reproduces for them. See T12134 for some discussion. See T12129 for a similar recent report which was a time-consuming wild goose chase.
Jan 30 2017
I upgraded last Friday to 2604c5af55f654d36f8db2f080b96486c4572216, so far this exception has not popped up again. I will check again later in the week.
Jan 17 2017
Jan 12 2017
I'm just going to merge this into T9640, it isn't meaningfully different from an implementation perspective and is the major compatibility issue.
I haven't explained it correctly. What I mean is that this task is in "Needs Triage" now, but once you know roughly how much work it will need, you can set its priority (low, high, whatever).
I can't think of a reason to prioritize this. Specifically, it seems like since Phabricator for most (all?) companies is a business critical piece of software, you'd always choose to run it on the most reliable / stable version of PHP. Is there a reason we should consider that not to be the case?
Now that PHP 7.1 is released, you could assess what will need to change and assign a priority to this task.