A small number of deployments failed with an internal error
During a routine software update, one server in one of Aptible's non-production infrastructure stacks didn't receive updated software, and as a result, app deployments that were scheduled there did not succeed.
Due to the limited scope of the issue, only a very small number of apps and deployments were affected (respectively 7 apps over 14 deployments). If you're investigating a deployment failure, you can identify affected deployments by an internal error logged in your deployment output immediately after attempting to lock ports.
Running apps were not affected (neither in production nor development environments).
Feb 6, 16:13 EST
Small number of development apps unavailable
Due to an EC2 instance failure, a small number of developments apps became unavailable at 15:20 UTC (apps scaled to more than 1 container were not affected). 85% of affected apps were recovered within 10 minutes, and as of 15:40 UTC, all affected apps have been recovered.
Jun 24, 11:42 EST
A low number of API requests are timing out
Our error capture suggests the few API failures we observed were due to network glitches. The problem has subsided for now. We've deployed additional monitoring to better track those errors and will consider additional remediation steps should the problem occur again.
Feb 5, 13:53-14:39 EST