Lessons learned from the recent AWS outage
During the October 19-20 us-east-1 incident, thousands of Multi-AZ deployments stayed perfectly healthy, while experiencing intermittent scaling issues. Zero data loss. Zero downtime for running workloads. But reduced ability to launch new resources during a cascading failure that took 14 hours to fully resolve.