SE-Radio Episode 284: John Allspaw on System Failures: Preventing, Responding, and Learning From

Filed in Episodes by on March 7, 2017 1 Comment

allspaw-100x125John Allspaw CTO of Etsy speaks with Robert Blumen about systemic failures and outages; how are systems defended against outages?; why do they fail anyway?; why are failures not entirely preventable?; why do outages involve multiple failures?; the time that Etsy identified it’s own office as a potential source of fraud; the human as part of the system; is human error an important component of failure?; understanding human action during failures; what can we learn from outages?; effective post-mortems; testing as a way of preventing failure; the limitations of testing; testing in production.

Venue: Internet

Related Links

Tags: , , , ,