Tag: devops

SE-Radio Episode 301: Jason Hand on Handling Outages

Filed in Episodes on August 29, 2017

Bryan Reinero talks with Jason Hand about handling outages and responding to failures. The episode explores basic problem-solving strategies and diagnostic techniques, organizing teams to address incidents efficiently, communicating with stakeholders, learning from incidents, and managing stress. Related links: Episode 284 – John Allspaw on System Failures: Preventing, Responding, and Learning From; Episode 225 […]

SE-Radio Episode 289: James Turnbull on Declarative Programming with Terraform

Filed in Episodes on April 25, 2017

James Turnbull rejoins the show with Robert Blumen for a conversation mostly about Terraform, as well as a bit about Puppet. Terraform is a declarative programming tool for automating infrastructure resource creation; it targets resource providers such as Amazon Web Services (AWS), Microsoft Azure, DigitalOcean, and other cloud and SaaS back ends. The discussion explores the […]

SE-Radio Episode 284: John Allspaw on System Failures: Preventing, Responding, and Learning From

Filed in Episodes on March 7, 2017

John Allspaw, CTO of Etsy, speaks with Robert Blumen about systemic failures and outages: how systems are defended against outages; why they fail anyway; why failures are not entirely preventable; why outages involve multiple failures; the time that Etsy identified its own office as a potential source of fraud; the human as part […]

SE-Radio Episode 276: Björn Rabenstein on Site Reliability Engineering

Filed in Episodes on December 6, 2016

Björn Rabenstein discusses the field of Site Reliability Engineering (SRE) with host Robert Blumen. The term SRE has recently emerged to mean Google’s approach to DevOps. The publication of Google’s book on SRE has brought many of its practices into more public discussion. The interview covers: what is distinct about SRE versus DevOps; the SRE […]

SE-Radio Episode 268: Kief Morris on Infrastructure as Code

Filed in Episodes on September 13, 2016

Kief Morris, cloud specialist at ThoughtWorks and author of the recent book Infrastructure as Code, talks to Sven Johann about why this concept is becoming increasingly important due to cloud computing. They discuss best practices for writing infrastructure code, including why you should treat your servers as cattle, not pets, as well as how to […]
