Tag: monitoring

Episode 367: Diomidis Spinellis on Debugging

Filed in Episodes by on May 21, 2019 1 Comment
Episode 367: Diomidis Spinellis on Debugging

Felienne talks to Diomidis Spinellis about debugging.  The discussion covers: defining “debugging” which can mean using a debugger or the process of finding and removing bugs; how is it done best; variation across different programming languages or paradigms; debugging different types of applications; tools for debugging: print-statements, version-control systems, operating system tools. Assessing debugging skills […]

Continue Reading »

SE-Radio Episode 337: Ben Sigelman on Distributed Tracing

Filed in Episodes by on September 11, 2018 1 Comment
SE-Radio Episode 337: Ben Sigelman on Distributed Tracing

Ben Sigelman CEO of LightStep and co-author of the OpenTracing standard discusses distributed tracing, a form of event-driven observability useful in debugging distributed systems, understanding latency outlyers, and delivering “white box” analytics.  Host Robert Blumen spoke with Sigelman about the basics of tracing, why it is harder in a distributed system, the concept of tracing […]

Continue Reading »

SE-Radio-Episode-325: Tammy Butow on Chaos Engineering

Filed in Episodes by on May 29, 2018 1 Comment
SE-Radio-Episode-325: Tammy Butow on Chaos Engineering

Edaena Salinas talks with Tammy Butow about Chaos Engineering. Topics include: the factors that caused Chaos Engineering to emerge, the different types of chaos that can be introduced to a system, how to structure experiments. Some of the chaos engineering experiments that were discussed are: DNS related attacks, black hole attacks and database attacks. Tammy […]

Continue Reading »

SE-Radio Episode 301: Jason Hand on Handling Outages

Filed in Episodes by on August 29, 2017 1 Comment
SE-Radio Episode 301: Jason Hand on Handling Outages

Bryan Reinero talks with Jason Hand about handling outages and responding to failures. The episode explores basic problem-solving strategies and diagnostic techniques, organizing teams to address incidents efficiently, communicating with stakeholders, learning from incidents, and managing stress.   Related Links Episode 284 – John Allspaw on System Failures: Preventing, Responding, and Learning From Episode 225 […]

Continue Reading »

SE-Radio Episode 277: Gil Tene on Tail Latency

Filed in Episodes by on December 14, 2016 1 Comment
SE-Radio Episode 277: Gil Tene on Tail Latency

Gil Tene joins Robert Blumen for a discussion of tail latency. What is latency? What is “tail latency”? Why are the upper percentiles of latency more relevant to humans? How is human interaction with an application influenced by tail latency? What are the economics of tail latency? What are the origins of tail latency within […]

Continue Reading »