Chaos Engineering Webinar Series

23 September 2020 14:00 CEST
Chaos Engineering Webinar Series
Building a Chaos Engineering Practice 

Building a chaos engineering practice

  1. Finding failures before they become outages
  2. Improve the availability and reliability of your Kubernetes Clusters
  3. Before the Code Freeze: Preparing for Black Friday and Cyber Monday
  4. Creating a culture of chaos

Audience: Site Reliability Engineers, System Engineers, Kubernetes Administrators, Technical Architects, Application Developers with an affinity with SRE and Technical Management

About this Webinar

19 August Webinar 1 - Finding failure before they become outages

Time: 14:00 CEST - 14:45 CEST

 

Introduction to chaos engineering

 

Most people have heard about Netflix’s Chaos Monkey. However, not everyone understands why they are randomly killing virtual machine instances in their infrastructure.

 

That practice called Chaos Engineering is one of the ways towards more resilient applications and ultimately a more reliable internet.

 

During this webinar, we will introduce you to the ideas behind Chaos Engineering, the benefits it brings to your business and how to get started. We’ll walk you through what we like to call: ‘The 5W’s of Chaos Engineering: What, Why, When, Where and Who’.

 

Speaker

  • Patrick van der Bleek, EMEA Solution Architect at Gremlin

9 September Webinar 2 - Improve the availability and reliability of your Kubernetes clusters

Time: 14:00 CEST - 14:45 CEST

 

Running your first Chaos Experiments on Kubernetes

 

 

Reliability and high availability are key features of Kubernetes, but even the most resilient systems can fail. Applications crash, hardware breaks, and nodes can go offline. These failures can have damaging and unpredictable consequences for organizations, especially those that are unprepared.

 

In our upcoming webinar, we’ll be exploring how to improve the availability and reliability of Kubernetes clusters using the discipline of Chaos Engineering.

 

Outline

  • You will learn how to use Chaos Engineering to safely inject failure into your applications and nodes in order to detect weaknesses
  • Additionally, we’ll walk through specific Chaos Experiments for you to run on Kubernetes to ensure you’ve designed a reliable system
  • By the end of the session, you’ll have specific recommendations for how to harden your infrastructure, improve reliability, and keep your applications running smoothly

 

Speaker

  • Patrick van der Bleek, EMEA Solution Architect at Gremlin

23 September Webinar 3 -Before the Code Freeze: Preparing for Black Friday and Cyber Monday

Time: 14:00 CEST - 14:45 CEST

Black Friday and Cyber Monday break site traffic and sales records each year. It's crucial for engineering teams to prepare for peak traffic because even a few seconds of downtime can cost companies thousands of dollars in lost revenue. However, even the most skilled engineering teams can struggle to keep up with the demands of peak holiday traffic.

In this webinar, we will discuss how to prepare for traffic spikes and how to avoid incidents on Black Friday and Cyber Monday by implementing Chaos Engineering.

Agenda:

  • Walkthrough common Peak Readiness pitfalls and how to avoid them
  • Understand how Chaos Engineering complements traditional load testing and fits into an overall seasonal readiness program
  • Run Chaos Engineering experiments simulating traffic spikes to proactively uncover weaknesses and fix failures before the code freeze
  • Prepare your on-call team by running fire drills to build incident management muscle memory
  • Learn how top companies have successfully avoided incidents on Black Friday and Cyber Monday by implementing Chaos Engineering

Speaker

  • Patrick van der Bleek, EMEA Solution Architect at Gremlin
     

7 October Webinar 4 - Creating a culture of Chaos 

Time: 14:00 CEST - 14:45 CEST

Chaos Engineering is not just Tools - it’s culture

 

Chaos Engineering is becoming a critical part of the DevOps toolchain when adopting Site Reliability Engineering practices.  Every system is becoming a distributed system  and Chaos Engineering proclaims many advantages for them. 

 

It improves infrastructure automation, increases reliability, and transforms incident management. However, an often-overlooked benefit of Chaos Engineering and SRE involves culture transformation.  Culture is often touched upon when talking about Chaos Engineering and SRE but not as often as skills and process. 

 

In this webinar, we will discuss how you can build out a Chaos engineering practice and how you can adopt a true blameless culture and maximize the potential of your team. 

 

Outline

 

  • Hold blameless postmortems
  • Share post mortems with other teams
  • Run regular FireDrills and GameDays
  • Automate Chaos experiments for continuous validation

 

Speaker

  • Roald Ruchti, EMEA Sales Director at Gremlin

 

Speakers

Patrick van der Bleek

Solution Architect EMEA at Gremlin

Roald Ruchti

Sales Director EMEA at Gremlin
Register here!