Check them out here!

Thank you so much for joining us! Recordings of the talks are available so we can all continue being resilient together.

Failover Conf on April 21st - 8am to 5pm PDT

A virtual event on reliability 

Thank you for joining us online to learn from the best and brightest in reliability and resilience engineering.

Replays of all of the talks can be found here!
(No need to register -- Just click!)


Being a resilient engineer means building systems that are hardened against the expected failures and resilient enough to withstand the unexpected ones.

This year we expected the opportunity to gather in-person to share our knowledge and experiences building production systems with one another. Then the unexpected happened, forcing many events to cancel or postpone.

But we’re resilient. When one opportunity falls through, we "failover" to another.

Thanks to  Cockroach Labs for being our Accessibility Partner, we were able to offer live closed captioning in English, French, German, and Spanish during the event!

Pitfalls in Measuring SLOs April 21, 2020, 09:30 PM
Danyel Fisher & Liz Fong-Jones Honeycomb & Honeycomb
Pitfalls in Measuring SLOs April 21, 2020, 09:30 PM
Danyel Fisher & Liz Fong-Jones Honeycomb & Honeycomb
Fight, Flight, or Freeze - Releasing Organizational Trauma April 21, 2020, 04:40 PM
Matt Stratton Transformation Specialist, Red Hat
Reliability Matters More Than Ever April 21, 2020, 04:00 PM
Tammy Butow Principal SRE, Gremlin

Thank you to all of our amazing speakers!

Whether you’re leveraging the lessons they’ve learned the hard way, or building off of their proven tactics, you’ll be learning from engineers who build and operate the world’s most reliable distributed systems.  

WATCH THE REPLAYS

Proudly supported by

Partnerships for this event are closed. Interested in partnering with us in the future? Email events@gremlin.com and we'll keep you updated on future opportunities.