Check them out here!

Thank you so much for joining us! Recordings of the talks are available so we can all continue being resilient together.

Failover Conf on April 21st - 8am to 5pm PDT

A virtual event on reliability 

Thank you for joining us online to learn from the best and brightest in reliability and resilience engineering.

Replays of all of the talks can be found here!
(No need to register -- Just click!)


Being a resilient engineer means building systems that are hardened against the expected failures and resilient enough to withstand the unexpected ones.

This year we expected the opportunity to gather in-person to share our knowledge and experiences building production systems with one another. Then the unexpected happened, forcing many events to cancel or postpone.

But we’re resilient. When one opportunity falls through, we "failover" to another.

Thanks to  Cockroach Labs for being our Accessibility Partner, we were able to offer live closed captioning in English, French, German, and Spanish during the event!

The Halo of Resilience Engineering April 22, 2020, 12:10 AM
J. Paul Reed Senior Applied Resilience Engineer, Netflix
Pitfalls in Measuring SLOs April 21, 2020, 09:30 PM
Danyel Fisher & Liz Fong-Jones Honeycomb & Honeycomb
Swim Don’t Sink: Why Training Matters to a Site Reliability Engineering Practice April 21, 2020, 05:20 PM
Jennifer Petoff Google
How to fail with Serverless April 21, 2020, 10:10 PM
Jeremy Daly AlertMe

Thank you to all of our amazing speakers!

Whether you’re leveraging the lessons they’ve learned the hard way, or building off of their proven tactics, you’ll be learning from engineers who build and operate the world’s most reliable distributed systems.  

WATCH THE REPLAYS

Proudly supported by

Partnerships for this event are closed. Interested in partnering with us in the future? Email events@gremlin.com and we'll keep you updated on future opportunities.