When DevOps met SRE: From Apollo 13 to Google SRE


The complexity of managing and delivering the high level of reliability expected of web-based, cloud hosted systems today, and the expectation of Continuous Delivery of new features has led to the evolution of a totally new field of Service Reliability Engineering catered for such systems. Google, who has been a pioneer in this field, calls it Site Reliability Engineering (SRE). While it would be more aptly named Service Reliability Engineering, the name has caught on. The seminal work documenting Google approach and practices is in the book by Google by the same name (commonly referred to as the ‘SRE book’), and has become the defacto standard on how to adopt SRE in an organization. This session will cover adopting SRE as a practice in organizations also adopting DevOps; address the challenges to adopting SRE faced by large traditional enterprises, and how to overcome them

Speaker

sanjeev-sharma

Sanjeev Sharma

 

Sanjeev Sharma is an internationally known DevOps and Cloud Transformation thought leader, technology executive, and author. Sanjeev’s industry experience includes tenures as CTO, Technical Sales leader, and Cloud Architect leader. As an IBM Distinguished Engineer, Sanjeev is recognized at the highest levels of IBM’s core of technical leaders.

Sanjeev provides leadership to drive the adoption of cutting edge solutions, architectures and strategies for Cloud and DevOps transformations, and advises C-level and senior technical executives leading these transformations. Sanjeev just published his 2nd book ‘DevOps Adoption Playbook’. He regularly blogs and podcasts on DevOps and Cloud adoption on his popular blog