E-book details

Becoming a Rockstar SRE. Electrify your site reliability engineering mindset to build reliable, resilient, and efficient systems

Jeremy Proffitt, Rod Anami

Ebook
Site reliability engineering is all about continuous improvement, finding the balance between business and product demands while working within technological limitations to drive higher revenue. But quantifying and understanding reliability, handling resources, and meeting developer requirements can sometimes be overwhelming. With a focus on reliability from an infrastructure and coding perspective, Becoming a Rockstar SRE brings forth the site reliability engineer (SRE) persona using real-world examples.

This book will acquaint you with the role of an SRE, followed by the why and how of site reliability engineering. It walks you through the jobs of an SRE, from automating CI/CD pipelines and reducing toil to reliability best practices. You’ll learn what creates bad code and how to circumvent it with reliable design and patterns. The book also guides you through interacting and negotiating with businesses and vendors on technical matters, and it explores observability, outages, and why and how to craft an excellent runbook. Finally, you’ll learn how to elevate your site reliability engineering career, with coverage of certifications and interview tips and questions.

By the end of this book, you’ll be able to identify and measure reliability, reduce downtime, troubleshoot outages, and enhance productivity to become a true rockstar SRE!
  • 1. SRE Job Role – Activities and Responsibilities
  • 2. Fundamental Numbers – Reliability Statistics
  • 3. Imperfect Habits – Duct Tape Architecture and Spaghetti Code
  • 4. Essential Observability – Metrics, Events, Logs, and Traces (MELT)
  • 5. Resolution Path – Master Troubleshooting
  • 6. Operational Framework – Managing Infrastructure and Systems
  • 7. Data Consumed – Observability Data Science
  • 8. Reliable Architecture – Systems Strategy and Design
  • 9. Valued Automation – Toil Discovery and Elimination
  • 10. Exposing Pipelines – GitOps and Testing Essentials
  • 11. Worker Bees – Orchestrations of Serverless, Containers, and Kubernetes
  • 12. Final Exam – Tests and Capacity Planning
  • 13. First Thing – Runbooks and Low Noise Outage Notifications
  • 14. Rapid Response – Outage Management Techniques
  • 15. Postmortem Candor – Long-Term Resolution
  • 16. Chaos Injector – Advanced Systems Stability
  • 17. Interview Advice – Hiring and Being Hired
  • 18. Appendix A The Site Reliability Engineer Manifesto
  • 19. Appendix B The 12-Factor App Questionnaire
  • Title: Becoming a Rockstar SRE. Electrify your site reliability engineering mindset to build reliable, resilient, and efficient systems
  • Author: Jeremy Proffitt, Rod Anami
  • Original title: Becoming a Rockstar SRE. Electrify your site reliability engineering mindset to build reliable, resilient, and efficient systems
  • ISBN: 9781804614563
  • Date of issue: 2023-04-28
  • Format: Ebook
  • Item ID: e_3d43
  • Publisher: Packt Publishing