Site Reliability Engineering

How Google Runs Production Systems

    • £30.99

Publisher Description

The overwhelming majority of a software system’s lifespan is spent in use, not in design or implementation. So, why does conventional wisdom insist that software engineers focus primarily on the design and development of large-scale computing systems?

In this collection of essays and articles, key members of Google’s Site Reliability Team explain how and why their commitment to the entire lifecycle has enabled the company to successfully build, deploy, monitor, and maintain some of the largest software systems in the world. You’ll learn the principles and practices that enable Google engineers to make systems more scalable, reliable, and efficient—lessons directly applicable to your organization.

This book is divided into four sections:
Introduction: Learn what site reliability engineering is and why it differs from conventional IT industry practices
Principles: Examine the patterns, behaviors, and areas of concern that influence the work of a site reliability engineer (SRE)
Practices: Understand the theory and practice of an SRE’s day-to-day work: building and operating large distributed computing systems
Management: Explore Google's best practices for training, communication, and meetings that your organization can use

Genre: Computing & Internet
Released: 23 March 2016
Language: English (EN)
Length: 552 pages
Publisher: O'Reilly Media
Size: 8.7 MB

The Art of Scalability: Scalable Web Architecture, Processes, and Organizations for the Modern Enterprise (2009)
Building Microservices (2021)
Software Engineering at Google (2020)
The Missing README (2021)
Monolith to Microservices (2019)
DevOps for Digital Leaders (2016)
The Site Reliability Workbook (2018)
Reliable Machine Learning (2021)
Engenharia de Confiabilidade do Google (2016)
Site Reliability Engineering. Jak Google zarządza systemami producyjnymi (2017)
IPv6 Network Administration (2005)
Software Engineering at Google (2020)
The Art of Capacity Planning (2017)
Seeking SRE (2018)
Observability Engineering (2022)
Software Architecture: The Hard Parts (2021)
Designing Data-Intensive Applications (2017)