Niall Richard Murphy Books
Chris Jones, Jennifer Petoff y Niall Richard Murphy son ingenieros de Google con amplia experiencia en operaciones de sistemas y liderazgo técnico en infraestructura global.
Known for: Site Reliability Engineering: How Google Runs Production Systems, The Site Reliability Workbook: Practical Ways to Implement SRE
Books by Niall Richard Murphy

Site Reliability Engineering: How Google Runs Production Systems
Site Reliability Engineering (SRE) es una colección de ensayos escritos por ingenieros de Google que describe cómo la compañía diseña, implementa y mantiene sistemas de producción a gran escala. El li...

The Site Reliability Workbook: Practical Ways to Implement SRE
The Site Reliability Workbook provides practical guidance for implementing Site Reliability Engineering (SRE) principles in real-world organizations. Building on the foundational concepts introduced i...
Key Insights from Niall Richard Murphy
Introduction to Site Reliability Engineering (SRE)
Let me begin by explaining how SRE came to be. In Google’s early years, our operations teams were drowning in manual interventions—pages going off in the middle of the night, systems that didn’t scale, and infrastructure managed by human hands instead of automation. Traditional operations models wer...
From Site Reliability Engineering: How Google Runs Production Systems
The Role and Responsibilities of an SRE Team
The role of an SRE team is to protect reliability without stifling progress. We live in a world where innovation is constant, yet every new feature carries the risk of instability. SRE gives organizations a way to balance these forces. At Google, our mandate is simple: ensure systems meet their reli...
From Site Reliability Engineering: How Google Runs Production Systems
Defining and Implementing SLOs, SLIs, and Error Budgets
Every engineering leader knows the struggle: your users want flawless uptime, your developers want to ship features quickly, and your operations team wants stability above all else. At first glance, these goals seem incompatible. Site Reliability Engineering reframes this tension as a quantifiable t...
From The Site Reliability Workbook: Practical Ways to Implement SRE
Incident Management and the Culture of Learning
No system is perfect, and outages are inevitable. The difference between resilient organizations and fragile ones lies in how they respond when things go wrong. Within SRE, we cultivate a culture where incidents are opportunities for learning, not occasions for blame. In practice, this starts with ...
From The Site Reliability Workbook: Practical Ways to Implement SRE
About Niall Richard Murphy
Chris Jones, Jennifer Petoff y Niall Richard Murphy son ingenieros de Google con amplia experiencia en operaciones de sistemas y liderazgo técnico en infraestructura global.
Frequently Asked Questions
Chris Jones, Jennifer Petoff y Niall Richard Murphy son ingenieros de Google con amplia experiencia en operaciones de sistemas y liderazgo técnico en infraestructura global.
Read Niall Richard Murphy's books in 15 minutes
Get AI-powered summaries with key insights from 2 books by Niall Richard Murphy.