Awesome Site Reliability Engineering
An Awesome list of Site Reliability Engineering resources, best practices, and tools.
About this tool
title: Awesome Site Reliability Engineering slug: awesome-site-reliability-engineering url: https://github.com/dastergon/awesome-sre#readme brand: sindresorhusawesome brand_logo: /sindresorhus/awesome/raw/main/media/logo.png category: themed-directories tags:
- sre
- devops
- monitoring featured: false
Overview
Awesome Site Reliability Engineering is a curated "awesome list" of resources related to Site Reliability Engineering (SRE) and production operations. It aggregates links to articles, books, tools, best practices, and learning materials that help teams design, operate, and scale reliable systems.
Features
-
Curated SRE Resource Directory
Collection of community-maintained links focused on Site Reliability Engineering and production operations. -
Broad Topic Coverage
Includes materials on:- Core SRE principles and culture
- Incident management and postmortems
- Monitoring, observability, and alerting
- Reliability engineering practices in production systems
- DevOps-aligned practices relevant to SRE
-
Learning & Reference Materials
Organizes links to:- Articles and blog posts
- Conference talks and presentations
- Books and long-form guides
- Tutorials and how-to resources
-
Tools & Best Practices
Highlights tools, frameworks, and patterns used for:- Monitoring and metrics
- Logging and tracing
- Reliability testing and validation
- Operational automation
-
Open Source & Community-Driven
Hosted on GitHub with:- Public repository for transparency
- Contribution guidelines (CONTRIBUTING.md) for adding and maintaining entries
- Continuous updates via community pull requests and discussions
-
Part of the "Awesome" Ecosystem
Listed in the broader sindresorhus/awesome collection, following its conventions for quality and curation.
Pricing
- Not applicable. This is a free, open-source curated list hosted on GitHub.
Loading more......
Information
Categories
Tags
Similar Products
6 result(s)A curated awesome list of learning resources, tools, and best practices for detection engineering—designing, building, and operating detective cybersecurity controls.
A curated Awesome list of tools, exporters, dashboards, and resources for the Prometheus open-source monitoring system.
awesome-actions is an awesome-list dedicated to curating useful GitHub Actions. It provides a structured directory of reusable actions for CI/CD and automation on GitHub, aligning with the meta-collection of awesome directories.
A curated Awesome directory of resources, tools, and roles for Ansible, the open‑source automation and configuration management platform.
An Awesome list of tools, frameworks, and practices for managing technical and business operations via chat (ChatOps).
An awesome list of offensive security research, techniques, and resources related to attacking CI/CD systems and software delivery pipelines.