Compilation of public failure/horror stories related to Kubernetes
-
Updated
Jul 7, 2020 - HTML
Compilation of public failure/horror stories related to Kubernetes
Linux, Jenkins, AWS, SRE, Prometheus, Docker, Python, Ansible, Git, Kubernetes, Terraform, OpenStack, SQL, NoSQL, Azure, GCP, DNS, Elastic, Network, Virtualization
A curated list of Site Reliability and Production Engineering resources.
Enable Self-Service Operations: Give specific users access to your existing tools, services, and scripts
An active monitoring software to detect failures before your customers do.
Site Reliability Engineer Interview Preparation Guide
Web UI for Jaeger
A framework for gradual system automation
What to Read to Learn More About DevOps
Knowledge seeks no man
Linux Bash Shell Script and Python Script For Ops and Devops
Guidance on how to make your environment easier to onboard for Web Ops Engineers, SRE's and DevOps Practitioners
A reading/viewing list for larval stage sysadmins and SREs
Marmot workflow execution engine
Curated list of good SRE interview questions.
Collection of AWS SSM Documents to perform Chaos Engineering experiments
Google Site Reliability Engineering book converted in audio
s3-streaming-upload is node.js library that listens to your stream and upload its data to Amazon S3 using ManagedUpload API.
Notes on Site Reliability Engineering. Leave a
Collection of python scripts to run failure injection on AWS infrastructure
A curated list of awesome Site Reliability and Production Engineering resources.
Stackdriver Sandbox is an open source tool that helps practitioners to learn Service Reliability Engineering practices from Google and apply them on their cloud services using Cloud Operations suite of tools.
The Skinny Distributed Lock Service
A curated list of Site Reliability and Production Engineering Tools
Chaos Injection library for AWS Lambda
An end to end example of implementing SLOs with prometheus, grafana and Go.
Add a description, image, and links to the sre topic page so that developers can more easily learn about it.
To associate your repository with the sre topic, visit your repo's landing page and select "manage topics."