#
reliability-engineering
Here are
104 public repositories
matching this topic...
A curated list of Site Reliability and Production Engineering resources.
Chaos Engineering Experiments Automation & Orchestration
Updated
Jul 30, 2020
Python
Serverless chaos monkey for AWS (runs on AWS Lambda) ☁️ 💥
Updated
Jul 15, 2020
JavaScript
Probabilistic Risk Analysis Tool (fault tree analysis, event tree analysis, etc.)
A curated list of awesome Site Reliability and Production Engineering resources.
A curated list of Site Reliability and Production Engineering Tools
GOV.UK PaaS - Cloud Foundry
The Chaos Toolkit core library
Updated
Jul 30, 2020
Python
A collection of SRE tools
An opinionated list of attributes and policies that need to be met in order to establish a stable software system.
GSP is a container platform and curated suite of components helping government deploy, run, observe and secure their services
Updated
Aug 11, 2020
Ruby
A collection templates ported from the SRE Workbook
A terraform provider for Concourse
Terraform configuration to manage a Prometheus server running on AWS.
A library to create service brokers for any service provider
A Go application for generating billing data from cloudfoundry events
[Work In Progress] A Site Reliability Engineering Community Blog
Administration tool for GOV.UK PaaS
Updated
Aug 12, 2020
TypeScript
🔖 Daily-updated reading list for designing High Scalability 🍒 , High Availability 🔥 , High Stability 🗻 back-end systems - Pull requests are greatly welcome 👬 I hope you will find this project helpful 🍀 Please help me share it to more and more people ❤️ Thank you - 谢谢 - धन्यवाद - ধন্যবাদ - Спасибо - شكرا - Merci - Gracias - Danke - Cảm ơn! 🙇
Documentation for Reliability Engineering services
Updated
Aug 13, 2020
Ruby
Bootstrap a VPC with BOSH and Concourse to run PaaS
Updated
Aug 12, 2020
Ruby
A small, underdocumented Puppet module for hardening Ubuntu systems.
Updated
Jul 9, 2020
Puppet
Technical documentation for GOV.UK PaaS
Updated
Aug 10, 2020
Ruby
Terraform configuration to manage a Prometheus server running on AWS.
Updated
Jul 30, 2020
Ruby
Team manual for Reliability Engineering and its sub-teams.
Updated
Aug 13, 2020
Ruby
Improve this page
Add a description, image, and links to the
reliability-engineering
topic page so that developers can more easily learn about it.
Curate this topic
Add this topic to your repo
To associate your repository with the
reliability-engineering
topic, visit your repo's landing page and select "manage topics."
Learn more
You can’t perform that action at this time.
You signed in with another tab or window. Reload to refresh your session.
You signed out in another tab or window. Reload to refresh your session.