A curated list of Site Reliability and Production Engineering resources.
Updated Jun 10, 2024
A curated list of Site Reliability and Production Engineering Tools
C++ implementation of Raft core logic as a replication library
A simple, zero-dependency, pure js/html status page based on GitHub Pages and Actions.
☕️ Grab a slick name for your new project
A tool to check the availability or syntax of domains, IPs, or URLs.
Notes on Site Reliability Engineering. Leave a 🌟 if you found this useful!
Hermes: a fault-tolerant replication protocol, implemented over RDMA, guaranteeing linearizability and achieving low latency and high throughput.
Monitor your website's availability, HTTP status codes (current and historical), certificates, redirects, and more with Grafana and the Prometheus blackbox exporter.
A curated list of awesome Site Reliability and Production Engineering resources.
Website Availability Monitor: add your website to our dashboard and get 24x7 monitoring of its availability (and a badge!)
Calculate how much downtime should be permitted in your Service Level Agreement or Objective
Rails Postgres ActiveRecord patches for common production workloads
Automatic repair for unhealthy Kubernetes nodes
Kubernetes Operator to manage node maintenance through NodeMaintenance custom resources
[ARCHIVED] Please report to https://github.com/funilrys/PyFunceble.
Case studies on backend architecture, performance, security, high availability, high scalability, data sharding, and more.
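The downtime-budget entry in the list above rests on simple arithmetic, which can be sketched as follows. This is a minimal illustration, not the listed project's code: the function name `allowed_downtime` and the default 30-day window are assumptions made for the example.

```python
from datetime import timedelta


def allowed_downtime(slo_percent: float,
                     window: timedelta = timedelta(days=30)) -> timedelta:
    """Return the downtime budget for an availability target over a window.

    Hypothetical helper: slo_percent is the availability target in percent
    (for example 99.9), and window is the measurement period.
    """
    if not 0 <= slo_percent <= 100:
        raise ValueError("slo_percent must be between 0 and 100")
    # The budget is the fraction of the window that may be unavailable.
    return window * (1 - slo_percent / 100)


if __name__ == "__main__":
    for target in (99.0, 99.9, 99.99):
        print(f"{target}% over 30 days allows {allowed_downtime(target)}")
```

Under these assumptions, a 99.9% target over a 30-day window allows roughly 43 minutes of downtime.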