howtheysre
cloudprober
Our great sponsors
howtheysre | cloudprober | |
---|---|---|
14 | 3 | |
8,918 | 1,428 | |
- | - | |
6.4 | 8.4 | |
4 months ago | over 2 years ago | |
JavaScript | Go | |
Creative Commons Zero v1.0 Universal | Apache License 2.0 |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
howtheysre
-
5 GitHub Projects to Help You Become a Better DevOps Engineer ⚡
1. How they SRE
- Good CI/CD and SRE Blogs
-
Which companies do SRE right?
This repo README has fantastic list of companies doing SRE https://github.com/upgundecha/howtheysre
- HowTheySRE is accepting #hacktoberfest2021 contributions
- How does your company organize the SRE Teams in your company along with the development teams?
-
Which companies implement SRE like Google does?
Relevant link How they SRE
- Looking for a blog post
- How they SRE
- How They SRE
cloudprober
-
Using Alerts in Grafana
Do you mean that your service doesn't have enough traffic all the time? Then you can (and should) use synthetic clients to send requests to your endpoint. They provide both a minimal amount of traffic all day round and also can report on responses they get and improve your coverage. Example project: https://github.com/google/cloudprober
-
How Best to Monitor Incoming Traffic for the Health of Applications
If your service might at times fall to almost zero requests outside business hours, having a synthetic client is a must. You can use Blackbox exporter as mentioned by u/SuperQue or CloudProber, both work well for simple cases (one step site check or API call), for anything more complicated (multi step scenarios) you are better off scripting it.
-
SLOs when your metrics suck?
Relatively easy to achieve: through variety of available opensource projects like cloudprober or blackbox exporter (if your test case is straight forward) or custom made programs out of bash, python, golang (if your test case is more complex).
What are some alternatives?
awesome-sre - A curated list of Site Reliability and Production Engineering resources.
blackbox_exporter - Blackbox prober exporter
iris-web - Collaborative Incident Response platform
sloth - 🦥 Easy and simple Prometheus SLO (service level objectives) generator
wazuh-documentation - Wazuh - Project documentation
Sloth - Mac app that shows all open files, directories, sockets, pipes and devices in use by all running processes. Nice GUI for lsof.
oneuptime - OneUptime is the complete open-source observability platform.
grafana-aws-cloudwatch-dashboards - :cloud: 40+ Grafana dashboards for AWS CloudWatch metrics: EC2, Lambda, S3, ELB, EMR, EBS, SNS, SES, SQS, RDS, EFS, ElastiCache, Billing, API Gateway, VPN, Step Functions, Route 53, CodeBuild, ...
Onyx-CNC-Motherboard - Onyx is the successor of the iconic UNO CNC shield, that runs on an ESP32 microcontroller
tcpprobe - Modern TCP tool and service for network performance observability.
dora-metrics - Small backend project to calculate DORA Metrics
kubesphere - The container platform tailored for Kubernetes multi-cloud, datacenter, and edge management ⎈ 🖥 ☁️