cloudprober
sloth
Our great sponsors
cloudprober | sloth | |
---|---|---|
3 | 11 | |
1,428 | 1,949 | |
- | - | |
8.4 | 0.0 | |
over 2 years ago | about 2 months ago | |
Go | Go | |
Apache License 2.0 | Apache License 2.0 |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
cloudprober
-
Using Alerts in Grafana
Do you mean that your service doesn't have enough traffic all the time? Then you can (and should) use synthetic clients to send requests to your endpoint. They provide both a minimal amount of traffic all day round and also can report on responses they get and improve your coverage. Example project: https://github.com/google/cloudprober
-
How Best to Monitor Incoming Traffic for the Health of Applications
If your service might at times fall to almost zero requests outside business hours, having a synthetic client is a must. You can use Blackbox exporter as mentioned by u/SuperQue or CloudProber, both work well for simple cases (one step site check or API call), for anything more complicated (multi step scenarios) you are better off scripting it.
-
SLOs when your metrics suck?
Relatively easy to achieve: through variety of available opensource projects like cloudprober or blackbox exporter (if your test case is straight forward) or custom made programs out of bash, python, golang (if your test case is more complex).
sloth
-
SLOscribe: embed SLO/SLI into GO source code
It’s a CLI that allows developers to embed SLO annotation into GO code as comments and generate Prometheus alert groups when paired with Sloth, https://github.com/slok/sloth.
-
help setting SLIs/SLOs
SLOTH: https://github.com/slok/sloth
-
Observability Mythbusters: Yes, Observability-Landscape-as-Code is a Thing
Note: Although it’s outside of the scope of this post to dig deep into this topic, in case you’re curious, you can check out what an OpenSLO YAML definition looks like here.
- Pyrra v0.3.0 released
-
What you use for observability?
The actual hard part is standardizing all teams on SLI/SLO-based thinking. For that we're looking at tools like Sloth.
- How do you measure the reliability of a Kubernetes platform?
-
Calculating Remaining Error Budget
Have a look at sloth (https://github.com/slok/sloth) which will help you generate SLOs and error budgets given a PromQL query. This might be easier than trying to calculate it yourself. Plus, it's "metrics as code" and OpenSLO spec compliant.
-
openSLO
If you are in k8s and use Prometheus you could take a look at sloth: https://github.com/slok/sloth which can either generate the rules/alerts for you, or can run as an operator and allows you to write SLOs as k8s kinds.
-
SLI/Error Budget Calculators and management
Check out https://github.com/slok/sloth
- SLO calculation
What are some alternatives?
blackbox_exporter - Blackbox prober exporter
pyrra - Making SLOs with Prometheus manageable, accessible, and easy to use for everyone!
Sloth - Mac app that shows all open files, directories, sockets, pipes and devices in use by all running processes. Nice GUI for lsof.
slo-computer - SLOs, Error windows and alerts are complicated. Here an attempt to make it easy SLO Computer makes setting and monitoring SLOs for all your services intuitively seamless and blazingly fast. Community Support on Discord - https://discord.com/invite/Q3p2EEucx9
grafana-aws-cloudwatch-dashboards - :cloud: 40+ Grafana dashboards for AWS CloudWatch metrics: EC2, Lambda, S3, ELB, EMR, EBS, SNS, SES, SQS, RDS, EFS, ElastiCache, Billing, API Gateway, VPN, Step Functions, Route 53, CodeBuild, ...
kube-prometheus - Use Prometheus to monitor Kubernetes and applications running on Kubernetes
tcpprobe - Modern TCP tool and service for network performance observability.
OpenSLO - Open specification for defining and expressing service level objectives (SLO)
howtheysre - A curated collection of publicly available resources on how technology and tech-savvy organizations around the world practice Site Reliability Engineering (SRE)
kube-state-metrics - Add-on agent to generate and expose cluster-level metrics.
kubesphere - The container platform tailored for Kubernetes multi-cloud, datacenter, and edge management ⎈ 🖥 ☁️
mtail - extract internal monitoring data from application logs for collection in a timeseries database