alerta-contrib
check_systemd
| | alerta-contrib | check_systemd |
|---|---|---|
| Mentions | 2 | 5 |
| Stars | 118 | 25 |
| Growth | 0.0% | - |
| Activity | 2.8 | 9.0 |
| Last commit | 23 days ago | 20 days ago |
| Language | Python | Python |
| License | MIT License | GNU Lesser General Public License v3.0 only |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
alerta-contrib
-
Looking for a tool to aggregate alerts
Sadly, no Datadog plugin just yet: https://github.com/alerta/alerta-contrib/issues/376
-
Prometheus/Alert Manager
Or use a tool like Alerta: https://alerta.io/ and the plugin for Alertmanager: https://github.com/alerta/alerta-contrib/tree/master/plugins/prometheus Don't try to reinvent the wheel :)
check_systemd
-
What to use for application (process) monitoring?
The best all-around check I have found is check_systemd
-
AlmaLinux 9 / network-scripts?
In that case, it may be helpful to have a reliability "spin", similar to how there are AlmaLinux Live Media images that allow trying an AlmaLinux desktop without installing. That is not an option on the other EL8 systems despite AlmaLinux also offering 1:1 compatibility. A reliability spin might include other reliability-focused components such as monitoring-plugins and check_systemd
-
Linux is dead, long-live Docker monoculture
Fast forward 12 years and I have Icinga2 collectors in each datacenter using check_by_ssh to run check_systemd, all front-ended by Thruk. The TIG stack is something on my list of things to look into at some point, but with Dynatrace available to do all the fancy application monitoring, there's no rush.
-
Anyone using LibreNMS in production?
For alerting for Linux systems, I use Icinga with check_ssh and check_systemd (caveat: distributed primarily on PyPI) with Thruk as the single pane of glass front-end to per-datacenter installations of Icinga.
-
If you wanted to see exactly how a system was performing when a load threshold was reached, how would you do it?
check_systemd is nice to alert based on failed systemd units - https://github.com/Josef-Friedrich/check_systemd but it requires python3 to install, so there is a bit more to install onto a system.
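To make the idea of alerting on failed systemd units concrete, here is a minimal Python sketch of a Nagios/Icinga-style check. It is not check_systemd's actual implementation. The function names (`parse_failed_units`, `evaluate`, `main`) and the message wording are hypothetical and chosen for illustration. It assumes that `systemctl list-units --state=failed --plain --no-legend` prints one failed unit per line with the unit name in the first column, and it uses the conventional plugin exit codes (0 OK, 2 CRITICAL, 3 UNKNOWN).

```python
import subprocess

# Conventional monitoring-plugin exit codes.
OK, CRITICAL, UNKNOWN = 0, 2, 3


def parse_failed_units(listing: str) -> list[str]:
    """Extract unit names (first column) from plain, legend-free systemctl output."""
    units = []
    for line in listing.splitlines():
        fields = line.split()
        if fields:  # skip blank lines
            units.append(fields[0])
    return units


def evaluate(units: list[str]) -> tuple[int, str]:
    """Map the list of failed units to an exit code and a one-line status message."""
    if units:
        return CRITICAL, "SYSTEMD CRITICAL - failed units: " + ", ".join(units)
    return OK, "SYSTEMD OK - no failed units"


def main() -> int:
    """Query systemd, print the status line, and return the plugin exit code."""
    try:
        result = subprocess.run(
            ["systemctl", "list-units", "--state=failed", "--plain", "--no-legend"],
            capture_output=True,
            text=True,
            check=True,
        )
    except (OSError, subprocess.CalledProcessError) as exc:
        print(f"SYSTEMD UNKNOWN - could not query systemd: {exc}")
        return UNKNOWN
    code, message = evaluate(parse_failed_units(result.stdout))
    print(message)
    return code
```

A wrapper script would end with `sys.exit(main())` so the monitoring system sees the exit code. Keeping the parsing separate from the `systemctl` call means the logic can be exercised on sample text without a running systemd.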
What are some alternatives?
karma - Alert dashboard for Prometheus Alertmanager
check_redfish - A monitoring/inventory plugin to check components and health status of systems which support Redfish. It will also create an inventory of all components of a system.
netbox-zabbix-sync - Python script to synchronise Netbox devices to Zabbix.
Zabbix - Real-time monitoring of IT components and services, such as networks, servers, VMs, applications and the cloud.
prometheus-am-executor - Execute command based on Prometheus alerts
Thruk - Thruk is a multibackend monitoring webinterface for Naemon, Nagios, Icinga and Shinken using the Livestatus API.
gassnerZabbixScripts - Scripts related to Zabbix
plotly4nagios - Plotly4Nagios is a Nagios plugin that displays performance data as graphs. It uses the RRD database provided by pnp4nagios and visualizes it in an interactive graph format using the Plotly JavaScript framework.
prom2teams - prom2teams is an HTTP server built with Python that receives alert notifications from a previously configured Prometheus Alertmanager instance and forwards them to Microsoft Teams using defined connectors
node_exporter - Exporter for machine metrics
django-prometheus - Export Django monitoring metrics for Prometheus.io
librenms-mibs - A Collection of 3rd party MIBs