Monitoring 5,000 nodes

This page summarizes the projects mentioned and recommended in the original post on /r/devops.

  • prometheus

    The Prometheus monitoring system and time series database.

  • If I were to design everything from scratch now, I'd look at setting up a Prometheus network. Take a look at the PromCon talk "Monitoring Cloudflare's Planet-Scale Edge Network with Prometheus".

  • agent

    Vendor-neutral programmable observability pipelines. (by grafana)

  • Since you're interested in SaaS, this might be a good candidate for Grafana's cloud agent and service. The cloud agent is open source and based on the Prometheus code.

  • goflow

    The high-scalability sFlow/NetFlow/IPFIX collector used internally at Cloudflare.

  • For example, for a lot of IDS work, you want to capture netflows if you can. This is something you could do with goflow. Then you can use whatever SIEM or flow analysis tools you prefer to figure out what is touching each network location.
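
To make the "what is touching each network location" idea concrete, here is a minimal sketch of the analysis step in Python. It assumes the collected flows have already been exported as JSON lines; the field names (`src_addr`, `dst_addr`, `bytes`), the subnets, and the `talkers_by_location` helper are all hypothetical and are not goflow's actual output schema, so adjust them to whatever your collector emits.

```python
import ipaddress
import json
from collections import defaultdict

# Hypothetical network locations to watch; replace with your own subnets.
LOCATIONS = {
    "db": ipaddress.ip_network("10.0.1.0/24"),
    "web": ipaddress.ip_network("10.0.2.0/24"),
}


def talkers_by_location(lines, locations=LOCATIONS):
    """Map each named location to {source address: total bytes sent to it}."""
    result = {name: defaultdict(int) for name in locations}
    for line in lines:
        line = line.strip()
        if not line:
            continue  # skip blank lines in the export
        flow = json.loads(line)  # assumed: one JSON flow record per line
        dst = ipaddress.ip_address(flow["dst_addr"])
        for name, net in locations.items():
            if dst in net:
                result[name][flow["src_addr"]] += int(flow.get("bytes", 0))
    # Convert the defaultdicts to plain dicts for easier printing/comparison.
    return {name: dict(sources) for name, sources in result.items()}


# Made-up sample records using documentation address ranges.
SAMPLE_FLOWS = [
    '{"src_addr": "192.0.2.10", "dst_addr": "10.0.1.5", "bytes": 1200}',
    '{"src_addr": "192.0.2.10", "dst_addr": "10.0.1.9", "bytes": 300}',
    '{"src_addr": "198.51.100.7", "dst_addr": "10.0.2.20", "bytes": 500}',
    '{"src_addr": "192.0.2.10", "dst_addr": "172.16.0.1", "bytes": 50}',
]

if __name__ == "__main__":
    for location, sources in talkers_by_location(SAMPLE_FLOWS).items():
        print(location, sources)
```

At 5,000 nodes you would more likely run this kind of aggregation inside a SIEM or a streaming job than in a single script, but the grouping logic stays the same: bucket each flow by the location its destination falls in, then rank the sources.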
