Those of you using prometheus as part of your observability stack, what approach did you take to scaling to scrape 25+ clusters, and why? Is Thanos the answer to my problems?

This page summarizes the projects mentioned and recommended in the original post on /r/devops

InfluxDB - Power Real-Time Data Analytics at Scale
Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
www.influxdata.com
featured
SaaSHub - Software Alternatives and Reviews
SaaSHub helps you find the best software and product alternatives
www.saashub.com
featured
  • thanos

    Highly available Prometheus setup with long term storage capabilities. A CNCF Incubating project.

  • I understand that Thanos (https://github.com/thanos-io/thanos) was built with the idea of improving prom's scalability and availability , but would love to hear from others that have tried various approaches to try to solve this.

  • agent

    Vendor-neutral programmable observability pipelines. (by grafana)

  • Furthermore, would recommend Grafana Agent OR Prometheus Agent in this case since you probably don't need the Prometheus UI in each Cluster as well as the Alerting stuff that is inside Prometheus. (Mimir will do the ruling stuff for you). Grafana Agent also has an Operator mode if you want to use ServiceMonitor and PodMonitor CustomResources.

  • InfluxDB

    Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

    InfluxDB logo
NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a more popular project.

Suggest a related project

Related posts

  • Grafana to sumologic pricing

    1 project | /r/devops | 12 Apr 2023
  • Best unicorn monitoring system?

    3 projects | /r/sysadmin | 17 Mar 2023
  • Grafana agent JSON Schema

    1 project | /r/grafana | 8 Mar 2023
  • Monitoring Internet Quality and Speed

    2 projects | /r/sysadmin | 11 Feb 2023
  • Prometheus vs EFS: I don't know who to believe

    1 project | /r/sysadmin | 24 Jan 2023