-
thanos
Highly available Prometheus setup with long term storage capabilities. A CNCF Incubating project.
-
mimir
Grafana Mimir provides horizontally scalable, highly available, multi-tenant, long-term storage for Prometheus.
-
InfluxDB
Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
Prometheus is VERY scalable, and is ideally run in Kubernetes. Resource consumption is really tied to storage. If you need 30 days (or more of storage) look at other storage engines to 'offload' that data. Two very popular ones are Thanos and mimir . You can also tune down the frequency of scrapes to cut down on resource consumption (Do you need to know how full your disk is every 15 seconds, or is 60 seconds sufficient, etc)
Prometheus is VERY scalable, and is ideally run in Kubernetes. Resource consumption is really tied to storage. If you need 30 days (or more of storage) look at other storage engines to 'offload' that data. Two very popular ones are Thanos and mimir . You can also tune down the frequency of scrapes to cut down on resource consumption (Do you need to know how full your disk is every 15 seconds, or is 60 seconds sufficient, etc)
Prometheus should work great for monitoring a few dozen of servers. If you need more scalable monitoring solution, then take a look at VictoriaMetrics - this is a Prometheus-like monitoring system optimized for low resource usage. It can scale to multiple nodes if needed - see VictoriaMetrics cluster docs.