-
thanos
Highly available Prometheus setup with long term storage capabilities. A CNCF Incubating project.
For a homelab I think prometheus + grafana is easy to get started and scales well. There are lots of ways to set up the architecture. Prometheus can write to a directory on a filesystem, it can be set to write to a remote server, and there are other projects to integrate object storage (s3, minio, etc) or influxdb for long term storage and downsampling.
-
InfluxDB
Purpose built for real-time analytics at any scale. InfluxDB Platform is powered by columnar analytics, optimized for cost-efficient storage, and built with open data standards.
-
cortex
A horizontally scalable, highly available, multi-tenant, long term Prometheus. (by cortexproject)
For a homelab I think prometheus + grafana is easy to get started and scales well. There are lots of ways to set up the architecture. Prometheus can write to a directory on a filesystem, it can be set to write to a remote server, and there are other projects to integrate object storage (s3, minio, etc) or influxdb for long term storage and downsampling.
-
I stopped using rancher because running rancher and a k8s cluster uses a lot of memory. My current cluster is managed with kubespray. It requires fiddling with python and ansible but at least it provides a declarative way to manage your cluster in code and config files. Their readme was enough to get me going. To add nodes you run kubespray again targeting the new node. To upgrade k8s, you bump the git tag and run kubespray again.