-
Ansible
Ansible is a radically simple IT automation platform that makes your applications and systems easier to deploy and maintain. Automate everything from code deployment to network configuration to cloud management, in a language that approaches plain English, using SSH, with no agents to install on remote systems. https://docs.ansible.com.
-
spack
A flexible package manager that supports multiple versions, configurations, platforms, and compilers.
-
InfluxDB
Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
-
Grafana
The open and composable observability and data visualization platform. Visualize metrics, logs, and traces from multiple sources like Prometheus, Loki, Elasticsearch, InfluxDB, Postgres and many more.
Ansible This is a very easy and popular solution but it scales horribly.
Spack This is my go to for HPC software building and distribution. It designed specifically for HPC and Research applications. The included software library is very large and packaging new applications is fairly easy.
SLURM (distributed by OpenHPC) If you have shared storage then this is the industry standard solution that is both open source and free (extremely popular in the top 500 list). You can pair this with a high speed network or not depending on your research workloads.
Prometheus + Grafana This tool combination is very popular and you decent integration with HPC schedulers such as SLURM.
SLURM (distributed by OpenHPC) If you have shared storage then this is the industry standard solution that is both open source and free (extremely popular in the top 500 list). You can pair this with a high speed network or not depending on your research workloads.
Prometheus + Grafana This tool combination is very popular and you decent integration with HPC schedulers such as SLURM.