sarus vs mpi-operator
Metric | sarus | mpi-operator
---|---|---
Mentions | 2 | 1
Stars | 121 | 400
Growth | 4.1% | 2.0%
Activity | 7.3 | 7.3
Last commit | 13 days ago | 4 days ago
Language | C++ | Go
License | BSD 3-clause "New" or "Revised" License | Apache License 2.0
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
sarus
- Sarus VS Podman: comparison of both technologies
Sarus is an OCI-compatible container engine for HPC: https://sarus.readthedocs.io/en/stable/. From this point of view, its use case is very similar to Podman's.
- Scaling Kubernetes to 7,500 Nodes
The problem with Slurm is how it's typically used: ssh into a shared login node with a shared file system, auth is mostly handled by Linux users, and jobs are submitted with sbatch. A Kubernetes deployment feels much more modern and safe.
I have worked with containers + Slurm, where the vendor libmpi is injected in the container runtime [1] by a hook, which gives you close to bare-metal performance with some container goodness in terms of isolation and deployment.
[1] https://github.com/eth-cscs/sarus
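The hook mechanism mentioned in the comment above can be sketched in Python. Under the OCI runtime specification, a runtime passes the container state as JSON on a hook's stdin, and the bundle's config.json gives the root filesystem path. The sketch below only computes which bind mounts such a hook might perform. It is not Sarus's actual hook, and `plan_bind_mounts`, the `usr/lib` target directory, and the example library path are hypothetical names chosen for illustration.

```python
import os


def plan_bind_mounts(state, config, host_libs, target_dir="usr/lib"):
    """Return (host_path, path_inside_rootfs) pairs, one per host library.

    state     -- OCI container state, i.e. the JSON a runtime writes to a hook's stdin
    config    -- parsed config.json from the bundle directory
    host_libs -- host library paths to expose inside the container
    target_dir is an assumed location; a real hook would look up where the
    container's own MPI libraries live before replacing them.
    """
    bundle = state["bundle"]
    root = config["root"]["path"]
    # root.path may be absolute or relative to the bundle directory.
    rootfs = root if os.path.isabs(root) else os.path.join(bundle, root)
    return [
        (lib, os.path.join(rootfs, target_dir, os.path.basename(lib)))
        for lib in host_libs
    ]


# Hypothetical state and config, shaped like the OCI runtime spec describes.
state = {"ociVersion": "1.0.2", "id": "demo", "status": "created",
         "pid": 1234, "bundle": "/run/bundles/demo"}
config = {"root": {"path": "rootfs"}}

for src, dst in plan_bind_mounts(state, config, ["/opt/vendor/mpi/lib/libmpi.so.12"]):
    print(f"would bind-mount {src} -> {dst}")
```

A real hook would then perform the mounts inside the container's mount namespace (reachable through the `pid` field of the state), which requires elevated privileges and is left out here.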
mpi-operator
- Scaling Kubernetes to 7,500 Nodes
Hi, kube-scheduler maintainer here, currently looking into enabling MPI use cases in k8s.
We started a discussion in https://github.com/kubeflow/mpi-operator/issues/315
What are some alternatives?
kube-batch - A batch scheduler for Kubernetes for high-performance workloads, e.g. AI/ML, BigData, HPC
pyTORCS-docker - Docker-based, gym-like torcs environment with vision.
polyaxon - MLOps Tools For Managing & Orchestrating The Machine Learning LifeCycle
udocker - A basic user tool to execute simple docker containers in batch or interactive systems without root privileges.
onepanel - The open source, end-to-end computer vision platform. Label, build, train, tune, deploy and automate in a unified platform that runs on any cloud and on-premises.
img - Standalone, daemon-less, unprivileged Dockerfile and OCI compatible container image builder.
kserve - Standardized Serverless ML Inference Platform on Kubernetes
crun - A fast and lightweight fully featured OCI runtime and C library for running containers
kubeflow - Machine Learning Toolkit for Kubernetes
runtime-spec - OCI Runtime Specification
optimism-v2 - ARCHIVE of monorepo implementing Boba, an L2 Compute solution built on Optimistic Ethereum - active repo is at https://github.com/bobanetwork/boba