|4 days ago||7 days ago|
|Apache License 2.0||BSD 3-clause "New" or "Revised" License|
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
My Journey With Spark On Kubernetes... In Python (1/3)
4 projects | dev.to | 12 Apr 2021
For our experiments, we will use Volcano which is a batch scheduler for Kubernetes, well-suited for scheduling Spark applications pods with a better efficiency than the default kube-scheduler. The main reason is that Volcano allows "group scheduling" or "gang scheduling": while the default scheduler of Kubernetes schedules containers one by one, Volcano ensures that a gang of related containers (here, the Spark driver and its executors) can be scheduled at the same time. If for any reason it is not possible to deploy all the containers in a gang, Volcano will not schedule that gang. This article explains in more detail the reasons for using Volcano.
going to replace our old cluster, which way xCat or Warewulf?
1 project | reddit.com/r/HPC | 11 Apr 2021
What are some alternatives?
singularity - Singularity: Application containers for Linux
spark-on-k8s-operator - Kubernetes operator for managing the lifecycle of Apache Spark applications on Kubernetes.
kube-batch - A batch scheduler of kubernetes for high performance workload, e.g. AI/ML, BigData, HPC
charts - ⚠️(OBSOLETE) Curated applications for Kubernetes
liqo - Building your endless Kubernetes ocean
gokey - A simple vaultless password manager in Go
helm - The Kubernetes Package Manager [Moved to: https://github.com/helm/helm]