Our great sponsors
-
WorkOS
The modern identity platform for B2B SaaS. The APIs are flexible and easy-to-use, supporting authentication, user identity, and complex enterprise features like SSO and SCIM provisioning.
It might not be the answer you are looking but I would take a look at components published by System76/Lambda labs such as this to pick the one that would suit me: https://github.com/system76/thelio/blob/master/Thelio%20Comm...
If you would like to put Kubernetes on top of this kind of setup this repo is helpful https://github.com/robrohan/skoupidia
The main benefit for me using it for my ML work loads is you can shutoff nodes entirely when you are not using them, then when you turn them back on they just rejoin the cluster.
It also helps managing different types of devices and workload (tpu vs gpu vs cpu)
Just sharing.
2 x RTX4090 workstation guide
https://github.com/eul94458/Memo/blob/main/dual_rtx4090works...