-
flatcar-forklift
SystemD service to deploy always up-to-date kernel modules for Flatcar Container Linux
-
container-engine-accelerators
Collection of tools and examples for managing Accelerated workloads in Kubernetes Engine
That leaves Flatcar and Clear Linux, both of which at least have documentation for installing and running Nvidia drivers and CUDA. Flatcar has this repository from Nvidia, and I've also found a project called forklift, which will supposedly handle auto-updating the kernel modules for you. The Clear Linux docs also seem to include a method to auto-rebuild the modules on kernel upgrades, though they say the driver version needs to be updated manually, which honestly almost sounds preferable considering how finicky Nvidia drivers can be on Linux. Clear Linux also has several other tutorials/guides that appear to market it for things like machine learning, which leads me to believe that Nvidia GPUs would hopefully work decently on it.
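For context on why the modules need rebuilding at all: out-of-tree modules like the Nvidia one are built against a specific kernel release, so after a kernel upgrade the new release has no matching module until something rebuilds it. Here is a rough sketch of that check, not taken from forklift or the Clear Linux docs. The function names `find_module` and `needs_rebuild` are made up for illustration, and `/lib/modules/<release>` is only the conventional location, so an image-based distro may keep its modules somewhere else.

```python
import os
import platform

# Suffixes a kernel module file may carry (plain or compressed).
MODULE_SUFFIXES = (".ko", ".ko.xz", ".ko.zst", ".ko.gz")


def find_module(name, release=None, modules_root="/lib/modules"):
    """Return the path of module `name` built for kernel `release`, or None.

    Hypothetical helper: `release` defaults to the running kernel
    (the same string `uname -r` prints), and `modules_root` is assumed
    to be the conventional /lib/modules directory.
    """
    release = release or platform.release()
    wanted = {name + suffix for suffix in MODULE_SUFFIXES}
    release_dir = os.path.join(modules_root, release)
    # os.walk yields nothing if release_dir does not exist.
    for dirpath, _dirnames, filenames in os.walk(release_dir):
        for filename in filenames:
            if filename in wanted:
                return os.path.join(dirpath, filename)
    return None


def needs_rebuild(name, release=None, modules_root="/lib/modules"):
    """True when no module file named `name` exists for that kernel release."""
    return find_module(name, release, modules_root) is None


if __name__ == "__main__":
    # Checks the running kernel for an "nvidia" module file.
    print("rebuild needed:", needs_rebuild("nvidia"))
```

This only tells you that a rebuild is due. It doesn't say which driver version to build, which is the part Clear Linux leaves manual.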
I don't know about Flatcar, but Fedora CoreOS provides good instructions for building a custom image: https://github.com/coreos/coreos-assembler/blob/main/docs/building-fcos.md
The container-based driver installation method looks good too. GKE does something similar: https://github.com/GoogleCloudPlatform/container-engine-accelerators