Skip to content

The NVIDIA Driver Manager is a Kubernetes component which assist in seamless upgrades of NVIDIA Driver on each node of the cluster.

License

Notifications You must be signed in to change notification settings

easystack/k8s-driver-manager

This branch is up to date with NVIDIA/k8s-driver-manager:master.

Folders and files

NameName
Last commit message
Last commit date
Oct 1, 2024
Aug 26, 2024
Oct 2, 2024
Jul 2, 2024
Jun 28, 2024
Oct 1, 2024
Jan 30, 2024
Aug 27, 2024
Oct 1, 2024
Jul 6, 2021
Jul 6, 2021
Feb 28, 2024
Jul 6, 2021
Feb 21, 2024
Jun 28, 2024
Jun 28, 2024
Oct 2, 2024

Repository files navigation

NVIDIA Driver Manager For Kubernetes

The NVIDIA Driver Manager is a Kubernetes component which assist in seamless upgrades of NVIDIA Driver on each node of the cluster. This component ensure that all pre-requisites are met before driver upgrades can be performed using NVIDIA GPU Driver. Following are the actions performed by this component when upgrade is required.

  1. Check for already installed kernel modules.
  2. Perform Drain on the node ignoring Daemonset pods.
  3. Evict GPU Operator components like Device-Plugin, GPU Feature Discovery, DCGM Exporter etc.
  4. Unload kernel-modules.
  5. Unmount Driver root filesystem mounted on the host previously under /run/nvidia/driver.
  6. Uncordon the node.

These steps allows new versions can be easily installed in the Kubernetes cluster.

About

The NVIDIA Driver Manager is a Kubernetes component which assist in seamless upgrades of NVIDIA Driver on each node of the cluster.

Resources

License

Stars

Watchers

Forks

Packages

No packages published

Languages

  • Shell 70.8%
  • Makefile 14.9%
  • Go 14.3%