This upgrade is needed in support of A100 GPU, kernel
upgrade and bug 1948050. It eliminates the requirement
to create nvidia specific runtimeclass prior to installing
the charts by pre-installing the toolkit through toolkit-
installer subchart.
This commit has been tested with the following:
driver: 470.57.02
toolkit: 1.7.1-ubi8
defaultRuntime: containerd
Test Plan:
PASS: Verify gpu-operator starts and adds nvidia.com/gpu
to the node.
PASS: Verify nvidia-toolkit is removed with helm override
of global.toolkit_force_clean=true.
PASS: Verify pods can access gpu device and nvidia tools
to monitor the GPU.
PASS: Verify pod can build and execute cuda sample code.
PASS: Verify driver pod prints out warning when building
on Low Latency kernel with helm override of:
--set driver.env[0].name=IGNORE_PREEMPT_RT_PRESENCE
Closes-Bug: 1948050
Signed-off-by: Babak Sarashki <babak.sarashki@windriver.com>
Change-Id: I18dd2a0ab1adc6f9364314a22373aadc93cad27f