Requesting zero GPUs allocates all GPUs #61

Closed
dhague opened this issue Jul 10, 2018 · 37 comments

@dhague

dhague commented Jul 10, 2018

The README.md states:

WARNING: if you don't request GPUs when using the device plugin with NVIDIA images all the GPUs on the machine will be exposed inside your container.

I discovered a workaround for this, which is to set the environment variable NVIDIA_VISIBLE_DEVICES to none in the container spec.

With a resource request for nvidia.com/gpu: 0 this environment variable should be set automatically.
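
For illustration, the workaround might look roughly like this in a pod spec (a sketch; the pod name, container name, and image are placeholders):

apiVersion: v1
kind: Pod
metadata:
  name: cpu-only-pod                # placeholder name
spec:
  containers:
    - name: app                     # placeholder name
      image: nvidia/cuda:11.0-base  # example CUDA-based image
      env:
        - name: NVIDIA_VISIBLE_DEVICES  # hide the node's GPUs from this container
          value: "none"
      # note: no nvidia.com/gpu resource request or limit is set here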

@everpeace
Contributor

With a resource request for nvidia.com/gpu: 0 this environment variable should be set automatically.

Currently, the device plugin doesn't have the ability to inject env vars into pods.

However, you can implement this feature with a mutating Admission Webhook: just write a small web server that injects the env var into user pod definitions. I think it's not that difficult. (I actually did the same thing in our cluster.)
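
As an illustration of what such a webhook ends up doing, the mutation boils down to a JSON patch along these lines (shown in YAML for readability; this is a sketch of the approach, not the actual webhook code mentioned above):

# Append NVIDIA_VISIBLE_DEVICES=none to the first container's env.
# This assumes the container already has an env list; if not, the webhook
# would first have to add the list itself.
- op: add
  path: /spec/containers/0/env/-
  value:
    name: NVIDIA_VISIBLE_DEVICES
    value: "none"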

@yukuo78

yukuo78 commented Oct 13, 2018

Does it mean that if I have two containers, both requesting nvidia.com/gpu: 0, they could share the GPU?

@thomasjungblut

@yukuo78 basically yes, this is equivalent to the node-selector trick for sharing GPUs, as described in:
kubernetes/kubernetes#52757 (comment)

Check the follow-ups in that thread for more information.

@Davidrjx

@dhague both nvidia.com/gpu: 0 and the NVIDIA_VISIBLE_DEVICES env var need to be set together as a prerequisite, don't they?
Recently, when only nvidia.com/gpu: 0 is set, the related pod scheduled on a GPU node can fail with "OutOfnvidia.com/gpu"; its status looks like:

status:
  message: 'Pod Node didn''t have enough resource: nvidia.com/gpu, requested: 0, used:
    1, capacity: 0'
  phase: Failed
  reason: OutOfnvidia.com/gpu
  startTime: "2019-05-09T03:05:49Z"

@aebischers

@everpeace could you share your custom Admission Webhook code, especially the part that mutates NVIDIA_VISIBLE_DEVICES?

@Cas-pian

Cas-pian commented Nov 2, 2019

I've tested this. If we add NVIDIA_VISIBLE_DEVICES=none to pod.spec.containers[*].env, then for a pod that requests 1 GPU via Kubernetes container resource requests, the environment list seen when nvidia-container-runtime is executed will be (the order is important):

NVIDIA_VISIBLE_DEVICES=GPU-xxx-xxx-xxx-xxx-xxx
NVIDIA_VISIBLE_DEVICES=none

The nvidia-container-runtime may take the last one to decide which devices to mount, which results in no devices being available in the container, which is not what was expected. This behavior depends on the version of nvidia-container-runtime-hook (recently renamed to nvidia-container-toolkit) you use; please refer to this.

@Davidrjx

Davidrjx commented Nov 5, 2019

@Cas-pian you mean two pods set up respectively with NVIDIA_VISIBLE_DEVICES=none and nvidia.com/gpu: 1 on the same node, is that right?

@Cas-pian

Cas-pian commented Nov 6, 2019

@Davidrjx no, I just found a bug in nvidia-container-runtime-hook (nvidia-container-toolkit): multiple NVIDIA_VISIBLE_DEVICES envs are not handled, which makes GPUs not get mounted as expected.

Step 1: I used a CUDA image that has the env NVIDIA_VISIBLE_DEVICES=all to start a pod (without setting resources.requests for a GPU), and all GPUs were mounted into the container. This makes k8s-device-plugin useless and breaks the environment of pods that do use resources.requests for GPUs.

Step 2: To fix the problem from step 1, I added NVIDIA_VISIBLE_DEVICES=none to pod.spec.containers[*].env to override the default value of NVIDIA_VISIBLE_DEVICES in the image, but then no GPU was mounted into the pod even when I did use resources.requests to request a GPU.

In the end, I think it's not a good design to use the same mechanism (the NVIDIA_VISIBLE_DEVICES env) for both single-node GPU allocation and cluster GPU allocation, because CUDA images are made for single-node usage; it would be better to use different mechanisms (e.g. different envs). @flx42

@Davidrjx

Davidrjx commented Nov 6, 2019

@Cas-pian oh, now I understand what you mean.

@ktarplee

I wrote a Kubernetes Mutating Admission Webhook called gpu-admission-webhook to handle this case. It sets NVIDIA_VISIBLE_DEVICES to "none" if you do not request a GPU. It also deletes environment variables that would cause issues or bypass this constraint.

@XciD

XciD commented Mar 25, 2021

After reading the documentation about NVIDIA_VISIBLE_DEVICES, I advise you to set void instead of none.

From the doc:

nvidia-container-runtime will have the same behavior as runc (i.e. neither GPUs nor capabilities are exposed)
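
In a container spec that would look like the following (same shape as the none workaround above, just with void):

env:
  - name: NVIDIA_VISIBLE_DEVICES   # expose neither GPUs nor driver capabilities
    value: "void"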

@orkenstein

I've tried to set:

        resources:
          limits:
            nvidia.com/gpu: 0

My idea is to have multiple pods on the same node sharing a single GPU. But it looks like in that case the app in the container does not utilise the GPU at all. What am I missing?

@ktarplee

This is no longer an issue if you have the following lines in your /etc/nvidia-container-runtime/config.toml

accept-nvidia-visible-devices-envvar-when-unprivileged = false
accept-nvidia-visible-devices-as-volume-mounts = true 

And you deploy the nvidia-device-plugin with the values

compatWithCPUManager: true
deviceListStrategy: volume-mounts

@orkenstein

@ktarplee thanks for the clue!
Regarding /etc/nvidia-container-runtime/config.toml: I have a container built on top of tensorflow/tensorflow:1.14.0-gpu-py3 but see no config.toml there. Where should it be edited?

@klueska
Contributor

klueska commented Jun 28, 2021

Needs to be set on the host, not inside a container.

Here’s a link to the details: https://docs.google.com/document/d/1zy0key-EL6JH50MZgwg96RPYxxXXnVUdxLZwGiyqLd8/edit

@elezar
Member

elezar commented Jun 28, 2021

@orkenstein the config file mentioned is installed on every host along with the NVIDIA Container Toolkit / NVIDIA Docker.

@orkenstein

Thanks @klueska @elezar
I'm not sure how to do that on GCloud. Should I tweak nvidia-installer somehow?

@elezar
Member

elezar commented Jun 29, 2021

@orkenstein does that mean that you're not using the NVIDIA Device Plugin to enable GPU usage on GCloud, but something else instead?

(Could you provide a link to the nvidia-installer you mention?)

@orkenstein

@elezar
Member

elezar commented Jun 29, 2021

@orkenstein GKE does not (currently) use the NVIDIA device plugin nor the NVIDIA container toolkit. Which means that the suggestion by @ktarplee is not applicable to you.

@orkenstein

@orkenstein GKE does not (currently) use the NVIDIA device plugin nor the NVIDIA container toolkit. Which means that the suggestion by @ktarplee is not applicable to you.

Ah, okay. What should I do then?

@elezar
Member

elezar commented Jun 30, 2021

This is unfortunately not something that I can help with. You could try posting your request here: https://github.com/GoogleCloudPlatform/container-engine-accelerators/issues (which contains the device plugin used on GKE systems).

@sjdrc

sjdrc commented Nov 23, 2021

This is no longer an issue if you have the following lines in your /etc/nvidia-container-runtime/config.toml

accept-nvidia-visible-devices-envvar-when-unprivileged = false
accept-nvidia-visible-devices-as-volume-mounts = true 

And you deploy the nvidia-device-plugin with the values

compatWithCPUManager: true
deviceListStrategy: volume-mounts

Thanks for this solution. However, I'm deploying https://github.com/NVIDIA/gpu-operator to my k3s cluster with a docker backend, using gpu-operator to install the container runtime. Is it possible to inject this configuration into the helm deployment?

@shivamerla
Contributor

@sjdrc Currently it's not possible to set these parameters through the gpu-operator Helm deployment, as the toolkit container doesn't support configuring them yet. We will look into adding this support. Meanwhile, they need to be added manually to the /usr/local/nvidia/toolkit/.config/config.toml file, but the device-plugin settings can be configured through the --set devicePlugin.env[0].name=DEVICE_LIST_STRATEGY --set devicePlugin.env[0].value="volume-mounts" parameters during operator install. The compatWithCPUManager setting is already the default in the gpu-operator deployment.
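
For reference, those --set flags correspond to this fragment of the chart values (a sketch; the env var name and value are exactly the ones given above):

devicePlugin:
  env:
    - name: DEVICE_LIST_STRATEGY
      value: volume-mounts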

@sjdrc

sjdrc commented Nov 24, 2021

Thanks for your prompt reply.

So just to clarify, I should configure /usr/local/nvidia/toolkit/.config/config.toml on the host, and by setting volume-mounts, the device plugin will use the host configuration?

I do not have that file, but I do have /usr/local/nvidia/toolkit/.config/nvidia-container-runtime/config.toml

@sjdrc

sjdrc commented Dec 3, 2021

Hey, I'm still having issues getting this working.

  1. Should the config changes go into /usr/local/nvidia/toolkit/.config/nvidia-container-runtime/config.toml? This file is present on my host, but /usr/local/nvidia/toolkit/.config/config.toml is not.
  2. Which section of the config file do these changes go in? I have a top-level section, [nvidia-container-cli], and [nvidia-container-runtime].
  3. How can I make these persist? Every time I restart k3s the file content gets reverted.

@sjdrc

sjdrc commented Dec 3, 2021

Adding a bit more information about my setup process (from clean)

@shivamerla
Contributor

Hey, I'm still having issues getting this working.

  1. Should the config changes go into /usr/local/nvidia/toolkit/.config/nvidia-container-runtime/config.toml? This file is present on my host, but /usr/local/nvidia/toolkit/.config/config.toml is not.

Sorry, /usr/local/nvidia/toolkit/.config/nvidia-container-runtime/config.toml is the right location of this file.

  2. Which section of the config file do these changes go in? I have a top-level section, [nvidia-container-cli], and [nvidia-container-runtime].

You need to add those lines as global params.

disable-require = false
accept-nvidia-visible-devices-envvar-when-unprivileged = false
accept-nvidia-visible-devices-as-volume-mounts = true 

[nvidia-container-cli]
  environment = []
  ldconfig = "@/run/nvidia/driver/sbin/ldconfig.real"
  load-kmods = true
  path = "/usr/local/nvidia/toolkit/nvidia-container-cli"
  root = "/run/nvidia/driver"

[nvidia-container-runtime]

  3. How can I make these persist? Every time I restart k3s the file content gets reverted.

I think this was because they were not added as global params.

@sjdrc

sjdrc commented Dec 6, 2021

I'm still running into issues.

Steps to reproduce

  1. Install Ubuntu Server 20.04
  2. Install Docker:
curl https://get.docker.com | sh \
  && sudo systemctl --now enable docker
  3. Blacklist nouveau:
cat <<EOF | sudo tee /etc/modprobe.d/blacklist-nvidia-nouveau.conf
blacklist nouveau
options nouveau modeset=0
EOF
  4. Disable AppArmor:
    sudo apt remove --assume-yes --purge apparmor
  5. Install k3s with the --docker flag
  6. helm install --version 1.9.0 --wait --generate-name -n gpu-operator --create-namespace nvidia/gpu-operator --set devicePlugin.env[0].name=DEVICE_LIST_STRATEGY --set devicePlugin.env[0].value="volume-mounts"
  7. Add globally to /usr/local/nvidia/toolkit/.config/nvidia-container-runtime/config.toml:
accept-nvidia-visible-devices-envvar-when-unprivileged = false
accept-nvidia-visible-devices-as-volume-mounts = true
  8. Reboot

Result

nvidia-device-plugin-validator is giving an error and refusing to start:

Error: failed to start container "plugin-validation": Error response from daemon: OCI runtime create failed: container_linux.go:380: starting container process caused: process_linux.go:545: container init caused: Running hook #0:: error running hook: exit status 1, stdout: , stderr: nvidia-container-cli.real: device error: /var/run/nvidia-container-devices: unknown device: unknown

@klueska
Contributor

klueska commented Dec 9, 2021

So it looks like your k8s-device-plugin settings are working (i.e. the envvars set in your helm chart), but your toolkit configs are not. This is verified by the fact that the "device" the toolkit is seeing is an unknown device with the name /var/run/nvidia-container-devices (which is what the plugin will set NVIDIA_VISIBLE_DEVICES to if it is listing the devices as volume mounts instead of via this envvar).

Can you show your entire file for /usr/local/nvidia/toolkit/.config/nvidia-container-runtime/config.toml?

@sjdrc

sjdrc commented Jan 28, 2022

Hi there,

This is the full file contents:

disable-require = false
accept-nvidia-visible-devices-envvar-when-unprivileged = false
accept-nvidia-visible-devices-as-volume-mounts = true

[nvidia-container-cli]
  environment = []
  ldconfig = "@/run/nvidia/driver/sbin/ldconfig.real"
  load-kmods = true
  path = "/usr/local/nvidia/toolkit/nvidia-container-cli"
  root = "/run/nvidia/driver"

[nvidia-container-runtime]

After editing, does anything need to be done to reload the file? Running systemctl restart k3s will reset the file, removing the two added lines.

@sjdrc

sjdrc commented Jan 28, 2022

So basically it seems that after a reboot I need to add those lines back and re-deploy the gpu-operator helm chart, and then everything works fine. If I reboot without doing that, I run into the above error.

@elgalu

elgalu commented Apr 4, 2023

Hi all, it has been possible to set volume-mounts for DEVICE_LIST_STRATEGY via the gpu-operator for about two years now (https://github.com/NVIDIA/gpu-operator/blob/f38dc96ac4e74b6f0926b5c497a87878265cf689/deployments/gpu-operator/values.yaml#L220-L221), or am I missing something?

@ozen

ozen commented Nov 22, 2023

@elgalu The issue with the gpu-operator doesn't seem to be the device plugin's DEVICE_LIST_STRATEGY env var, but the toolkit's config file at /usr/local/nvidia/toolkit/.config/nvidia-container-runtime/config.toml. See the steps followed by @sjdrc above.

@Apsu

Apsu commented Oct 9, 2024

I have found a solution to this very old issue, after a customer of the managed k8s product I lead complained about the same thing, and I read through these years of posts.

After digging into the code for a while to see how the config files are read and parsed, I saw these options https://github.com/NVIDIA/nvidia-container-toolkit/blob/main/tools/container/toolkit/toolkit.go#L159-L170, which were added two years ago in this commit: NVIDIA/nvidia-container-toolkit@90518e0

So I threw this into the gpu-operator helm values:

toolkit:
  env:
    - name: ACCEPT_NVIDIA_VISIBLE_DEVICES_ENVVAR_WHEN_UNPRIVILEGED
      value: "false"
    - name: ACCEPT_NVIDIA_VISIBLE_DEVICES_AS_VOLUME_MOUNTS
      value: "true"
devicePlugin:
  env:
    - name: DEVICE_LIST_STRATEGY
      value: volume-mounts

Everything came up green, and /usr/local/nvidia/toolkit/.config/nvidia-container-runtime/config.toml has the right flag values in it.

I tested pods with no gpu resources specified, with a limit of nvidia.com/gpu: 0, multiple with 1 GPU on the same node, and with 1 GPU and privileged: true in the securityContext.

Without a resource specified, or with a limit of 0, the NVIDIA tooling and GPUs are indeed not mounted into the pod: nvidia-smi disappears, and python -c 'import torch; num_of_gpus = torch.cuda.device_count(); print(num_of_gpus);' prints 0 as well. The 1-GPU pods on the same node get different GPUs (nvidia-smi shows unique bus IDs), and the privileged 1-GPU pod can only see the one it was allocated, also with a unique bus ID.

Seems like everything is working as expected, so I wanted to share it for anyone else who stumbles on this thread.
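
For anyone who wants to reproduce the zero-GPU check, a minimal test pod along these lines (a sketch; the name and image are illustrative) should see no GPUs once the settings above are in place:

apiVersion: v1
kind: Pod
metadata:
  name: no-gpu-test                  # illustrative name
spec:
  restartPolicy: Never
  containers:
    - name: cuda
      image: nvcr.io/nvidia/cuda:12.2.0-base-ubuntu22.04  # any CUDA image will do
      command: ["sh", "-c", "nvidia-smi || echo 'no GPUs visible'"]
      resources:
        limits:
          nvidia.com/gpu: 0          # explicitly request zero GPUs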

@klueska
Contributor

klueska commented Oct 9, 2024

Here is the document explaining how those are to be used:
https://docs.google.com/document/d/1zy0key-EL6JH50MZgwg96RPYxxXXnVUdxLZwGiyqLd8/edit

We will be adding these instructions to our official docs soon (which is obviously long overdue).
