Problem:
Starting a nvidia-docker container results in the error message
docker: Error response from daemon: OCI runtime create failed: container_linux.go:380: starting container process caused: process_linux.go:545: container init caused: Running hook #0:: error running hook: exit status 1, stdout: , stderr: nvidia-container-cli: container error: cgroup subsystem devices not found: unknown.
The root of the problem seems to be that current kernel uses cgroups v2 which libnvidia-container does not currently support.
See discussion in an issue posted in the nvidia-docker github repo.
To make nvidia-docker working I followed the instruction given by user "Zethson", commented on 10 Apr in the link given above:
Edit the file
/usr/share/nvidia-container-runtime/config.toml
and change the line
# no-cgroups = false
to
no-cgroups = true
After
systemctl restart docker
everything works fine. My proposal would be to make this change in post build step when package is build and installed.