NVIDIA gpu support #165

ianm-nv · 2024-09-30T17:17:04Z

This PR enables GPU passthrough with virtme-ng. It adds a new parameter, "--nvgpu", which takes the PCI address of the GPU. The README includes instructions for enabling VFIO passthrough support.

Signed-off-by: Ian May <[email protected]>

Signed-off-by: Andrea Righi <[email protected]>

Signed-off-by: Ian May <[email protected]>

arighi

Looks great overall, thanks for working on this!

I just left a small comment about running vng as a regular user, but we can definitely merge it and address this later in-tree.

arighi · 2024-09-30T17:44:43Z

README.md

+   # Load VFIO module
+   $ sudo modprobe vfio-pci
+   # Pass PCI address to virtme-ng
+   $ sudo vng --nvgpu "01:00.0" -r linux


Do we really need sudo here to run vng? That might be the case because we need to access PCI directly, I guess... do you get a permission error if you try to run vng as a regular user?
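One likely source of a permission error for a regular user is the VFIO group node: QEMU has to open `/dev/vfio/<group>` for the device's IOMMU group, and that node is usually accessible only to root unless a udev rule or `chown` changes it. The following is a minimal sketch for checking this, not something taken from the PR. The function names and the `SYSFS_ROOT`/`DEV_ROOT` overrides are hypothetical, and locked-memory limits (`ulimit -l`) may also need raising for unprivileged use.

```shell
# Hypothetical helpers, not part of virtme-ng.
# SYSFS_ROOT and DEV_ROOT exist only so the sketch can be tried on a fake tree.
SYSFS_ROOT="${SYSFS_ROOT:-/sys}"
DEV_ROOT="${DEV_ROOT:-/dev}"

# Print the VFIO group node for a full PCI address such as "0000:01:00.0".
vfio_group_node() {
    # iommu_group is a symlink whose last path component is the group number
    link=$(readlink "$SYSFS_ROOT/bus/pci/devices/$1/iommu_group") || return 1
    printf '%s/vfio/%s\n' "$DEV_ROOT" "${link##*/}"
}

# Succeed if the current user can read and write the group node.
vfio_user_can_open() {
    node=$(vfio_group_node "$1") || return 1
    [ -r "$node" ] && [ -w "$node" ]
}
```

For example, `vfio_user_can_open 0000:01:00.0 || echo "need root or a udev rule"` would report whether the node is accessible without sudo, assuming the device is already bound to vfio-pci.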

SPYFF · 2024-10-03T11:13:38Z

While this is a good addition, I wonder if it can be generalised. I used to pass through a network interface with VFIO, and the steps were almost identical. I'm afraid there are driver- and device-specific quirks, but the basic steps were the same as in this case:

1. Find the PCIe address and device ID of the device
2a. Unbind it from its driver (on a running system)
2b. Blacklist the driver (if runtime unbind failed)
3. Load the vfio-pci driver
4. Map the device to the vfio-pci driver
5. May or may not be visible and usable in QEMU

Sure, success is highly device- and host-dependent (e.g. the same i225 NIC worked in the VM on an Intel host but not on an AMD one), but a generic PCIe passthrough feature might be worth considering.
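The runtime unbind and the mapping to vfio-pci described in the steps above can be sketched with the kernel's standard sysfs attributes (`driver/unbind`, `driver_override`, `drivers_probe`). This is an illustrative sketch under assumptions, not what the PR implements: the function names and the `SYSFS_ROOT` override are hypothetical, it has to run as root on a real system, and it does not cover the blacklisting fallback.

```shell
# Hypothetical helpers, not part of virtme-ng.
# SYSFS_ROOT exists only so the sketch can be tried on a fake tree.
SYSFS_ROOT="${SYSFS_ROOT:-/sys}"

# Expand a short address such as "01:00.0" to the full "0000:01:00.0" form.
pci_full_addr() {
    case "$1" in
        *:*:*) printf '%s\n' "$1" ;;
        *)     printf '0000:%s\n' "$1" ;;
    esac
}

# Unbind the device from its current driver (if any) and hand it to vfio-pci.
vfio_bind() {
    addr=$(pci_full_addr "$1")
    dev="$SYSFS_ROOT/bus/pci/devices/$addr"
    [ -d "$dev" ] || { echo "no such device: $addr" >&2; return 1; }

    # Runtime unbind: only needed when a driver currently owns the device
    if [ -e "$dev/driver/unbind" ]; then
        printf '%s' "$addr" > "$dev/driver/unbind"
    fi

    # Tell the PCI core that only vfio-pci may claim this device,
    # then ask it to re-probe so the override takes effect
    printf '%s' vfio-pci > "$dev/driver_override"
    printf '%s' "$addr" > "$SYSFS_ROOT/bus/pci/drivers_probe"
}
```

Typical use would be `modprobe vfio-pci` followed by `vfio_bind 01:00.0`, both as root. Whether the device then works inside QEMU still depends on the host and the device, as noted above.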

arighi · 2024-10-03T11:24:06Z

Absolutely, I totally agree on generalizing the concept of PCI passthrough, and I think we can improve this in-tree. At this stage, breaking the API (command options) is not so critical, as long as we don't break any previous behavior or option (maybe we should have a way to mark new options as "unstable"?). But I think it's good to have "something" merged: a lot of people have been asking me for this feature for a while, so at least now people can start using it and improving it. 🙂

That said, I'll think about a proper interface for generic PCI passthrough, and maybe we can discuss it here or, better, create a separate issue.

Thanks!

ianm-nv and others added 3 commits September 30, 2024 12:00

vng: add support for passthrough NVIDIA GPUs

4f85de3

Signed-off-by: Ian May <[email protected]>

architectures: GPU passthrough

3949fde

Signed-off-by: Andrea Righi <[email protected]>

doc: update README.md with gpu passthrough setup

c7fea6c

Signed-off-by: Ian May <[email protected]>

arighi approved these changes Sep 30, 2024

arighi merged commit bcf6589 into arighi:main Sep 30, 2024
4 checks passed

arighi mentioned this pull request Oct 4, 2024

Generic PCI passthrough support #168

Open
