Does this work with vGPU? #14

bsteinb · 2022-12-07T17:05:08Z

We would like to use this project to configure MIG backed vGPUs on A100s in an OpenStack cloud. We are using vGPU host driver version 525.60.12. As I understand it, I have to enable SR-IOV on the A100s in order to expose the vGPUs to OpenStack / KVM using the /usr/lib/nvidia/sriov-manage included with the vGPU driver. However, once I do that, mig-parted starts acting up:

# nvidia-mig-parted export
FATA[0000] Error checking MIG capable: error getting device handle: Invalid Argument

Is this scenario not supported or am I doing something wrong?

The text was updated successfully, but these errors were encountered:

cdesiniotis · 2022-12-07T19:58:29Z

This use case is not officially supported as we do not test it. Can you provide the debug output so we can get more information:

nvidia-mig-parted -d export

bsteinb · 2022-12-07T20:39:44Z

I'm afraid that does not provide any additional info:

# nvidia-mig-parted -d export
FATA[0000] Error checking MIG capable: error getting device handle: Invalid Argument

cdesiniotis · 2022-12-07T21:02:44Z

Okay. I have tried using mig-parted in the past for this use case and recall needing this change in one of mig-parted's dependencies to make it work: https://gitlab.com/nvidia/cloud-native/go-nvlib/-/merge_requests/13. mig-parted is still using an older version of this dependency without this change, so it is not expected to work today. I am tracking this as a feature request. cc @klueska

bsteinb · 2022-12-07T21:40:39Z

Thanks for the pointer. I can confirm that nvidia-mig-parted export works after bumping go-nvlib to the latest commit. I will try applying a configuration tomorrow.

sdake · 2023-03-27T09:52:44Z

@cdesiniotis Hi. I took a look at the PR https://gitlab.com/nvidia/cloud-native/go-nvlib/-/merge_requests/1 where you authored the PR.

In my case, I want to bind a virtual function to vfio-pci and pass through the virtual function. I don't want to use vGPU. It is unclear if this is something that would ever function. Would you clarify if you could?

Thank you,
-steve

cdesiniotis added the enhancement label Dec 7, 2022

ArangoGutierrez removed the enhancement label Feb 22, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Does this work with vGPU? #14

Does this work with vGPU? #14

bsteinb commented Dec 7, 2022

cdesiniotis commented Dec 7, 2022

bsteinb commented Dec 7, 2022

cdesiniotis commented Dec 7, 2022

bsteinb commented Dec 7, 2022

sdake commented Mar 27, 2023

Does this work with vGPU? #14

Does this work with vGPU? #14

Comments

bsteinb commented Dec 7, 2022

cdesiniotis commented Dec 7, 2022

bsteinb commented Dec 7, 2022

cdesiniotis commented Dec 7, 2022

bsteinb commented Dec 7, 2022

sdake commented Mar 27, 2023