Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Does this work with vGPU? #14

Open
bsteinb opened this issue Dec 7, 2022 · 5 comments
Open

Does this work with vGPU? #14

bsteinb opened this issue Dec 7, 2022 · 5 comments

Comments

@bsteinb
Copy link

bsteinb commented Dec 7, 2022

We would like to use this project to configure MIG backed vGPUs on A100s in an OpenStack cloud. We are using vGPU host driver version 525.60.12. As I understand it, I have to enable SR-IOV on the A100s in order to expose the vGPUs to OpenStack / KVM using the /usr/lib/nvidia/sriov-manage included with the vGPU driver. However, once I do that, mig-parted starts acting up:

# nvidia-mig-parted export
FATA[0000] Error checking MIG capable: error getting device handle: Invalid Argument 

Is this scenario not supported or am I doing something wrong?

@cdesiniotis
Copy link
Contributor

This use case is not officially supported as we do not test it. Can you provide the debug output so we can get more information:

nvidia-mig-parted -d export

@bsteinb
Copy link
Author

bsteinb commented Dec 7, 2022

I'm afraid that does not provide any additional info:

# nvidia-mig-parted -d export
FATA[0000] Error checking MIG capable: error getting device handle: Invalid Argument 

@cdesiniotis
Copy link
Contributor

Okay. I have tried using mig-parted in the past for this use case and recall needing this change in one of mig-parted's dependencies to make it work: https://gitlab.com/nvidia/cloud-native/go-nvlib/-/merge_requests/13. mig-parted is still using an older version of this dependency without this change, so it is not expected to work today. I am tracking this as a feature request. cc @klueska

@bsteinb
Copy link
Author

bsteinb commented Dec 7, 2022

Thanks for the pointer. I can confirm that nvidia-mig-parted export works after bumping go-nvlib to the latest commit. I will try applying a configuration tomorrow.

@sdake
Copy link

sdake commented Mar 27, 2023

@cdesiniotis Hi. I took a look at the PR https://gitlab.com/nvidia/cloud-native/go-nvlib/-/merge_requests/1 where you authored the PR.

In my case, I want to bind a virtual function to vfio-pci and pass through the virtual function. I don't want to use vGPU. It is unclear if this is something that would ever function. Would you clarify if you could?

Thank you,
-steve

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

4 participants