-
Notifications
You must be signed in to change notification settings - Fork 83
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Small BAR Size Support? #13
Comments
same question here =) |
What motherboard is AMD? I just updated bios on mine and it got Re-Size Bar support |
same question here, here is our configuration dual Epyc 7742, supermicro H12DSG system here is the lspci information 01:00.0 VGA compatible controller: NVIDIA Corporation GA102 [GeForce RTX 3090] (rev a1) (prog-if 00 [VGA controller]) here is the error information from simpeP2P test |
NVIDIA Open GPU Kernel Modules Version
550.90.07
Operating System and Version
Ubuntu 22.04
Kernel Release
6.8.9
Hardware: GPU
4090
Describe the bug
We have tested the modified kernel on two systems: 1x intel desktop (with full BAR=32GB), 1x amd server (without resizable_bar, BAR=256MB,512MB)
On the intel with full 32GB BAR size for the 2x4090, NCCL/P2P test is passing with modded driver.
However on the amd server platform where bios doesn't support resizable_bar, nvidia-smi is only showing 256MB and 512MB bar sizes for the 2x4090. On this amd server, even with the this modded nvidia driver, NCCL/P2P tests failed. The amd server also has lots of pcie devices so it may be running out of pcie map space to assign the large 32GB bars that 4090 support.
So my question is, is the current P2P+4090 code only working if BAR size >= full 4090 GPU vram size? Thank you!
The text was updated successfully, but these errors were encountered: