Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Error about driver version compatibility #7798

Open
GLW1215 opened this issue Nov 15, 2024 · 2 comments
Open

Error about driver version compatibility #7798

GLW1215 opened this issue Nov 15, 2024 · 2 comments
Assignees
Labels
module: platforms Issues related to platforms, hardware, and support matrix question Further information is requested

Comments

@GLW1215
Copy link

GLW1215 commented Nov 15, 2024

Description
When I tried to using Triton server version 2.51.0(Nvidia Release 24.10) on Orin Nano with Jetpack 6.1, an Error shows:
image

Triton Information
2.51.0(Nvidia Release 24.10)

Are you using the Triton container or did you build it yourself?
just pull the Triton container

To Reproduce

  1. pull the triton container 24.10 on Jetpack 6.1
  2. run the triton server with docker

Describe the models (framework, inputs, outputs), ideally include the model configuration file (if using an ensemble include the model configuration file for that as well).

Expected behavior
the Triton server can run, but is there any risk regarding the Error?

@rmccorm4
Copy link
Collaborator

Hi @GLW1215, if your driver is incompatible with the build, it's possible that you'll encounter GPU/CUDA related errors at server startup when trying to load a GPU-based model. This should be a quick thing to verify by trying to load a GPU model with Triton.

CC @nv-kmcgill53 for viz

@rmccorm4 rmccorm4 added question Further information is requested module: platforms Issues related to platforms, hardware, and support matrix labels Nov 15, 2024
@rmccorm4 rmccorm4 self-assigned this Nov 15, 2024
@nv-kmcgill53
Copy link
Contributor

The issue you are facing is a cuda forward compatibility issue. The host machine has the R540 driver installed but the container has the R560 driver installed which seems incompatible from the screenshot you have posted. The easiest fix is to upgrade the host device to the latest R560 driver and then the container and the host have the same installation. If this is not possible then hopefully you can install the cuda compat package on the host machine to allow for driver compatibility. Let me know if this advice doesn't work or isn't applicable to your use case in some way and I can look further into it.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
module: platforms Issues related to platforms, hardware, and support matrix question Further information is requested
Development

No branches or pull requests

3 participants