feat(initialize): default to first GPU when gpu_id not provided #125

btruhand · 2024-05-31T20:24:17Z

Issue #, if available:

I was trying to deploy Huggingface Transformers on Sagemaker with multi-modal-server (MMS) preload_model = true (about preloading). Unfortunately I hit a snag and the server was unable to preload the model due to missing GPU ID

Checking the MMS code here, here, and here we can see that no GPU ID is provided on model preload. Worse, the service will be constructed with no GPU ID and thus on subsequent attempts to initialize on prediction in the handler, the same exception will again be raised

Considering that the existing call already uses .get instead of indexing operator, arguably there was already awareness that gpu_id may be missing, but it was not properly handled. Or it was thought that in subsequent initialization attempts the problem will be fixed

Description of changes:

Provide a default GPU ID of 0, if no gpu_id is provided, indicating downstream code to use the first GPU. I feel like this solution is quite sensible considering that we already check whether GPU is available or not and thus, we should be safe to assume that there is at least 1 GPU with GPU ID 0. Though I'm not entirely well-versed in GPU ID schemes so maybe 0 isn't a universally applicable ID to use

By submitting this pull request, I confirm that you can use, modify, copy, and redistribute this contribution, under the terms of your choice.

btruhand · 2024-05-31T20:25:25Z

tests/unit/test_handler_service_with_context.py

+@require_torch
+@pytest.mark.skipif(not _is_gpu_available(), reason="No GPU available")
+@slow
+def test_initialize_without_gpu_id_fallback_to_first_gpu(inference_handler):


I never got to test this myself since I don't have GPU access currently. Also not sure if we want to use the skipif on availability of GPU. I didn't see any such marks in the test file. But i think it makes sense to have it

kurtgdl · 2024-12-31T11:48:18Z

I'm also being faced with this error with the release 2.3.0.

feat(initialize): default to first GPU when gpu_id not provided

297bce1

btruhand commented May 31, 2024

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat(initialize): default to first GPU when gpu_id not provided #125

feat(initialize): default to first GPU when gpu_id not provided #125

btruhand commented May 31, 2024 •

edited

Loading

btruhand May 31, 2024

kurtgdl commented Dec 31, 2024

feat(initialize): default to first GPU when gpu_id not provided #125

Are you sure you want to change the base?

feat(initialize): default to first GPU when gpu_id not provided #125

Conversation

btruhand commented May 31, 2024 • edited Loading

btruhand May 31, 2024

Choose a reason for hiding this comment

kurtgdl commented Dec 31, 2024

btruhand commented May 31, 2024 •

edited

Loading