Hi @jcmcken! MIG is a prioritized feature on the Bottlerocket roadmap, and the team is actively working on it. While we cannot commit to a specific delivery date, we aim to provide a progress update in the next two weeks. Please use this issue for tracking. Thank you for your interest.
What I'd like:
NVIDIA time-slicing landed (see #2347) in Bottlerocket 1.25. While that is a step forward, time-slicing can't really be used securely in a multi-tenant EKS cluster (as mentioned in the documentation).
NVIDIA MIG provides a more secure alternative.
Here are some references:
Any alternatives you've considered:
NVIDIA vGPU is another alternative, but it has fewer isolation guarantees than MIG. It's also not clear how it would be used in EKS, and I think it might require a commercial license.
Today, in our multi-tenant clusters, the only solution seems to be giving each pod its own GPU worker node, which is highly cost-inefficient.