Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[Feature] Support GPU blocks #1780

Open
Tracked by #1782
peterschmidt85 opened this issue Oct 3, 2024 · 2 comments
Open
Tracked by #1782

[Feature] Support GPU blocks #1780

peterschmidt85 opened this issue Oct 3, 2024 · 2 comments
Assignees

Comments

@peterschmidt85
Copy link
Contributor

peterschmidt85 commented Oct 3, 2024

Motivation and implementation notes: https://docs.google.com/document/d/1-0cInSwV3CkybQUi4w4KtsrkFw5grvYdKxGo0gqPh2w/edit?usp=sharing

Copy link

github-actions bot commented Nov 3, 2024

This issue is stale because it has been open for 30 days with no activity.

@github-actions github-actions bot added the stale label Nov 3, 2024
Copy link

This issue was closed because it has been inactive for 14 days since being marked as stale. Please reopen the issue if it is still relevant.

@github-actions github-actions bot closed this as not planned Won't fix, can't repro, duplicate, stale Nov 17, 2024
@peterschmidt85 peterschmidt85 changed the title [Feature] Run multiple jobs concurrently on one instance with a fraction of resources GPU/RAM/disk [Feature] Support GPU blocks Nov 17, 2024
@peterschmidt85 peterschmidt85 pinned this issue Dec 2, 2024
un-def added a commit that referenced this issue Dec 10, 2024
The shim is now able to process multiple tasks at a time.
The global state for the existing "legacy" API is emulated.
From the API caller's point of view, the shim works exactly
the same as before.

Part-of: #1780
un-def added a commit that referenced this issue Dec 10, 2024
The shim is now able to process multiple tasks at a time.
The global state for the existing "legacy" API is emulated.
From the API caller's point of view, the shim works exactly
the same as before.

Part-of: #1780
un-def added a commit that referenced this issue Dec 12, 2024
The shim is now able to process multiple tasks at a time.
The global state for the existing "legacy" API is emulated.
From the API caller's point of view, the shim works exactly
the same as before.

Part-of: #1780
un-def added a commit that referenced this issue Dec 13, 2024
* Collect GPU resource identifiers: UUIDs for NVIDIA, DRI render nodes
  for AMD
* Request GPUs by identifiers when creating a container
* Keep track of busy/idle GPUs
* Move host-related code to a separate package
* Don't `panic()`

Part-of: #1780
@un-def un-def added the major label Dec 13, 2024
un-def added a commit that referenced this issue Dec 18, 2024
* Collect GPU resource identifiers: UUIDs for NVIDIA, DRI render nodes
  for AMD
* Request GPUs by identifiers when creating a container
* Keep track of busy/idle GPUs
* Move host-related code to a separate package
* Don't `panic()`

Part-of: #1780
un-def added a commit that referenced this issue Dec 25, 2024
un-def added a commit that referenced this issue Dec 25, 2024
un-def added a commit that referenced this issue Dec 29, 2024
Let's call it v2

Part-of: #1780
un-def added a commit that referenced this issue Dec 30, 2024
From the caller's point of view, ShimClient interface stays the same,
but internally it's now able to work with both API v1 and v2, emulating
API v1 logic via API v2 methods if v2 is supported by shim


Part-of: #1780
Co-authored-by: Victor Skvortsov <[email protected]>
un-def added a commit that referenced this issue Dec 30, 2024
un-def added a commit that referenced this issue Jan 6, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

2 participants