-
Notifications
You must be signed in to change notification settings - Fork 0
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
Various fixes and optimizations to the k3s deployment (#140)
Fixes a bunch of issues that we've seen doing installs on different kinds of systems: 1. Include poetry.lock to keep versioning consistent 2. Install curl and jq, if necessary 3. Use the helmchart CRD to install the Nvidia operator so we don't have to install and configure helm 4. Include `nvidia-utils` in the Nvidia driver install path since it's required for the operator to initialize 5. Be more patient waiting for the GPU capacity to be recognized by the node.
- Loading branch information
1 parent
8265846
commit befdcbb
Showing
5 changed files
with
2,989 additions
and
28 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,23 @@ | ||
apiVersion: v1 | ||
kind: Namespace | ||
metadata: | ||
name: gpu-operator | ||
--- | ||
apiVersion: helm.cattle.io/v1 | ||
kind: HelmChart | ||
metadata: | ||
name: nvidia-gpu-operator | ||
namespace: kube-system | ||
annotations: | ||
helm.cattle.io/helm-controller: "true" | ||
spec: | ||
repo: https://nvidia.github.io/gpu-operator | ||
chart: gpu-operator | ||
targetNamespace: gpu-operator | ||
bootstrap: true # Add this to ensure it's processed during bootstrap | ||
# https://github.com/NVIDIA/gpu-operator/blob/main/deployments/gpu-operator/values.yaml | ||
valuesContent: |- | ||
driver: | ||
enabled: false # Disable NVIDIA driver installation since we have it pre-installed | ||
toolkit: | ||
enabled: false # Disable NVIDIA Container Toolkit installation since we have it pre-installed |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Oops, something went wrong.