forked from intel/ai-reference-models
-
Notifications
You must be signed in to change notification settings - Fork 0
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
Merge branch 'kdkasrav/k8s-improvements' into 'develop'
k8s kustomize refactoring and improvements See merge request intelai/models!218
- Loading branch information
Showing
37 changed files
with
159 additions
and
302 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,28 @@ | ||
# Kubernetes Mlops | ||
|
||
## Using NFS as storage | ||
|
||
Configuring the examples to use NFS as storage requires specifying these values in the mlops.env within the tree show below: | ||
|
||
``` | ||
examples | ||
└── common | ||
└── k8s | ||
└── mlops | ||
├── base | ||
│ └── mlops.env | ||
├── multi-node | ||
└── single-node | ||
``` | ||
|
||
The NFS related values within mlops.env are shown below: | ||
|
||
``` | ||
NFS_PATH=/exported_users | ||
NFS_MOUNT_PATH=/home | ||
NFS_SERVER=0.0.0.0 | ||
``` | ||
|
||
They should reflect values specific to your NFS implementation. NFS_PATH and NFS_SERVER are typically found in /etc/mtab | ||
and are NFS server values. NFS_MOUNT_PATH is a nfs client option indicating where the exported file system is mounted at. | ||
|
File renamed without changes.
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,5 @@ | ||
DATASET_DIR=/dataset | ||
NFS_PATH=/exported_users | ||
NFS_MOUNT_PATH=/home | ||
NFS_SERVER=0.0.0.0 | ||
OUTPUT_DIR=/workspace/output |
File renamed without changes.
File renamed without changes.
File renamed without changes.
File renamed without changes.
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,8 @@ | ||
apiVersion: kustomize.config.k8s.io/v1beta1 | ||
kind: Kustomization | ||
bases: | ||
- ../base | ||
resources: | ||
- mpi-job.yaml | ||
configurations: | ||
- mlops.yaml |
2 changes: 2 additions & 0 deletions
2
...ning/fp32/k8s/mlops/multi-node/mlops.yaml → ...es/common/k8s/mlops/multi-node/mlops.yaml
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,8 @@ | ||
apiVersion: kustomize.config.k8s.io/v1beta1 | ||
kind: Kustomization | ||
bases: | ||
- ../base | ||
resources: | ||
- pod.yaml | ||
configurations: | ||
- mlops.yaml |
2 changes: 2 additions & 0 deletions
2
...ing/fp32/k8s/mlops/single-node/mlops.yaml → ...s/common/k8s/mlops/single-node/mlops.yaml
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,16 @@ | ||
apiVersion: v1 | ||
kind: Pod | ||
metadata: | ||
name: $(MODEL_NAME) | ||
spec: | ||
serviceAccountName: model-service | ||
containers: | ||
- name: single-node | ||
imagePullPolicy: Always | ||
volumes: | ||
- name: users | ||
persistentVolumeClaim: | ||
claimName: users-pvc | ||
- name: datasets | ||
persistentVolumeClaim: | ||
claimName: datasets-pvc |
4 changes: 0 additions & 4 deletions
4
examples/image_recognition/tensorflow/resnet50v1_5/training/fp32/k8s/mlops/base/mlops.env
This file was deleted.
Oops, something went wrong.
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
2 changes: 2 additions & 0 deletions
2
...es/image_recognition/tensorflow/resnet50v1_5/training/fp32/k8s/mlops/multi-node/mlops.env
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -1,4 +1,6 @@ | ||
DATASET_DIR=/tf_dataset | ||
MODEL_NAME=resnet50v1_5_fp32_training | ||
MODEL_DIR=/workspace/resnet50v1_5_fp32_training | ||
NFS_MOUNT_PATH=/Users | ||
OUTPUT_DIR=/tmp/output | ||
REGISTRY=amr-registry.caas.intel.com/aipg-tf |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
56 changes: 0 additions & 56 deletions
56
...image_recognition/tensorflow/resnet50v1_5/training/fp32/k8s/mlops/multi-node/mpi-job.yaml
This file was deleted.
Oops, something went wrong.
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
2 changes: 2 additions & 0 deletions
2
...s/image_recognition/tensorflow/resnet50v1_5/training/fp32/k8s/mlops/single-node/mlops.env
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -1,4 +1,6 @@ | ||
DATASET_DIR=/tf_dataset | ||
MODEL_NAME=resnet50v1_5_fp32_training | ||
MODEL_DIR=/workspace/resnet50v1_5_fp32_training | ||
/NFS_MOUNT_PATH=/Users | ||
OUTPUT_DIR=/tmp/output | ||
REGISTRY=amr-registry.caas.intel.com/aipg-tf |
11 changes: 0 additions & 11 deletions
11
.../image_recognition/tensorflow/resnet50v1_5/training/fp32/k8s/mlops/single-node/mlops.yaml
This file was deleted.
Oops, something went wrong.
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Oops, something went wrong.