SeldonIO · abhimanyu003 · Jan 22, 2025 · Jan 22, 2025 · Jan 22, 2025 · Jan 22, 2025
@@ -161,6 +161,34 @@ spec:
 
 ```
 
+{% tabs %}
+{% tab title="kubectl" %}
+```bash
+kubectl apply -f ../../models/cifar10.yaml -n ${NAMESPACE}
+kubectl apply -f ../../models/cifar10-outlier-detect.yaml -n ${NAMESPACE}
+kubectl apply -f ../../models/cifar10-drift-detect.yaml -n ${NAMESPACE}
+```
+
+```
+model.mlops.seldon.io/cifar10 created
+model.mlops.seldon.io/cifar10-outlier created
+model.mlops.seldon.io/cifar10-drift created
+```
+
+```bash
+kubectl wait --for condition=ready --timeout=300s model cifar10 -n ${NAMESPACE}
+kubectl wait --for condition=ready --timeout=300s model cifar10-outlier -n ${NAMESPACE}
+kubectl wait --for condition=ready --timeout=300s model cifar10-drift -n ${NAMESPACE}
+```
+
+```
+model.mlops.seldon.io/cifar10 condition met
+model.mlops.seldon.io/cifar10-outlier met
+model.mlops.seldon.io/cifar10-drift condition met
+```
+{% endtab %}
+
+{% tab title="seldon-cli" %}
 ```bash
 seldon model load -f ../../models/cifar10.yaml
 seldon model load -f ../../models/cifar10-outlier-detect.yaml
@@ -186,6 +214,10 @@ seldon model status cifar10-drift -w ModelAvailable | jq .
 {}
 
 ```
+{% endtab %}
+{% endtabs %}
+
+
 
 ```bash
 cat ../../pipelines/cifar10.yaml
@@ -210,6 +242,23 @@ spec:
 
 ```
 
+{% tabs %}
+{% tab title="kubectl" %}
+
+```bash
+kubectl apply -f ../../pipelines/cifar10.yaml  -n ${NAMESPACE}
+```
+
+```bash
+kubectl wait --for condition=ready --timeout=300s pipelines cifar10-production -n ${NAMESPACE}
+```
+
+```
+pipeline.mlops.seldon.io/cifar10-production condition met
+```
+{% endtab %}
+
+{% tab title="seldon-cli" %}
 ```bash
 seldon pipeline load -f ../../pipelines/cifar10.yaml
 ```
@@ -261,6 +310,9 @@ seldon pipeline status cifar10-production -w PipelineReady | jq -M .
 }
 
 ```
+{% endtab %}
+{% endtabs %}
+
 
 ```python
 infer("cifar10-production.pipeline",20, "normal")
@@ -457,6 +509,20 @@ seldon.default.model.cifar10.outputs	cifeiq8fh5ss738i5bqg	{"modelName":"cifar10_
 
 ```
 
+
+{% tabs %}
+{% tab title="kubectl" %}
+```bash
+kubectl delete -f ../../piplines/cifar10.yaml -n ${NAMESPACE}
+kubectl delete -f ../../models/cifar10.yaml -n ${NAMESPACE}
+kubectl delete -f ../../models/cifar10-outlier-detect.yaml -n ${NAMESPACE}
+kubectl delete -f ../../models/cifar10-drift-detect.yaml -n ${NAMESPACE}
+```
+
+{% endtab %}
+
+{% tab title="seldon-cli" %}
+
 ```bash
 seldon pipeline unload cifar10-production
 ```
@@ -466,3 +532,5 @@ seldon model unload cifar10
 seldon model unload cifar10-outlier
 seldon model unload cifar10-drift
 ```
+{% endtab %}
+{% endtabs %}
@@ -20,29 +20,97 @@ spec:
 
 Load the model
 
+{% tabs %}
+
+{% tab title="kubectl" %}
 ```bash
-seldon model load -f ./models/hf-text-gen.yaml
+kubectl apply -f ./models/hf-text-gen.yaml
+```
+```bash
+model.mlops.seldon.io/text-gen created
 ```
+{% endtab %}
 
+{% tab title="seldon-cli" %}
+```bash
+seldon model load -f ./models/hf-text-gen.yaml
+```
 ```json
 {}
+```
+{% endtab %}
+
+{% endtabs %}
+
+
+Wait for the model to be ready
 
+{% tabs %}
+
+{% tab title="kubectl" %}
+```bash
+kubectl get model text-gen -n ${NAMESPACE} -o json | jq -r '.status.conditions[] | select(.message == "ModelAvailable") | .status'
 ```
+```bash
+True
+```
+{% endtab %}
+
+{% tab title="seldon-cli" %}
 
 ```bash
 seldon model status text-gen -w ModelAvailable | jq -M .
 ```
-
 ```json
 {}
+```
+{% endtab %}
+
+{% endtabs %}
+
+
+Do a REST inference call
+
+{% tabs %}
+
+{% tab title="kubectl" %}
+```bash
+curl --location 'http://${MESH_IP}:9000/v2/models/text-gen/infer' \
+	--header 'Content-Type: application/json'  \
+    --data '{"inputs": [{"name": "args","shape": [1],"datatype": "BYTES","data": ["Once upon a time in a galaxy far away"]}]}'
+```
+```json
+{
+	"model_name": "text-gen_1",
+	"model_version": "1",
+	"id": "121ff5f4-1d4a-46d0-9a5e-4cd3b11040df",
+	"parameters": {},
+	"outputs": [
+		{
+			"name": "output",
+			"shape": [
+				1,
+				1
+			],
+			"datatype": "BYTES",
+			"parameters": {
+				"content_type": "hg_jsonlist"
+			},
+			"data": [
+				"{\"generated_text\": \"Once upon a time in a galaxy far away, the planet is full of strange little creatures. A very strange combination of creatures in that universe, that is. A strange combination of creatures in that universe, that is. A kind of creature that is\"}"
+			]
+		}
+	]
+}
 
 ```
+{% endtab %}
 
+{% tab title="seldon-cli" %}
 ```bash
 seldon model infer text-gen \
   '{"inputs": [{"name": "args","shape": [1],"datatype": "BYTES","data": ["Once upon a time in a galaxy far away"]}]}'
 ```
-
 ```json
 {
 	"model_name": "text-gen_1",
@@ -68,6 +136,10 @@ seldon model infer text-gen \
 }
 
 ```
+{% endtab %}
+
+{% endtabs %}
+
 
 ```python
 res = !seldon model infer text-gen --inference-mode grpc \
@@ -88,9 +160,22 @@ b'{"generated_text": "Once upon a time in a galaxy far away\\n\\nThe Universe is
 
 Unload the model
 
+{% tabs %}
+
+{% tab title="kubectl" %}
+```bash
+kubectl delete model text-gen
+```
+{% endtab %}
+
+{% tab title="seldon-cli" %}
 ```bash
 seldon model unload text-gen
 ```
+{% endtab %}
+
+{% endtabs %}
+
 
 ### Custom Text Generation Model
 
@@ -112,6 +197,61 @@ spec:
 
 Load the model
 
+
+{% tabs %}
+
+{% tab title="kubectl" %}
+
+```bash
+kubectl apply -f ./models/hf-text-gen-custom-tiny-stories.yaml
+```
+
+```bash
+model.mlops.seldon.io/custom-tiny-stories-text-gen created
+```
+
+```bash
+kubectl get model custom-tiny-stories-text-gen -n ${NAMESPACE} -o json | jq -r '.status.conditions[] | select(.message == "ModelAvailable") | .status'
+```
+```bash
+True
+```
+
+```bash
+curl --location 'http://${MESH_IP}:9000/v2/models/custom-tiny-stories-text-gen/infer' \
+	--header 'Content-Type: application/json'  \
+    --data '{"inputs": [{"name": "args","shape": [1],"datatype": "BYTES","data": ["Once upon a time in a galaxy far away"]}]}'
+```
+
+```json
+{
+	"model_name": "custom-tiny-stories-text-gen_1",
+	"model_version": "1",
+	"id": "d0fce59c-76e2-4f81-9711-1c93d08bcbf9",
+	"parameters": {},
+	"outputs": [
+		{
+			"name": "output",
+			"shape": [
+				1,
+				1
+			],
+			"datatype": "BYTES",
+			"parameters": {
+				"content_type": "hg_jsonlist"
+			},
+			"data": [
+				"{\"generated_text\": \"Once upon a time in a galaxy far away. It was a very special place to live.\\n\"}"
+			]
+		}
+	]
+}
+
+```
+{% endtab %}
+
+{% tab title="seldon-cli" %}
+
 ```bash
 seldon model load -f ./models/hf-text-gen-custom-tiny-stories.yaml
 ```
@@ -160,6 +300,10 @@ seldon model infer custom-tiny-stories-text-gen \
 }
 
 ```
+{% endtab %}
+
+{% endtabs %}
+
 
 ```python
 res = !seldon model infer custom-tiny-stories-text-gen --inference-mode grpc \
@@ -180,9 +324,21 @@ b'{"generated_text": "Once upon a time in a galaxy far away\\nOne night, a littl
 
 Unload the model
 
+{% tabs %}
+{% tab title="kubectl" %}
+```bash
+kubectl delete custom-tiny-stories-text-gen
+```
+{% endtab %}
+
+{% tab title="seldon-cli" %}
 ```bash
 seldon model unload custom-tiny-stories-text-gen
 ```
+{% endtab %}
+{% endtabs %}
+
+
 
 ````
 As a next step, why not try running a larger-scale model? You can find a definition for one in ./models/hf-text-gen-custom-gpt2.yaml. However, you may need to request and allocate more memory!