feat(operator): Included LLM spec to CRD #6234

RobertSamoilescu · 2025-01-27T12:18:25Z

What this PR does / why we need it:

This PR adds support for the llm tag in the model CRD. Similarly to the explainer tag, the llm tag allows to reference an inference model by the name. This specification is used in the PromptRuntime for the LLM Module.

Example usage:

apiVersion: mlops.seldon.io/v1alpha1
kind: Model
metadata:
  name: chat-completions
spec:
  storageUri: "gs://seldon-models/llm-runtimes/prompting-test/models/chat-completions/"
  llm:
    modelRef: tiny-llama
  requirements:
  - prompt

Which issue(s) this PR fixes:

Fixes #

Special notes for your reviewer:

CLAassistant · 2025-01-27T12:18:32Z

All committers have signed the CLA.

sakoush

In general it looks good, I left some comments for consideration.

apis/mlops/scheduler/scheduler.proto

scheduler/pkg/agent/repository/mlserver/mlserver.go

sakoush · 2025-01-28T14:14:21Z

operator/apis/mlops/v1alpha1/model_types_test.go

@@ -96,6 +96,9 @@ func TestAsModelDetails(t *testing.T) {
 						Type:     "anchor_tabular",
 						ModelRef: &incomeModel,
 					},
+					Llm: &LlmSpec{


nit: I think that we should have a test case specific to LLM spec and not have it with explainer. In fact the user should not be able to set the two together and this can should be marked as invalid perhaps.

operator/apis/mlops/v1alpha1/model_types.go

Co-authored-by: Sherif Akoush <[email protected]>

sakoush

LGTM!

RobertSamoilescu added 5 commits January 27, 2025 10:41

Included llm spec into operator

8b164c1

Implemented SetLlm in scheduler and included associated test

5ce2bee

Generated protocol buffers

b28dc15

Generated k8s crds

9c5a0d8

Refactored SetExplainer and SetLlm to avoid code duplication

1df90d1

RobertSamoilescu requested review from sakoush and lc525 as code owners January 27, 2025 12:18

Refactored TestSetExplainer and TestSetLlm to avoid code duplication

5ce0906

RobertSamoilescu marked this pull request as draft January 27, 2025 13:24

Fixed operator test for model types

f8db9f3

sakoush added the v2 label Jan 27, 2025

Fixed operator model_types

75285d7

RobertSamoilescu marked this pull request as ready for review January 27, 2025 16:57

Fixed crd config

78b3c71

RobertSamoilescu force-pushed the feature/llm-spec branch from 6b0e24a to 1ac617b Compare January 27, 2025 18:17

Regenerated manifests and deepcopy

a698f39

RobertSamoilescu force-pushed the feature/llm-spec branch from 1ac617b to a698f39 Compare January 27, 2025 18:31

RobertSamoilescu requested review from sakoush and removed request for sakoush and lc525 January 27, 2025 18:49

sakoush changed the title ~~Included LLM spec to CRD~~ feat(operator): Included LLM spec to CRD Jan 28, 2025

sakoush reviewed Jan 28, 2025

View reviewed changes

RobertSamoilescu and others added 7 commits January 28, 2025 14:29

Update scheduler/pkg/agent/repository/mlserver/mlserver.go

eeb7a78

Co-authored-by: Sherif Akoush <[email protected]>

Keep original id/ordering and make explainer and llm of type oneof

8a12c76

Renamed setModelSettings to setExtraParameters

6842e16

Updated operator for mutual exclusive explainer and llm

3918179

Included model spec validation test

9a9bd1e

Use map as param in customize function in scheduler

81e2fa1

Fix return err from spec validation step

6a60422

sakoush approved these changes Jan 28, 2025

View reviewed changes

Replaced customize function with map

579ab9e

RobertSamoilescu force-pushed the feature/llm-spec branch from e006dc0 to 579ab9e Compare January 28, 2025 17:48

RobertSamoilescu merged commit 08fa567 into SeldonIO:v2 Jan 29, 2025
3 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat(operator): Included LLM spec to CRD #6234

feat(operator): Included LLM spec to CRD #6234

RobertSamoilescu commented Jan 27, 2025 •

edited

Loading

CLAassistant commented Jan 27, 2025 •

edited

Loading

sakoush left a comment

sakoush Jan 28, 2025

sakoush left a comment

feat(operator): Included LLM spec to CRD #6234

feat(operator): Included LLM spec to CRD #6234

Conversation

RobertSamoilescu commented Jan 27, 2025 • edited Loading

CLAassistant commented Jan 27, 2025 • edited Loading

sakoush left a comment

Choose a reason for hiding this comment

sakoush Jan 28, 2025

Choose a reason for hiding this comment

sakoush left a comment

Choose a reason for hiding this comment

RobertSamoilescu commented Jan 27, 2025 •

edited

Loading

CLAassistant commented Jan 27, 2025 •

edited

Loading