fix: Delete all versions of a deleted model #6193

abhimanyu003 · 2025-01-15T08:58:13Z

What this PR does / why we need it:

How to replicate the issue

Apply model

apiVersion: mlops.seldon.io/v1alpha1
kind: Model
metadata:
  name: add10
spec:
  storageUri: "gs://seldon-models/scv2/samples/triton_23-03/add10"
  requirements:
  - triton
  - python

Update manifest by applying faulty update

apiVersion: mlops.seldon.io/v1alpha1
kind: Model
metadata:
  name: add10
spec:
  storageUri: "gs://seldon-models/scv2/samples/triton_23-03/add10-faulty-url"
  requirements:
  - triton
  - python

Do seldon model unload add10
Model will remain in memory and never get cleand-up

Which issue(s) this PR fixes:
Fixes #INFRA-1230

sakoush

This change is potentially going to cause downtime. This will need to be revisited.

sakoush · 2025-01-16T11:45:51Z

scheduler/pkg/agent/client.go

+		}
+		c.logger.WithField("func", c.stateManager.modelVersions.getVersionsForAllModels()).Infof("----")
+
+		err := c.ModelRepository.RemoveModelVersion(util.GetVersionedModelName(modelName, v.GetVersion()))


This flow is probably not the correct flow as we are not yet sure that we should be removing the old version of this model.

In progressive rollout we have to make sure that the new version of the model is up before removing an old version otherwise we risk inference requests not being served in this transition.

I dont think this is the correct place also to do the clean up as one agent (i.e. a server replica loading one model replica) doesnt have visibility with regards to the actual state of the new version of the model (e.g. that the envoy routes have been updated). Therefore I think that the clean up process should be done somewhere at the scheduler side and not on the agent.

fix: Delete all versions of a deleted model

36ddd9e

abhimanyu003 requested review from sakoush and lc525 as code owners January 15, 2025 08:58

sakoush added the v2 label Jan 16, 2025

sakoush reviewed Jan 16, 2025

View reviewed changes

sakoush marked this pull request as draft January 16, 2025 11:48

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix: Delete all versions of a deleted model #6193

fix: Delete all versions of a deleted model #6193

abhimanyu003 commented Jan 15, 2025 •

edited

Loading

sakoush left a comment

sakoush Jan 16, 2025

fix: Delete all versions of a deleted model #6193

Are you sure you want to change the base?

fix: Delete all versions of a deleted model #6193

Conversation

abhimanyu003 commented Jan 15, 2025 • edited Loading

sakoush left a comment

Choose a reason for hiding this comment

sakoush Jan 16, 2025

Choose a reason for hiding this comment

abhimanyu003 commented Jan 15, 2025 •

edited

Loading