Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Error on reconciliation of odh-model-controller #86

Closed
vaibhavjainwiz opened this issue Sep 20, 2023 · 8 comments · Fixed by #100 or #111
Closed

Error on reconciliation of odh-model-controller #86

vaibhavjainwiz opened this issue Sep 20, 2023 · 8 comments · Fixed by #100 or #111
Assignees
Labels
kind/bug Something isn't working rhods-2.5

Comments

@vaibhavjainwiz
Copy link
Member

Below issue is coming on reconciliation of odh-model-controller.

2023-09-20T00:50:14Z	ERROR	Reconciler error	{"controller": "inferenceservice", "controllerGroup": "serving.kserve.io", "controllerKind": "InferenceService", "InferenceService": {"name":"caikit-example-isvc","namespace":"kserve-demo"}, "namespace": "kserve-demo", "name": "caikit-example-isvc", "reconcileID": "f399dd1f-1ee1-45c2-badc-c37507bc51fe", "error": "servicemonitors.monitoring.coreos.com \"caikit-example-isvc-metrics\" is invalid: metadata.resourceVersion: Invalid value: 0x0: must be specified for an update"}
sigs.k8s.io/controller-runtime/pkg/internal/controller.(*Controller).reconcileHandler
	/opt/app-root/src/go/pkg/mod/sigs.k8s.io/[email protected]/pkg/internal/controller/controller.go:329
sigs.k8s.io/controller-runtime/pkg/internal/controller.(*Controller).processNextWorkItem
	/opt/app-root/src/go/pkg/mod/sigs.k8s.io/[email protected]/pkg/internal/controller/controller.go:274
sigs.k8s.io/controller-runtime/pkg/internal/controller.(*Controller).Start.func2.2
	/opt/app-root/src/go/pkg/mod/sigs.k8s.io/[email protected]/pkg/internal/controller/controller.go:235

Currently this error is not visible in logs because error object is not send back. We capture the error and then just ignore it. In below line we are capturing err but never return it to calling function.
https://github.com/opendatahub-io/odh-model-controller/blob/main/controllers/inferenceservice_controller.go#L99

@vaibhavjainwiz vaibhavjainwiz self-assigned this Sep 20, 2023
@heyselbi heyselbi moved this from New/Backlog to To-do/Groomed in ODH Model Serving Planning Sep 20, 2023
@vaibhavjainwiz
Copy link
Member Author

depends on changes of #84

@vaibhavjainwiz vaibhavjainwiz moved this to In Progress in Internal tracking Oct 10, 2023
@vaibhavjainwiz vaibhavjainwiz moved this to Model Serving in ODH Feature Tracking Oct 10, 2023
@vaibhavjainwiz vaibhavjainwiz moved this from To-do/Groomed to In Progress in ODH Model Serving Planning Oct 10, 2023
@heyselbi heyselbi added the kind/bug Something isn't working label Oct 11, 2023
@vaibhavjainwiz vaibhavjainwiz linked a pull request Oct 12, 2023 that will close this issue
3 tasks
@github-project-automation github-project-automation bot moved this from In Progress to Done in ODH Model Serving Planning Oct 23, 2023
@github-project-automation github-project-automation bot moved this from In Progress to Done in Internal tracking Oct 23, 2023
@heyselbi
Copy link
Contributor

@rpancham will retest it. Once test passes, @vaibhavjainwiz please make sure it is in RHODS 2.4.

@heyselbi heyselbi reopened this Oct 25, 2023
@github-project-automation github-project-automation bot moved this from Done to New/Backlog in ODH Model Serving Planning Oct 25, 2023
@heyselbi heyselbi moved this from New/Backlog to Under Review in ODH Model Serving Planning Oct 25, 2023
@vaibhavjainwiz vaibhavjainwiz linked a pull request Oct 26, 2023 that will close this issue
3 tasks
@github-project-automation github-project-automation bot moved this from Under Review to Done in ODH Model Serving Planning Oct 26, 2023
@bdattoma
Copy link

bdattoma commented Nov 8, 2023

@vaibhavjainwiz how can I try to reproduce the issue?

@vaibhavjainwiz
Copy link
Member Author

These error will not be visible in logs but you could see the impacts it on ServiceMeshMemberRole.
If you delete the Namespace from ServiceMeshMemberRole in running application then ideally it would add back by operator but beacause of this error it would not added back.

@bdattoma
Copy link

These error will not be visible in logs but you could see the impacts it on ServiceMeshMemberRole. If you delete the Namespace from ServiceMeshMemberRole in running application then ideally it would add back by operator but beacause of this error it would not added back.

I tried, and don't see any error in odh-model-controller pod.
However, the ServiceMeshMemberRoll doesn't get reconcilied automatically.

@heyselbi heyselbi reopened this Nov 16, 2023
@github-project-automation github-project-automation bot moved this from Done to New/Backlog in ODH Model Serving Planning Nov 16, 2023
@heyselbi heyselbi moved this from New/Backlog to To-do/Groomed in ODH Model Serving Planning Nov 16, 2023
@israel-hdez israel-hdez self-assigned this Nov 16, 2023
@israel-hdez
Copy link
Contributor

I updated the label to rhods-2.5, because the fix is missing on 2.4.
Once there is a build for 2.5 I'll re-check and close if I the fix is correctly in place.

@bdattoma
Copy link

I updated the label to rhods-2.5, because the fix is missing on 2.4. Once there is a build for 2.5 I'll re-check and close if I the fix is correctly in place.

sounds good, thank you!

@heyselbi heyselbi moved this from To-do/Groomed to Under Review in ODH Model Serving Planning Nov 27, 2023
@israel-hdez
Copy link
Contributor

The issue seems to be gone in 2.5-RC
Closing.

@github-project-automation github-project-automation bot moved this from Under Review to Done in ODH Model Serving Planning Dec 4, 2023
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
kind/bug Something isn't working rhods-2.5
Projects
Status: Done
Status: Model Serving
Status: Done
4 participants