[Bug] Bubble ImagePullErr and ImagePullBackoff to the Ray CRD #2387

Open
1 of 2 tasks
EngHabu opened this issue Sep 17, 2024 · 1 comment
Labels
bug (Something isn't working), triage


EngHabu commented Sep 17, 2024

Search before asking

  • I searched the issues and found no similar issues.

KubeRay Component

ray-operator

What happened + What you expected to happen

I deployed a RayJob with a bad image reference (the image does not exist).
The RayJob stayed in the "Initializing" phase and was never updated: the error from starting the driver Pod did not bubble up to the RayJob status.
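
For anyone trying to confirm this failure mode while the RayJob status stays silent, here is a minimal sketch of the check that would need to be surfaced. It is not KubeRay code. It assumes the Pod has been dumped as JSON (for example with `kubectl get pod <name> -o json`) and loaded into a plain dict. Kubernetes reports these failures as the `reason` of a container's `waiting` state, spelled `ErrImagePull` and `ImagePullBackOff`. The function name, Pod name, container name, and image below are hypothetical and only for illustration.

```python
from __future__ import annotations

# Reasons Kubernetes sets on a container's `waiting` state when an image pull fails.
IMAGE_PULL_REASONS = {"ErrImagePull", "ImagePullBackOff"}


def image_pull_failures(pod: dict) -> list[tuple[str, str, str]]:
    """Return (container name, reason, message) for containers stuck on an image pull.

    `pod` is a Pod object as a plain dict, e.g. parsed from
    `kubectl get pod <name> -o json`. Hypothetical helper, not part of KubeRay.
    """
    status = pod.get("status") or {}
    # Check init containers as well as regular containers.
    statuses = (status.get("initContainerStatuses") or []) + (
        status.get("containerStatuses") or []
    )
    failures = []
    for cs in statuses:
        waiting = (cs.get("state") or {}).get("waiting") or {}
        reason = waiting.get("reason", "")
        if reason in IMAGE_PULL_REASONS:
            failures.append((cs.get("name", ""), reason, waiting.get("message", "")))
    return failures


# Hypothetical driver Pod, trimmed to the fields the check reads.
example_pod = {
    "metadata": {"name": "rayjob-sample-driver"},
    "status": {
        "phase": "Pending",
        "containerStatuses": [
            {
                "name": "ray-job-submitter",
                "state": {
                    "waiting": {
                        "reason": "ImagePullBackOff",
                        "message": 'Back-off pulling image "example.invalid/ray:missing"',
                    }
                },
            }
        ],
    },
}

for name, reason, message in image_pull_failures(example_pod):
    print(f"{name}: {reason}: {message}")
```

A Pod in this state typically stays in the `Pending` phase, so a status that only reflects the Pod phase would not reveal the image pull failure.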

Reproduction script

TBD

Anything else

No response

Are you willing to submit a PR?

  • Yes I am willing to submit a PR!

EngHabu added the bug and triage labels Sep 17, 2024

andrewsykim (Collaborator) commented

We are working on improving RayCluster observability with new conditions APIs which should hopefully surface these types of failures.

@rueian do you know if the existing implementation would surface ImagePullErr and ImagePullBackoff errors?

See https://docs.google.com/document/d/1bRL0cZa87eCX6SI7gqthN68CgmHaB6l3-vJuIse-BrY/edit?usp=sharing for more details.
