Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Fail to enable batch jobs #4043

Open
octotocat opened this issue Jan 23, 2025 · 0 comments
Open

Fail to enable batch jobs #4043

octotocat opened this issue Jan 23, 2025 · 0 comments
Labels
kind/support Categorizes issue or PR as a support question.

Comments

@octotocat
Copy link

octotocat commented Jan 23, 2025

Kueue controller manager container failed to start if I enable batch:
Config refers to https://kueue.sigs.k8s.io/docs/tasks/manage/setup_wait_for_pods_ready/

managerConfig:
  controllerManagerConfigYaml: |-
...
  waitForPodsReady:
      enable: true
      timeout: 5m
      blockAdmission: true
      requeuingStrategy:
        timestamp: Eviction | Creation
        backoffLimitCount: 5 # null indicates infinite requeuing
        backoffBaseSeconds: 60
        backoffMaxSeconds: 3600

with pod integration:

   integrations:
      frameworks:
      - "pod"
      - "deployment"
      podOptions:
        namespaceSelector:
          matchExpressions:
            - key: kubernetes.io/metadata.name
              operator: NotIn
              values: [ kube-system, kueue-system ]
            - key: kueue-job
              operator: In
              values: [ "true", "True", "yes" ]

Error: Back-off restarting failed container manager in pod kueue-controller-manager-xxx. No container log as container failed to start.
Any clue?
Version: v0.10.1.
Same config used to work with v0.8.1. After upgraded to v10, didn't work, and when I downgraded to v8 again, still didn't work.

@octotocat octotocat added the kind/support Categorizes issue or PR as a support question. label Jan 23, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
kind/support Categorizes issue or PR as a support question.
Projects
None yet
Development

No branches or pull requests

1 participant