argo-workflows/metrics/ #10319
Replies: 2 comments 4 replies
-
I've deployed Argo inside a local kind cluster with - kubectl create namespace argo
kubectl create \
--filename https://github.com/argoproj/argo-workflows/releases/download/v3.4.4/namespace-install.yaml \
--namespace argo
kubectl wait deployment workflow-controller \
--for condition=Available \
--namespace argo
kubectl create rolebinding default-admin \
--clusterrole cluster-admin \
--namespace argo \
--serviceaccount=argo:default However, when I want to label the Here's the output of NAME TYPE CLUSTER-IP EXTERNAL-IP PORT(S) AGE
argo-server ClusterIP 10.96.45.141 <none> 2746/TCP 20m Is my argo installation faulty? Why can't kubectl find the |
Beta Was this translation helpful? Give feedback.
-
Template level CPU/Memory Metrics are not collected if the retryStrategy is enabled. Hello Experts, I am using Template level metric to collect CPU and Memory information. For example, the CPU metric is defined as below at template level: metrics: {
"prometheus": [
{
"name": "template_exec_cpu_gauge",
"labels": [
{
"key": "template_name",
"value": "My_Template_Name"
}
],
"help": "CPU gauge by template name",
"gauge": {
"value": "{{resourcesDuration.cpu}}",
"realtime": false
}
}
]
} What I find is, if the "retryStrategy": {
"limit": 10
}, Then from Argo server metric endpoint (http://localhost:9090/metrics), the CPU gague is always 0:
However, it is a heavy computation. When I click this template node (both on Retry node and POD node) of the workflow in the Argo workflow UI, I can see the CPU and memory usage:
When I disable
Is it expected behavior? How can I collect template CPU and memory metric when Thanks very much for your help in advance. |
Beta Was this translation helpful? Give feedback.
-
argo-workflows/metrics/
https://argoproj.github.io/argo-workflows/metrics/
Beta Was this translation helpful? Give feedback.
All reactions