[YUNIKORN-1626] Listing Yunikorn metrics revealed in the prometheus
0yukali0 committed Oct 29, 2023
1 parent 58312fb commit 4c708bb
Showing 2 changed files with 92 additions and 105 deletions.
6 changes: 3 additions & 3 deletions docs/metrics/queue.md
@@ -36,7 +36,7 @@ These application states include `running`, `accepted`, `rejected`, `failed` and

**TYPE**: `yunikorn_<queue name>_queue_app`

-```
+```json
yunikorn_root_default_queue_app{state="accepted"} 3
yunikorn_root_default_queue_app{state="running"} 3
```
@@ -53,11 +53,11 @@ These resource states include `guaranteed`, `max`, `allocated`, `pending`, `pree

**TYPE**: `yunikorn_<queue name>_queue_resource`

-```
+```json
yunikorn_root_queue_resource{resource="ephemeral-storage",state="max"} 9.41009558e+10
yunikorn_root_queue_resource{resource="hugepages-1Gi",state="max"} 0
yunikorn_root_queue_resource{resource="hugepages-2Mi",state="max"} 0
yunikorn_root_queue_resource{resource="memory",state="max"} 1.6223076352e+10
yunikorn_root_queue_resource{resource="pods",state="max"} 110
yunikorn_root_queue_resource{resource="vcore",state="max"} 8000
-```
+```
191 changes: 89 additions & 102 deletions docs/metrics/scheduler.mdx
@@ -84,7 +84,7 @@ This metric includes latencies, such as `Node sorting`, `Trynode` and `Trypreemp

**Subsystem**: `scheduler`

-```
+```json
yunikorn_scheduler_scheduling_latency_milliseconds_bucket{le="0.0001"} 0
yunikorn_scheduler_scheduling_latency_milliseconds_bucket{le="0.001"} 0
yunikorn_scheduler_scheduling_latency_milliseconds_bucket{le="0.01"} 0
@@ -117,53 +117,47 @@ Latencies including `node sorting`, `application sorting` and `queue sorting`, i
]}>
<TabItem value="app_sorting">

-```json
-/*
-yunikorn_scheduler_node_sorting_latency_milliseconds_bucket{level="app",le="0.0001"} 5
-yunikorn_scheduler_node_sorting_latency_milliseconds_bucket{level="app",le="0.001"} 6
-yunikorn_scheduler_node_sorting_latency_milliseconds_bucket{level="app",le="0.01"} 6
-yunikorn_scheduler_node_sorting_latency_milliseconds_bucket{level="app",le="0.1"} 6
-yunikorn_scheduler_node_sorting_latency_milliseconds_bucket{level="app",le="1"} 6
-yunikorn_scheduler_node_sorting_latency_milliseconds_bucket{level="app",le="10"} 6
-yunikorn_scheduler_node_sorting_latency_milliseconds_bucket{level="app",le="+Inf"} 6
-yunikorn_scheduler_node_sorting_latency_milliseconds_sum{level="app"} 0.00026345400000000004
-yunikorn_scheduler_node_sorting_latency_milliseconds_count{level="app"} 6
-*/
-```
+```json
+yunikorn_scheduler_node_sorting_latency_milliseconds_bucket{level="app",le="0.0001"} 5
+yunikorn_scheduler_node_sorting_latency_milliseconds_bucket{level="app",le="0.001"} 6
+yunikorn_scheduler_node_sorting_latency_milliseconds_bucket{level="app",le="0.01"} 6
+yunikorn_scheduler_node_sorting_latency_milliseconds_bucket{level="app",le="0.1"} 6
+yunikorn_scheduler_node_sorting_latency_milliseconds_bucket{level="app",le="1"} 6
+yunikorn_scheduler_node_sorting_latency_milliseconds_bucket{level="app",le="10"} 6
+yunikorn_scheduler_node_sorting_latency_milliseconds_bucket{level="app",le="+Inf"} 6
+yunikorn_scheduler_node_sorting_latency_milliseconds_sum{level="app"} 0.00026345400000000004
+yunikorn_scheduler_node_sorting_latency_milliseconds_count{level="app"} 6
+```

</TabItem>
<TabItem value="node_sorting">

-```json
-/*
-yunikorn_scheduler_node_sorting_latency_milliseconds_bucket{level="node",le="0.0001"} 3
-yunikorn_scheduler_node_sorting_latency_milliseconds_bucket{level="node",le="0.001"} 3
-yunikorn_scheduler_node_sorting_latency_milliseconds_bucket{level="node",le="0.01"} 3
-yunikorn_scheduler_node_sorting_latency_milliseconds_bucket{level="node",le="0.1"} 3
-yunikorn_scheduler_node_sorting_latency_milliseconds_bucket{level="node",le="1"} 3
-yunikorn_scheduler_node_sorting_latency_milliseconds_bucket{level="node",le="10"} 3
-yunikorn_scheduler_node_sorting_latency_milliseconds_bucket{level="node",le="+Inf"} 3
-yunikorn_scheduler_node_sorting_latency_milliseconds_sum{level="node"} 2.5013999999999998e-05
-yunikorn_scheduler_node_sorting_latency_milliseconds_count{level="node"} 3
-*/
-```
+```json
+yunikorn_scheduler_node_sorting_latency_milliseconds_bucket{level="node",le="0.0001"} 3
+yunikorn_scheduler_node_sorting_latency_milliseconds_bucket{level="node",le="0.001"} 3
+yunikorn_scheduler_node_sorting_latency_milliseconds_bucket{level="node",le="0.01"} 3
+yunikorn_scheduler_node_sorting_latency_milliseconds_bucket{level="node",le="0.1"} 3
+yunikorn_scheduler_node_sorting_latency_milliseconds_bucket{level="node",le="1"} 3
+yunikorn_scheduler_node_sorting_latency_milliseconds_bucket{level="node",le="10"} 3
+yunikorn_scheduler_node_sorting_latency_milliseconds_bucket{level="node",le="+Inf"} 3
+yunikorn_scheduler_node_sorting_latency_milliseconds_sum{level="node"} 2.5013999999999998e-05
+yunikorn_scheduler_node_sorting_latency_milliseconds_count{level="node"} 3
+```

</TabItem>
<TabItem value="queue_sorting">

-```json
-/*
-yunikorn_scheduler_node_sorting_latency_milliseconds_bucket{level="queue",le="0.0001"} 9
-yunikorn_scheduler_node_sorting_latency_milliseconds_bucket{level="queue",le="0.001"} 9
-yunikorn_scheduler_node_sorting_latency_milliseconds_bucket{level="queue",le="0.01"} 9
-yunikorn_scheduler_node_sorting_latency_milliseconds_bucket{level="queue",le="0.1"} 9
-yunikorn_scheduler_node_sorting_latency_milliseconds_bucket{level="queue",le="1"} 9
-yunikorn_scheduler_node_sorting_latency_milliseconds_bucket{level="queue",le="10"} 9
-yunikorn_scheduler_node_sorting_latency_milliseconds_bucket{level="queue",le="+Inf"} 9
-yunikorn_scheduler_node_sorting_latency_milliseconds_sum{level="queue"} 4.0093e-05
-yunikorn_scheduler_node_sorting_latency_milliseconds_count{level="queue"} 9
-*/
-```
+```json
+yunikorn_scheduler_node_sorting_latency_milliseconds_bucket{level="queue",le="0.0001"} 9
+yunikorn_scheduler_node_sorting_latency_milliseconds_bucket{level="queue",le="0.001"} 9
+yunikorn_scheduler_node_sorting_latency_milliseconds_bucket{level="queue",le="0.01"} 9
+yunikorn_scheduler_node_sorting_latency_milliseconds_bucket{level="queue",le="0.1"} 9
+yunikorn_scheduler_node_sorting_latency_milliseconds_bucket{level="queue",le="1"} 9
+yunikorn_scheduler_node_sorting_latency_milliseconds_bucket{level="queue",le="10"} 9
+yunikorn_scheduler_node_sorting_latency_milliseconds_bucket{level="queue",le="+Inf"} 9
+yunikorn_scheduler_node_sorting_latency_milliseconds_sum{level="queue"} 4.0093e-05
+yunikorn_scheduler_node_sorting_latency_milliseconds_count{level="queue"} 9
+```

</TabItem>
</Tabs>
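The sorting-latency samples in the tabs above follow the usual Prometheus histogram layout: `_bucket` series are cumulative per `le` bound, and `_sum` divided by `_count` gives the mean observation. As a minimal sketch, assuming the samples are available as plain exposition text, the mean per `level` label could be derived like this. The helper names `parse_samples` and `mean_latency_by_level` are hypothetical and not part of YuniKorn or any Prometheus client.

```python
import re

# Sample lines copied from the app_sorting tab above (buckets shortened).
SAMPLE = """\
yunikorn_scheduler_node_sorting_latency_milliseconds_bucket{level="app",le="0.0001"} 5
yunikorn_scheduler_node_sorting_latency_milliseconds_bucket{level="app",le="+Inf"} 6
yunikorn_scheduler_node_sorting_latency_milliseconds_sum{level="app"} 0.00026345400000000004
yunikorn_scheduler_node_sorting_latency_milliseconds_count{level="app"} 6
"""

# name{label="value",...} value  (simplified: no escapes, no timestamps)
LINE_RE = re.compile(r'^(?P<name>[A-Za-z_:][A-Za-z0-9_:]*)\{(?P<labels>[^}]*)\}\s+(?P<value>\S+)$')
LABEL_RE = re.compile(r'(\w+)="([^"]*)"')


def parse_samples(text):
    """Return (metric name, label dict, float value) for each labelled sample line."""
    samples = []
    for line in text.splitlines():
        match = LINE_RE.match(line.strip())
        if match is None:
            continue  # skip blanks, comments, and lines without labels
        labels = dict(LABEL_RE.findall(match.group("labels")))
        samples.append((match.group("name"), labels, float(match.group("value"))))
    return samples


def mean_latency_by_level(samples):
    """Divide each histogram's _sum by its _count, keyed by the `level` label."""
    sums, counts = {}, {}
    for name, labels, value in samples:
        level = labels.get("level")
        if name.endswith("_sum"):
            sums[level] = value
        elif name.endswith("_count"):
            counts[level] = value
    # Levels with a zero or missing count are left out to avoid dividing by zero.
    return {lvl: sums[lvl] / counts[lvl] for lvl in sums if counts.get(lvl)}


if __name__ == "__main__":
    for level, mean in mean_latency_by_level(parse_samples(SAMPLE)).items():
        print(level, mean)
```

The regular expressions cover only the simple label syntax seen in these samples; a real scraper would normally rely on a Prometheus client library's parser instead.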
@@ -179,7 +173,7 @@ Latency of node condition checks for container allocations, such as placement co

**Subsystem**: `scheduler`

-```
+```json
yunikorn_scheduler_trynode_latency_milliseconds_bucket{le="0.0001"} 0
yunikorn_scheduler_trynode_latency_milliseconds_bucket{le="0.001"} 0
yunikorn_scheduler_trynode_latency_milliseconds_bucket{le="0.01"} 0
@@ -201,7 +195,8 @@ Latency of preemption condition checks for container allocations, in millisecond
**Namespace**: `yunikorn`

**Subsystem**: `scheduler`
-```
+
+```json
yunikorn_scheduler_trypreemption_latency_milliseconds_bucket{le="0.0001"} 0
yunikorn_scheduler_trypreemption_latency_milliseconds_bucket{le="0.001"} 0
yunikorn_scheduler_trypreemption_latency_milliseconds_bucket{le="0.01"} 0
@@ -249,78 +244,70 @@ Total resource usage of node, by resource name.
]}>
<TabItem value="ephemeral_storage">
-```json
-yunikorn_scheduler_ephemeral_storage_node_usage_total
-/*
-yunikorn_scheduler_ephemeral_storage_node_usage_total{range="(10%, 20%]"} 0
-yunikorn_scheduler_ephemeral_storage_node_usage_total{range="(20%,30%]"} 0
-yunikorn_scheduler_ephemeral_storage_node_usage_total{range="(30%,40%]"} 0
-yunikorn_scheduler_ephemeral_storage_node_usage_total{range="(40%,50%]"} 0
-yunikorn_scheduler_ephemeral_storage_node_usage_total{range="(50%,60%]"} 0
-yunikorn_scheduler_ephemeral_storage_node_usage_total{range="(60%,70%]"} 0
-yunikorn_scheduler_ephemeral_storage_node_usage_total{range="(70%,80%]"} 0
-yunikorn_scheduler_ephemeral_storage_node_usage_total{range="(80%,90%]"} 0
-yunikorn_scheduler_ephemeral_storage_node_usage_total{range="(90%,100%]"} 0
-yunikorn_scheduler_ephemeral_storage_node_usage_total{range="[0,10%]"} 1
-*/
-```
+```json
+yunikorn_scheduler_ephemeral_storage_node_usage_total
+yunikorn_scheduler_ephemeral_storage_node_usage_total{range="(10%, 20%]"} 0
+yunikorn_scheduler_ephemeral_storage_node_usage_total{range="(20%,30%]"} 0
+yunikorn_scheduler_ephemeral_storage_node_usage_total{range="(30%,40%]"} 0
+yunikorn_scheduler_ephemeral_storage_node_usage_total{range="(40%,50%]"} 0
+yunikorn_scheduler_ephemeral_storage_node_usage_total{range="(50%,60%]"} 0
+yunikorn_scheduler_ephemeral_storage_node_usage_total{range="(60%,70%]"} 0
+yunikorn_scheduler_ephemeral_storage_node_usage_total{range="(70%,80%]"} 0
+yunikorn_scheduler_ephemeral_storage_node_usage_total{range="(80%,90%]"} 0
+yunikorn_scheduler_ephemeral_storage_node_usage_total{range="(90%,100%]"} 0
+yunikorn_scheduler_ephemeral_storage_node_usage_total{range="[0,10%]"} 1
+```

</TabItem>
<TabItem value="memory">

-```json
-yunikorn_scheduler_memory_node_usage_total
-/*
-yunikorn_scheduler_memory_node_usage_total{range="(10%, 20%]"} 0
-yunikorn_scheduler_memory_node_usage_total{range="(20%,30%]"} 0
-yunikorn_scheduler_memory_node_usage_total{range="(30%,40%]"} 0
-yunikorn_scheduler_memory_node_usage_total{range="(40%,50%]"} 0
-yunikorn_scheduler_memory_node_usage_total{range="(50%,60%]"} 0
-yunikorn_scheduler_memory_node_usage_total{range="(60%,70%]"} 0
-yunikorn_scheduler_memory_node_usage_total{range="(70%,80%]"} 0
-yunikorn_scheduler_memory_node_usage_total{range="(80%,90%]"} 0
-yunikorn_scheduler_memory_node_usage_total{range="(90%,100%]"} 0
-yunikorn_scheduler_memory_node_usage_total{range="[0,10%]"} 1
-*/
-```
+```json
+yunikorn_scheduler_memory_node_usage_total
+yunikorn_scheduler_memory_node_usage_total{range="(10%, 20%]"} 0
+yunikorn_scheduler_memory_node_usage_total{range="(20%,30%]"} 0
+yunikorn_scheduler_memory_node_usage_total{range="(30%,40%]"} 0
+yunikorn_scheduler_memory_node_usage_total{range="(40%,50%]"} 0
+yunikorn_scheduler_memory_node_usage_total{range="(50%,60%]"} 0
+yunikorn_scheduler_memory_node_usage_total{range="(60%,70%]"} 0
+yunikorn_scheduler_memory_node_usage_total{range="(70%,80%]"} 0
+yunikorn_scheduler_memory_node_usage_total{range="(80%,90%]"} 0
+yunikorn_scheduler_memory_node_usage_total{range="(90%,100%]"} 0
+yunikorn_scheduler_memory_node_usage_total{range="[0,10%]"} 1
+```

</TabItem>
<TabItem value="pods">

-```json
-yunikorn_scheduler_pods_node_usage_total
-/*
-yunikorn_scheduler_pods_node_usage_total{range="(10%, 20%]"} 0
-yunikorn_scheduler_pods_node_usage_total{range="(20%,30%]"} 0
-yunikorn_scheduler_pods_node_usage_total{range="(30%,40%]"} 0
-yunikorn_scheduler_pods_node_usage_total{range="(40%,50%]"} 0
-yunikorn_scheduler_pods_node_usage_total{range="(50%,60%]"} 0
-yunikorn_scheduler_pods_node_usage_total{range="(60%,70%]"} 0
-yunikorn_scheduler_pods_node_usage_total{range="(70%,80%]"} 0
-yunikorn_scheduler_pods_node_usage_total{range="(80%,90%]"} 0
-yunikorn_scheduler_pods_node_usage_total{range="(90%,100%]"} 0
-yunikorn_scheduler_pods_node_usage_total{range="[0,10%]"} 1
-*/
-```
+```json
+yunikorn_scheduler_pods_node_usage_total
+yunikorn_scheduler_pods_node_usage_total{range="(10%, 20%]"} 0
+yunikorn_scheduler_pods_node_usage_total{range="(20%,30%]"} 0
+yunikorn_scheduler_pods_node_usage_total{range="(30%,40%]"} 0
+yunikorn_scheduler_pods_node_usage_total{range="(40%,50%]"} 0
+yunikorn_scheduler_pods_node_usage_total{range="(50%,60%]"} 0
+yunikorn_scheduler_pods_node_usage_total{range="(60%,70%]"} 0
+yunikorn_scheduler_pods_node_usage_total{range="(70%,80%]"} 0
+yunikorn_scheduler_pods_node_usage_total{range="(80%,90%]"} 0
+yunikorn_scheduler_pods_node_usage_total{range="(90%,100%]"} 0
+yunikorn_scheduler_pods_node_usage_total{range="[0,10%]"} 1
+```

</TabItem>
<TabItem value="vcore">

-```json
-yunikorn_scheduler_vcore_node_usage_total
-/*
-yunikorn_scheduler_vcore_node_usage_total{range="(10%, 20%]"} 0
-yunikorn_scheduler_vcore_node_usage_total{range="(20%,30%]"} 0
-yunikorn_scheduler_vcore_node_usage_total{range="(30%,40%]"} 0
-yunikorn_scheduler_vcore_node_usage_total{range="(40%,50%]"} 0
-yunikorn_scheduler_vcore_node_usage_total{range="(50%,60%]"} 0
-yunikorn_scheduler_vcore_node_usage_total{range="(60%,70%]"} 0
-yunikorn_scheduler_vcore_node_usage_total{range="(70%,80%]"} 0
-yunikorn_scheduler_vcore_node_usage_total{range="(80%,90%]"} 0
-yunikorn_scheduler_vcore_node_usage_total{range="(90%,100%]"} 0
-yunikorn_scheduler_vcore_node_usage_total{range="[0,10%]"} 1
-*/
-```
+```json
+yunikorn_scheduler_vcore_node_usage_total
+yunikorn_scheduler_vcore_node_usage_total{range="(10%, 20%]"} 0
+yunikorn_scheduler_vcore_node_usage_total{range="(20%,30%]"} 0
+yunikorn_scheduler_vcore_node_usage_total{range="(30%,40%]"} 0
+yunikorn_scheduler_vcore_node_usage_total{range="(40%,50%]"} 0
+yunikorn_scheduler_vcore_node_usage_total{range="(50%,60%]"} 0
+yunikorn_scheduler_vcore_node_usage_total{range="(60%,70%]"} 0
+yunikorn_scheduler_vcore_node_usage_total{range="(70%,80%]"} 0
+yunikorn_scheduler_vcore_node_usage_total{range="(80%,90%]"} 0
+yunikorn_scheduler_vcore_node_usage_total{range="(90%,100%]"} 0
+yunikorn_scheduler_vcore_node_usage_total{range="[0,10%]"} 1
+```

</TabItem>
</Tabs>
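The `*_node_usage_total` samples above group values by a `range` label such as `(80%,90%]`. As a rough sketch, assuming each value is the number of nodes whose usage of that resource falls in the range (the diff itself does not spell this out), the distribution could be read into a dictionary and queried for heavily used nodes. The function names `usage_by_range` and `nodes_at_or_above` are hypothetical helpers introduced only for illustration.

```python
import re

# Sample lines copied from the vcore tab above (shortened).
SAMPLE = """\
yunikorn_scheduler_vcore_node_usage_total{range="(10%, 20%]"} 0
yunikorn_scheduler_vcore_node_usage_total{range="(80%,90%]"} 0
yunikorn_scheduler_vcore_node_usage_total{range="(90%,100%]"} 0
yunikorn_scheduler_vcore_node_usage_total{range="[0,10%]"} 1
"""

SAMPLE_RE = re.compile(
    r'^yunikorn_scheduler_(?P<resource>\w+)_node_usage_total'
    r'\{range="(?P<range>[^"]*)"\}\s+(?P<value>\S+)$'
)
LOWER_BOUND_RE = re.compile(r'^[\[(](\d+)%?,')


def usage_by_range(text):
    """Map resource name -> {range label -> count}, ignoring lines without labels."""
    result = {}
    for line in text.splitlines():
        match = SAMPLE_RE.match(line.strip())
        if match is None:
            continue
        # Normalise labels such as "(10%, 20%]" to "(10%,20%]".
        label = match.group("range").replace(" ", "")
        result.setdefault(match.group("resource"), {})[label] = int(float(match.group("value")))
    return result


def nodes_at_or_above(distribution, lower_percent):
    """Sum the counts of every range whose lower bound is >= lower_percent."""
    total = 0
    for label, count in distribution.items():
        bound = LOWER_BOUND_RE.match(label)
        if bound is not None and int(bound.group(1)) >= lower_percent:
            total += count
    return total


if __name__ == "__main__":
    vcore = usage_by_range(SAMPLE)["vcore"]
    print(vcore)
    print(nodes_at_or_above(vcore, 80))
```

Stripping the space inside the range label is a choice made for this sketch so that `(10%, 20%]` matches the formatting of the other ranges.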