* Add docs about SLO burn rate alert details page
* Fix ID
* Apply suggestions from mdbirnstiehl

(cherry picked from commit 6231d4a)

Co-authored-by: DeDe Morton <[email protected]>
1 parent d96b644, commit f5d5b76
Showing 11 changed files with 64 additions and 9 deletions.
Binary file not shown.
Binary file not shown.
39 changes: 39 additions & 0 deletions
docs/en/observability/triage-slo-burn-rate-breaches.asciidoc
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,39 @@ | ||
[[triage-slo-burn-rate-breaches]] | ||
= Triage SLO burn rate breaches | ||
++++ | ||
<titleabbrev>SLO burn rate breaches</titleabbrev> | ||
++++ | ||
|
||
SLO burn rate breaches occur when the percentage of bad events over a specified time period exceeds the threshold set in your <<slo-burn-rate-alert,SLO burn rate rule>>. | ||
When this happens, you are at risk of exhausting your error budget and violating your SLO. | ||
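As a rough illustration of the arithmetic behind a burn rate threshold, the sketch below assumes the common definition of burn rate as the observed error rate divided by the error budget (1 minus the SLO target). The function names `burn_rate` and `is_breach` are hypothetical and are not part of any Elastic API; the actual rule evaluation may differ.

```python
def burn_rate(bad_events: int, total_events: int, slo_target: float) -> float:
    """Return how fast the error budget is consumed over a lookback window.

    Assumption: burn rate = observed error rate / error budget, where the
    error budget is (1 - slo_target). A value of 1.0 means the budget would
    be used up exactly at the end of the SLO time window.
    """
    if not 0.0 < slo_target < 1.0:
        raise ValueError("slo_target must be between 0 and 1 (exclusive)")
    if total_events <= 0:
        return 0.0  # no traffic in the window, so no budget was burned
    error_rate = bad_events / total_events
    error_budget = 1.0 - slo_target
    return error_rate / error_budget


def is_breach(bad_events: int, total_events: int,
              slo_target: float, threshold: float) -> bool:
    """Hypothetical check: does the window's burn rate exceed the threshold?"""
    return burn_rate(bad_events, total_events, slo_target) > threshold


# Example: 200 bad events out of 1,000 against a 99% target
# burns budget roughly 20 times faster than the sustainable rate.
print(round(burn_rate(200, 1000, 0.99), 2))
print(is_breach(200, 1000, 0.99, threshold=14.4))
```

Under this definition, a burn rate well above 1 sustained over the lookback window is what puts the error budget at risk.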
|
||
To triage issues quickly, go to the alert details page: | ||
|
||
. Go to **{observability}** -> **Alerts** (or open the SLO and click **Alerts**). | ||
. From the Alerts table, click the image:images/icons/boxesHorizontal.svg[More actions icon] icon next to the alert and select **View alert details**. | ||
|
||
The alert details page shows information about the alert, including when the alert was triggered, | ||
the duration of the alert, the source SLO, and the rule that triggered the alert. | ||
You can follow the links to navigate to the source SLO or rule definition. | ||
|
||
Explore charts on the page to learn more about the SLO breach: | ||
|
||
[role="screenshot"] | ||
image::images/slo-burn-rate-breach.png[Alert details for SLO burn rate breach] | ||
|
||
* The first chart shows the burn rate during the time range when the alert was active. | ||
The line indicates how close the SLO came to breaching the threshold. | ||
* The next chart shows the alert history over the last 30 days. | ||
It shows the number of alerts that were triggered and the average time it took to recover after a breach. | ||
* Both timelines are annotated to show when the threshold was breached. | ||
You can hover over an alert icon to see the timestamp of the alert. | ||
|
||
The number, duration, and frequency of these breaches over time give you an indication of how severely the service is degrading, so that you can focus on high-severity issues first. | ||
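The two figures in the alert history chart, the number of alerts and the average time to recover, can be sketched as follows. This is a minimal illustration over made-up data: the `Alert` type and `summarize_history` function are hypothetical and do not reflect how the alert details page computes its values.

```python
from datetime import datetime, timedelta
from typing import List, NamedTuple, Optional, Tuple


class Alert(NamedTuple):
    """Hypothetical record of one burn rate alert."""
    started: datetime
    recovered: Optional[datetime]  # None while the alert is still active


def summarize_history(alerts: List[Alert]) -> Tuple[int, Optional[timedelta]]:
    """Return (number of alerts, average time to recover).

    Only recovered alerts contribute to the average; the average is None
    when no alert has recovered yet.
    """
    durations = [a.recovered - a.started for a in alerts if a.recovered is not None]
    if not durations:
        return len(alerts), None
    average = sum(durations, timedelta()) / len(durations)
    return len(alerts), average


# Illustrative data only.
history = [
    Alert(datetime(2024, 1, 1, 10, 0), datetime(2024, 1, 1, 10, 30)),
    Alert(datetime(2024, 1, 5, 8, 0), datetime(2024, 1, 5, 9, 30)),
    Alert(datetime(2024, 1, 9, 12, 0), None),  # still active
]
count, avg_recovery = summarize_history(history)
print(count, avg_recovery)
```

A rising alert count or a growing average recovery time over the 30-day window suggests the degradation is getting worse rather than better.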
|
||
NOTE: The contents of the alert details page may vary depending on the type of SLI that's defined in the SLO. | ||
|
||
After investigating the alert, you may want to: | ||
|
||
* Click **Snooze the rule** to snooze notifications for a specific time period or indefinitely. | ||
* Click the image:images/icons/boxesVertical.svg[Actions] icon and select **Add to case** to add the alert to a new or existing case. To learn more, refer to <<create-cases>>. | ||
* Click the image:images/icons/boxesVertical.svg[Actions] icon and select **Mark as untracked**. |