Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

4.15 automation [OCS-5576]: Fail prometheus and verify that the alert still exists after recovering it. #10849

Open
wants to merge 1 commit into
base: master
Choose a base branch
from

Conversation

nagendra202
Copy link
Contributor

@nagendra202 nagendra202 commented Nov 13, 2024

Automated below test case:

RHSTOR-4798 Fail prometheus and verify that the alert still exists after recovering it.

  1. Deploy cluster and run workloads.
  2. Run IO pods to fill MDS memory to 95%.
  3. Now, the alert is triggered to increase memory limit. Don't take any action on the alert.
  4. Fail Prometheus, and recover it.
  5. Make sure that the alert is still firing after recovering Prometheus.

@nagendra202 nagendra202 self-assigned this Nov 13, 2024
@nagendra202 nagendra202 requested a review from a team as a code owner November 13, 2024 13:13
@pull-request-size pull-request-size bot added the size/S PR that changes 10-29 lines label Nov 13, 2024
Copy link

openshift-ci bot commented Nov 13, 2024

[APPROVALNOTIFIER] This PR is NOT APPROVED

This pull-request has been approved by: nagendra202

The full list of commands accepted by this bot can be found here.

Needs approval from an approver in each of these files:

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

Copy link

@ocs-ci ocs-ci left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

PR validation on existing cluster

Cluster Name: nagreddy-n11-2
Cluster Configuration:
PR Test Suite: tier2
PR Test Path: tests/functional/monitoring/prometheus/alerts/test_alert_mds_cache_high_usage.py
Additional Test Params:
OCP VERSION: 4.17
OCS VERSION: 4.17
tested against branch: master

Job PASSED.

@nagendra202 nagendra202 added Verified Mark when PR was verified and log provided Automation Backlog PR's wrt ABC labels Nov 14, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Automation Backlog PR's wrt ABC size/S PR that changes 10-29 lines Verified Mark when PR was verified and log provided
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants