Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Successfully started workspaces ratio #12852

Merged
merged 6 commits into from
Mar 12, 2019
Merged

Successfully started workspaces ratio #12852

merged 6 commits into from
Mar 12, 2019

Conversation

skabashnyuk
Copy link
Contributor

@skabashnyuk skabashnyuk commented Mar 10, 2019

What does this PR do?

The idea behind the scene is to provide some metric for end users/admins that will show how successful Che master is able to start workspaces in some particular point of time. This metric should be related to the total number of workspace what was starting in that period of time.

Add Prometheus metrics:

# HELP che_workspace_starting_attempts_total The count of workspaces starting attempts
# TYPE che_workspace_starting_attempts_total counter
che_workspace_starting_attempts_total 4.0
# HELP che_workspace_started_total The count of started workspaces
# TYPE che_workspace_started_total counter
che_workspace_started_total 3.0

And Grafana panel
Знімок екрана  о 13 56 31

What issues does this PR fix or reference?

Fixes #12723

Release Notes

n/a

Docs PR

n/a

@skabashnyuk skabashnyuk marked this pull request as ready for review March 10, 2019 13:52
@skabashnyuk skabashnyuk requested a review from a user March 10, 2019 13:52
Copy link
Member

@sleshchenko sleshchenko left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM

@skabashnyuk
Copy link
Contributor Author

ci-test

Copy link

@garagatyi garagatyi left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Would be very helpful to explain in the PR description not only what is done but actually why it is done. It is not obvious that counters needed for the evaluation of rates. Adding this info into a PR would simplify PR review and you would get less questions and faster reviews. Thanks!

@skabashnyuk
Copy link
Contributor Author

skabashnyuk commented Mar 11, 2019

@metlos you were right with zero division and NaN values. I've changed a bit a concept of a graph.
Now it's two metrics

  1. Success - Success/(Success+Fail)
  2. Fail - Fail/(Success+Fail)

Also, there are some changes in the graph.

  • Values are stacked.
  • Nan interpreted as zero.

Знімок екрана  о 13 51 20

@che-bot
Copy link
Contributor

che-bot commented Mar 11, 2019

Results of automated E2E tests of Eclipse Che Multiuser on OCP:
Build details
Test report
docker image: eclipseche/che-server:12852
https://github.com/orgs/eclipse/teams/eclipse-che-qa please check this report.

@skabashnyuk
Copy link
Contributor Author

ci-test

@metlos
Copy link
Contributor

metlos commented Mar 11, 2019

@metlos you were right with zero division and NaN values. I've changed a bit a concept of a graph.
Now it's two metrics

1. Success - Success/(Success+Fail)

2. Fail - Fail/(Success+Fail)

The only question I have is about the new che_workspace_starting_attempts_total. Do I understand it correctly that it should be equal to the Success+Fail in your above description?

@che-bot
Copy link
Contributor

che-bot commented Mar 11, 2019

Results of automated E2E tests of Eclipse Che Multiuser on OCP:
Build details
Test report
docker image: eclipseche/che-server:12852
https://github.com/orgs/eclipse/teams/eclipse-che-qa please check this report.

@ibuziuk
Copy link
Member

ibuziuk commented Mar 11, 2019

@skabashnyuk great job!
@fche @l0rd @slemeur probably a great candidate for one of two metrics exposed to statuspage redhat-developer/rh-che#1224

@slemeur
Copy link
Contributor

slemeur commented Mar 12, 2019

Are there any documentation planed for that PR?

@skabashnyuk
Copy link
Contributor Author

ci-test

@skabashnyuk
Copy link
Contributor Author

@metlos

The only question I have is about the new che_workspace_starting_attempts_total. Do I understand it correctly that it should be equal to the Success+Fail in your above description?

Not exactly. che_workspace_starting_attempts_total=Success+Fail+Starting

@slemeur

Are there any documentation planned for that PR?

I was planning to do that as part of this #12543 issue

@che-bot
Copy link
Contributor

che-bot commented Mar 12, 2019

Results of automated E2E tests of Eclipse Che Multiuser on OCP:
Build details
Test report
docker image: eclipseche/che-server:12852
https://github.com/orgs/eclipse/teams/eclipse-che-qa please check this report.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

[Metric] Provide a successfully started workspaces ratio
8 participants