first pass at exploring infra flakes #448
base: master
[APPROVALNOTIFIER] This PR is NOT APPROVED. It needs approval from an approver for each of the files it changes.
d99c734
to
2a16d7e
Compare
@@ -0,0 +1,973 @@ | |||
{ |
Are we sure that the x-axis here is in days? IIRC they were successive builds. I could be wrong here, so we should double-check.
No, you're right. I just copy-pasted this graph from somewhere else.
Can we have a count of the NA values? If it's 1% it wouldn't be a problem; if it's 99% we'd have to think this through.
Should the NA value counting go before the array is fillna'ed with zeros?
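A minimal sketch of counting before filling, assuming the grid lives in a pandas DataFrame. The frame `grid`, its column names, and its values are made up for illustration and are not taken from the notebook:

```python
import pandas as pd

# Hypothetical stand-in for the testgrid data: rows are tests, columns are builds.
# 1.0 = failed, 0.0 = passed, NaN = no result recorded for that test in that build.
grid = pd.DataFrame(
    {
        "build_1": [1.0, 0.0, float("nan")],
        "build_2": [float("nan"), 0.0, float("nan")],
        "build_3": [0.0, 1.0, 0.0],
    },
    index=["test_a", "test_b", "test_c"],
)

# Count missing cells BEFORE filling: after fillna(0) they are
# indistinguishable from passes.
na_count = int(grid.isna().sum().sum())
na_fraction = na_count / grid.size
print(f"{na_count} of {grid.size} cells are NA ({na_fraction:.1%})")

# Only now replace the missing values with zeros.
filled = grid.fillna(0)
```

Doing the count first keeps the NA share visible, so it is easy to tell whether it is closer to 1% or to 99%.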
Change the last line to better explain the example. "So if a column has 2 tests that failed, and the first had a row score of 2/5...."
Could you please describe the intuition behind the approach in a bit more detail? E.g. what does the sum of entropies signify? Why are we looking at entropies of row scores for failed tests?
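To make the question concrete, here is one possible reading of the scoring, written as a sketch. Everything in it is an assumption rather than the notebook's actual code: the row score is taken to be the fraction of builds in which a test failed, "entropy" is read as the surprisal -log2(p) of that row score, and the names `row_scores` and `column_score` are hypothetical. Under this reading, a failure of a test that rarely fails contributes a lot to its column, and a failure of a test that fails most of the time contributes little.

```python
import math


def row_scores(grid):
    """Fraction of builds in which each test (row) failed."""
    return [sum(row) / len(row) for row in grid]


def column_score(grid, col):
    """Sum of surprisals -log2(p) over the tests that failed in column `col`.

    p is the row score of the failing test. A test that failed in this
    column has p > 0, so the logarithm is always defined.
    """
    total = 0.0
    for row, p in zip(grid, row_scores(grid)):
        if row[col] == 1:
            total += -math.log2(p)
    return total


# Toy grid (made up): rows are tests, columns are builds, 1 = failed.
grid = [
    [1, 0, 0, 1, 0],  # row score 2/5
    [1, 1, 1, 1, 0],  # row score 4/5
    [0, 0, 0, 0, 0],  # row score 0, never fails
]

for col in range(len(grid[0])):
    print(col, round(column_score(grid, col), 3))
```

In the toy grid, the first column has two failing tests with row scores 2/5 and 4/5, so its score is -log2(2/5) - log2(4/5). If the notebook uses a different definition, the explanation in the markdown cell should say which one.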
If these outputs are meant to function as a "progress bar", I'd suggest just using tqdm instead
My tqdm has never worked on the MOC jupyterhub ):
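If tqdm is not an option in that environment, a standard-library fallback could look like the sketch below. The helper name `progress` and its parameters are hypothetical. It rewrites a single status line instead of printing one line per iteration:

```python
import sys


def progress(iterable, total=None, every=1, stream=sys.stderr):
    """Yield items from `iterable`, rewriting one status line as it goes."""
    if total is None:
        try:
            total = len(iterable)
        except TypeError:
            total = None  # generators and other unsized iterables
    count = 0
    for count, item in enumerate(iterable, start=1):
        if count % every == 0:
            suffix = f"/{total}" if total is not None else ""
            # "\r" returns to the start of the line so the text is overwritten.
            stream.write(f"\rprocessed {count}{suffix}")
            stream.flush()
        yield item
    if count:
        stream.write("\n")
        stream.flush()


# Usage: wrap the loop's iterable.
squares = [n * n for n in progress(range(1000), every=100)]
```

How the carriage return renders depends on the notebook front end, so this is worth checking on the hub before relying on it.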
How are you bringing the score into the 0-1 range? What does 0 mean and what does 1 mean?
Can you print some examples of jobs that have a high unexpected failure rate according to this method? And some that are expected failures?
Quite a creative method @antter 👏
Left some comments on the notebook.
Related Issues and Dependencies
#447
This introduces a breaking change
This Pull Request implements
Infra flakes are likely not reliably detectable by looking at grid data alone; more analysis will be needed with different data, e.g. the output logs. This notebook tries to detect anomalous columns in the testgrid data using only very simple probability calculations.