Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Plot histogram of outlier scores #55

Open
hannah-rae opened this issue Aug 3, 2021 · 2 comments
Open

Plot histogram of outlier scores #55

hannah-rae opened this issue Aug 3, 2021 · 2 comments
Assignees

Comments

@hannah-rae
Copy link
Contributor

It would be helpful for deciding what threshold to use for reviewing outliers to see a distribution of the outlier scores for each algorithm. This module would generate a histogram of outlier scores.

As a side note, #24 specifies the number of outliers to record in the subset. It could also be useful to specify a threshold, though this would be algorithm-dependent (unless the threshold was based on the data, e.g., 2 std deviations or similar).

@hannah-rae
Copy link
Contributor Author

@vinr515 I think we can close this now, but it would be good for some others to try out the functionality before closing to make sure it works across the use cases.

@hannah-rae
Copy link
Contributor Author

@vinr515 I tried running this and it works great with no parameters but I get the following error when I use the bins parameter:
TypeError: _run() got multiple values for argument 'bins'

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

2 participants