Remove outdated study #157

Merged
merged 1 commit into from
Jan 5, 2024

Conversation

AlexanderSaydakov
Contributor

This study is no longer relevant. The t-Digest code has improved since then, and comparing it with KLL was not the best idea. We now have the REQ sketch, which seems to be a better candidate for comparison.

@AlexanderSaydakov AlexanderSaydakov merged commit 028b0af into master Jan 5, 2024
4 checks passed
@AlexanderSaydakov AlexanderSaydakov deleted the remove_outdated_study branch January 5, 2024 20:29
@PavelVesely
Contributor

As for comparing t-digest with KLL or ReqSketch, we have a paper with Graham and people from Splunk. I think t-digest has not been updated much since then, so you can point to this paper for the comparison.

The upshot is that while t-digest is more accurate on well-behaved distributions, there are examples and distributions where it fails spectacularly (though these inputs are somewhat artificial). However, KLL and ReqSketch have much better update time than t-digest.

@AlexanderSaydakov
Contributor Author

@PavelVesely you may want to comment here:
apache/datasketches-cpp#416


3 participants