[WIP] Descriptive statistics #167

Open
wants to merge 1 commit into
base: master

andrusha

Hackday project in need of feedback.

When working with data, you're interested in:

  • Shape of data, which could be your schema
  • General case, which is addressed by bigSampler
  • Edge cases, which this PR tries to tackle

It's inspired by the summary function from R.
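
For readers unfamiliar with R: for a numeric vector, summary reports the minimum, first quartile, median, mean, third quartile, and maximum. A minimal sketch of that kind of output follows. It is written in Python purely for illustration and is not the code in this PR; the name `summarize` and the dictionary keys are hypothetical.

```python
from statistics import mean, quantiles


def summarize(values):
    """Return an R-summary-like dict for a list of numbers (hypothetical helper)."""
    data = sorted(values)
    if len(data) < 2:
        # statistics.quantiles needs at least two points on older Python versions
        raise ValueError("need at least two values")
    # method="inclusive" interpolates linearly between order statistics,
    # treating the data as including its own min and max
    q1, median, q3 = quantiles(data, n=4, method="inclusive")
    return {
        "min": data[0],
        "q1": q1,
        "median": median,
        "mean": mean(data),
        "q3": q3,
        "max": data[-1],
    }


if __name__ == "__main__":
    # A single outlier shows up in max and mean but barely moves the quartiles,
    # which is the sort of edge case a summary is meant to surface.
    print(summarize([1, 2, 3, 4, 100]))
```

Comparing the mean against the median, and the maximum against the third quartile, is often enough to spot a suspicious value without scanning the whole dataset.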

Todo:

  • Property and unit tests
  • Support booleans
  • Support floating point numbers
  • Support for different formats (protobuf)?

@idreeskhan idreeskhan self-requested a review September 3, 2019 21:04
@idreeskhan
Contributor

Hey, thanks for taking the initiative! We have some internal stuff that overlaps a bit. I've been thinking a lot about the future of that, and it might be good to make sure we're on the same page.

@andrusha
Author

@idreeskhan may I ask, have the data profiling tools been open-sourced since then?

@idreeskhan
Contributor

Sorry, this comment got lost in email while I was on vacation back in June. They have not been open-sourced, and we are hesitant to merge this in. Internally, the data profiling tools fit our needs, and merging this would mean taking over its maintenance, which we don't really want to do at the moment.

2 participants