Supports Spark DataFrame? #59

Closed
tk3369 opened this issue May 2, 2018 · 1 comment
Comments

tk3369 commented May 2, 2018

I read through the documentation, and it refers to RDDs. Current versions of Spark also have the concepts of DataFrame and Dataset. Does this package support those? Perhaps this is related to #58?

Owner

dfdx commented May 2, 2018

DataFrames/Datasets are described in the SQL Interface section of the documentation. That section isn't complete, though, so it's best to consult the source code. In short, various methods for reading and writing data are supported, as well as the general-purpose select, group_by, and sql functions and a few others.

There's a lot of missing functionality, of course, so if you need something specific that isn't there, please open an issue for it.

dfdx closed this as completed Oct 7, 2018