Data input format? #31

Open
G-kodes opened this issue Feb 17, 2020 · 6 comments

Comments

@G-kodes

G-kodes commented Feb 17, 2020

Could you please document in your readme.md how incoming data should be structured? I can't find anywhere what format my DataFrame needs to be in to render a graph. The only workaround is to grab and unpickle your test data, which defeats the point of the readme.md instructions.

@MahmoudAbdelRahman

The same issue here, please.
Thank you

@MahmoudAbdelRahman

@sgtporkchops,
I think this resource might be useful:
http://data.caleydo.org/papers/2014_infovis_upset.pdf

@monika0603

Does the package only take a pickle file as input? How is a pickle file created from the MovieLens dataset?

@macho9099

It looks like the input must be a dictionary of pandas DataFrames. However, there are some issues with the source code, because methods like .ix are deprecated.
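As a rough illustration of the two points above (a dictionary of DataFrames stored as a pickle, and the replacement for .ix), here is a minimal sketch. The set names, column names, and values are invented for the example and are not taken from the package or its test data; whether the package accepts exactly this shape is an assumption based on the comment above.

```python
import pickle

import pandas as pd

# Hypothetical input: one DataFrame per set, keyed by the set name.
# The keys and columns below are made up for illustration only.
data = {
    "action": pd.DataFrame({"title": ["A", "B", "C"], "rating": [4.0, 3.5, 5.0]}),
    "comedy": pd.DataFrame({"title": ["B", "C", "D"], "rating": [3.5, 5.0, 2.0]}),
}

# Round-trip through pickle in memory. With a file you would use
# pickle.dump(data, fh) and pickle.load(fh) on a handle opened in binary mode.
payload = pickle.dumps(data)
restored = pickle.loads(payload)

# .ix was deprecated in pandas 0.20 and removed in pandas 1.0.
# Use .loc for label-based access and .iloc for position-based access.
first_by_label = restored["action"].loc[0, "title"]
first_by_position = restored["action"].iloc[0, 0]
```

Code that still calls .ix will raise an AttributeError on pandas 1.0 or later, so each call has to be rewritten as .loc or .iloc depending on whether it indexed by label or by position.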

@G-kodes
Author

G-kodes commented Nov 16, 2021

From what I have been able to tell, I first needed to convert my column data to boolean form using .astype(bool), and then reshape it into a single-column, multi-index count form using .groupby(["Column1", "column2", ...]).count().

This made it LOOK like their data and what the docs describe, but the package still complained for me. What I then discovered is that the package specifically wants a Series and not a DataFrame. I personally used .iloc[:, 0], which returns the first column as a Series (note that plain .iloc[0] returns the first row instead), but I believe DataFrame.squeeze() does the same thing for a single-column frame and exists for exactly this operation (see the pandas documentation).

Either way, I am not closing this issue as it is in fact a request for better documentation to explain this aspect of the package.
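The reshaping described in this comment can be sketched with pandas alone. This is only an illustration under assumptions: the column names, the "item" column, and the values are hypothetical, and the sketch stops at producing the Series without calling the plotting package itself.

```python
import pandas as pd

# Hypothetical membership table: one row per item, one 0/1 column per set.
df = pd.DataFrame(
    {
        "item": ["a", "b", "c", "d", "e"],
        "Column1": [1, 1, 0, 1, 0],
        "Column2": [0, 1, 1, 1, 0],
    }
)
set_cols = ["Column1", "Column2"]

# Step 1: convert the membership flags to booleans.
flags = df.astype({col: bool for col in set_cols})

# Step 2: count rows per combination of flags. The result is a DataFrame
# with a boolean MultiIndex and one remaining column ("item").
counted = flags.groupby(set_cols).count()

# Step 3: reduce the single-column DataFrame to a Series.
series = counted.squeeze("columns")

# Equivalent alternative: select the first column by position.
same_series = counted.iloc[:, 0]
```

Note that .count() needs at least one non-grouping column to count; if every column is a set flag, .groupby(set_cols).size() returns the Series directly.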

@rLannes

rLannes commented Sep 27, 2022

That is too bad. I won't spend my time figuring out what the data format should be. This issue has been open for 2 years. It is unfortunate, as this looked nice. I am back to R.
