Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Missing data from first META page #53

Open
tk3369 opened this issue May 18, 2019 · 4 comments
Open

Missing data from first META page #53

tk3369 opened this issue May 18, 2019 · 4 comments
Labels

Comments

@tk3369
Copy link
Owner

tk3369 commented May 18, 2019

It seems that data residing the in the first META page is missing. I guess it might be introduced in the last major refactoring.

Examples:

data_pandas/test2.sas7bdat

julia> readsas("data_pandas/test2.sas7bdat")
Read data_pandas/test2.sas7bdat with size 10 x 100 in 0.00088 seconds
SASLib.ResultSet (10 rows x 100 columns)
Columns 1:Column1, 2:Column2, 3:Column3, 4:Column4, 5:Column5, 6:Column6, 7:Column7, 8:Column8, 9:Column9, 10:Column10 …
1: 0.0, , 0.0, 1960-01-01, 0.0, , 0.0, 0.0, 0.0, 
2: 0.0, , 0.0, 1960-01-01, 0.0, , 0.0, 0.0, 0.0, 
3: 0.0, , 0.0, 1960-01-01, 0.0, , 0.0, 0.0, 0.0, 
4: 0.0, , 0.0, 1960-01-01, 0.0, , 0.0, 0.0, 0.0, 
5: 0.0, , 0.0, 1960-01-01, 0.0, , 0.0, 0.0, 0.0, 

data_AHS2013/omov.sas7bdat

The first 103 records are missing as compared with results from ReadStat.

@tk3369 tk3369 added the bug label Jan 2, 2020
@pmbaumgartner
Copy link

I've encountered this bug recently. I have a dataset (that I unfortunately can't share) where it skips the first 48 rows. What ends up happening is it concatenates these "empty" rows at the bottom of the dataset - e.g. I see something like the above with 0.0 or blank values.

@xiaodaigh
Copy link
Contributor

xiaodaigh commented Jan 7, 2021 via email

@pmbaumgartner
Copy link

I'll try and generate something that replicates this. I think it has something to do with the size of the dataset: I've got 1800 columns and that seems to upset whatever I throw at this.

@xiaodaigh
Copy link
Contributor

've got 1800 columns

If you can generate a synthetic one that fails I can log the file here too for other to test https://github.com/xiaodaigh/sas7bdat-resources

The hardest thing about SAS is to get sample files.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
None yet
Development

No branches or pull requests

3 participants