Polars 1.3.0 throws parquet errors when reading (I think) categorical columns in settings that 1.1.0 does not. #17931
Labels
A-io-parquet
Area: reading/writing Parquet files
accepted
Ready for implementation
bug
Something isn't working
P-medium
Priority: medium
python
Related to Python Polars
Checks
Reproducible example
I don't have a MWE at the moment, but wanted to post this in the interim so others can find it.
I have a confirmed test case in a public repo I'm working on that I'll implement below where running code with polars 1.1.0 works fine and with 1.3.0 raises this error:
polars.exceptions.ComputeError: parquet: Not yet supported: Dictionary array without a dictionary page
The test case is this test file in this repo: https://github.com/mmcdermott/MEDS_transforms/blob/397916bc4941c270b205e8fbf5307aa407d5b56b/tests/test_extract.py
Here is an example of the test failing due to polars 1.3.0: https://github.com/mmcdermott/MEDS_transforms/actions/runs/10146769727/job/28055619925?pr=31
Log output
No response
Issue description
I suspect this has to due with categorical columns but I'm not sure.
Expected behavior
It should work the same way on 1.3.0 as 1.1.0
Installed versions
The text was updated successfully, but these errors were encountered: