Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

De-duplicate decompression in CSV/NDJSON scans #17908

Open
nameexhaustion opened this issue Jul 27, 2024 · 0 comments
Open

De-duplicate decompression in CSV/NDJSON scans #17908

nameexhaustion opened this issue Jul 27, 2024 · 0 comments
Labels
A-io-csv Area: reading/writing CSV files A-io-json Area: reading/writing JSON files accepted Ready for implementation enhancement New feature or an improvement of an existing feature P-goal Priority: aligns with long-term Polars goals performance Performance issues or improvements

Comments

@nameexhaustion
Copy link
Collaborator

Description

ref #17841 (comment)

Decompression happens during schema inference and then again during actual reads. There is room for improvement here but it will not be trivial.

@nameexhaustion nameexhaustion added enhancement New feature or an improvement of an existing feature performance Performance issues or improvements accepted Ready for implementation A-io-csv Area: reading/writing CSV files A-io-json Area: reading/writing JSON files P-goal Priority: aligns with long-term Polars goals labels Jul 27, 2024
@github-project-automation github-project-automation bot moved this to Ready in Backlog Jul 27, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
A-io-csv Area: reading/writing CSV files A-io-json Area: reading/writing JSON files accepted Ready for implementation enhancement New feature or an improvement of an existing feature P-goal Priority: aligns with long-term Polars goals performance Performance issues or improvements
Projects
Status: Ready
Development

No branches or pull requests

1 participant