Skip to content

v2.18.0

Compare
Choose a tag to compare
@mjakubowski84 mjakubowski84 released this 19 May 18:35
· 22 commits to master since this release

This release introduces two significant changes:

  1. Improved internals responsible for reading content and statistics of Parquet files. The difference is especially noticeable in the case of Stats: it is faster and now you can also query for min and max of partition fields.

  2. Upgrades Parquet to 1.14.0. The biggest improvement is support for Hadoop's vectored IO, which you can optionally enable in ParquetReader.Options. It can significantly improve the performance of reading huge files.