v2.18.0
This release introduces two significant changes:
-
Improved internals responsible for reading content and statistics of Parquet files. The difference is especially noticeable in the case of
Stats
: it is faster and now you can also query for min and max of partition fields. -
Upgrades Parquet to 1.14.0. The biggest improvement is support for Hadoop's vectored IO, which you can optionally enable in
ParquetReader.Options
. It can significantly improve the performance of reading huge files.