-
Notifications
You must be signed in to change notification settings - Fork 129
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
update blaze version 2.0.8-SNAPSHOT (#386)
Co-authored-by: zhangli20 <[email protected]>
- Loading branch information
Showing
10 changed files
with
184 additions
and
172 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -1,20 +1,32 @@ | ||
# blaze-v2.0.7 | ||
# blaze-v2.0.8 | ||
|
||
## Features | ||
* Supports native BroadcastNestedLoopJoinExec. | ||
* Supports multithread UDF evaluation. | ||
* Supports spark.files.ignoreCorruptFiles. | ||
* Supports input batch statistics. | ||
|
||
* Enables nested complex data types by default. | ||
* Supports writing parquet table with dynamic partitions. | ||
* Supports partial aggregate skipping. | ||
* Enable first() aggregate function converting. | ||
* Add spill metrics. | ||
* | ||
## Performance | ||
* Improves get_json_object() performance by reducing duplicated json parsing. | ||
* Improves parquet reading performance by skipping utf-8 validation. | ||
* Supports cached expression evaluator in native AggExec. | ||
* Supports column pruning during native evaluation. | ||
* Prefer native sort even if child is non-native. | ||
* Implement batch updating/merging in aggregates. | ||
* Use slim box for storing bytes. | ||
* get_json_object use Cow to avoid copying. | ||
* Reduce the probability of unexpected off-heap memory overflows. | ||
* Introduce multiway merge sort to SortExec and SortRepartitioner. | ||
* SortExec removes redundant columns from batch. | ||
* Implement loser tree with inlined comparable traits. | ||
* Use unchecked index in LoserTree to get slightly performance improvement. | ||
* Remove BucketRepartitioner. | ||
* Reduce number of awaits in sort-merge join. | ||
* Pre-merge records in sorting mode if cardinality is low. | ||
* Use gxhash as default hasher in AggExec. | ||
* Optimize collect_set/collect_list function with SmallVec. | ||
* Implement async ipc reader. | ||
|
||
## Bugfix | ||
* Fix missing outputPartitioning in NativeParquetExec. | ||
* Fix missing native converting checks in parquet scan. | ||
* Fix inconsistency: implement spark-compatible float to int casting. | ||
* Avoid closing hadoop fs for reusing in cache. | ||
* Fix buggy GetArrayItem/GetMapValue native converter pattern matching. | ||
* Fix parquet pruning with NaN values. | ||
* Fix map type conversion with incorrect nullable value. | ||
* Fix ffi-export error in some cases. | ||
* Fix incorrect behavior of get_index_field with incorrect number of rows. | ||
* Fix task hanging in some cases with ffi-export. |
This file was deleted.
Oops, something went wrong.
Oops, something went wrong.