[CORE-8569][CH] Support DeltaOptimizedWriterTransformer #8570
Thanks for opening a pull request! Could you open an issue for this pull request on GitHub Issues? https://github.com/apache/incubator-gluten/issues Then could you also rename the commit message and pull request title in the following format?
Run Gluten ClickHouse CI on ARM
Is this class related to this PR?
@transient private var cachedShuffleRDD: ShuffledColumnarBatchRDD = _

@transient override def outputPartitioning: Partitioning = {
  val resolver = org.apache.spark.sql.catalyst.analysis.caseInsensitiveResolution
Lines 109 ~ 116 duplicate lines 124 ~ 131.
@@ -270,15 +270,15 @@ class ClickhouseOptimisticTransaction(
// TODO: DeltaOptimizedWriterExec
Remove the TODO.
!isOptimize &&
shouldOptimizeWrite(writeOptions, spark.sessionState.conf)
) {
DeltaOptimizedWriterExec(checkInvariants, metadata.partitionColumns, deltaLog)
I think it is better not to change unrelated code.
else {
committer.addedStatuses
})
committer.addedStatuses.map {
I think it is better not to change unrelated code.
What changes were proposed in this pull request?
In Spark 3.5, Delta writes support optimized write. This PR offloads DeltaOptimizedWriterExec to Gluten (Fixes: #8569).
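For readers unfamiliar with the gating condition quoted in the review (`!isOptimize && shouldOptimizeWrite(...)`), here is a rough, self-contained sketch of the idea. All types and names below (`Plan`, `Scan`, `OptimizedWriter`, `planWrite`, `optimizeWriteEnabled`) are hypothetical stand-ins for illustration, not the actual Spark, Delta, or Gluten classes, and the real change in this PR may differ.

```scala
// Stand-in plan nodes; these are NOT the real Spark, Delta, or Gluten classes.
sealed trait Plan
case class Scan(table: String) extends Plan
case class OptimizedWriter(child: Plan, partitionColumns: Seq[String]) extends Plan

object OptimizeWriteSketch {
  // Hypothetical gate mirroring the condition quoted in the review:
  // wrap the child plan only for ordinary writes (not OPTIMIZE jobs)
  // and only when optimized write is enabled.
  def planWrite(
      child: Plan,
      isOptimize: Boolean,
      optimizeWriteEnabled: Boolean,
      partitionColumns: Seq[String]): Plan =
    if (!isOptimize && optimizeWriteEnabled) OptimizedWriter(child, partitionColumns)
    else child

  def main(args: Array[String]): Unit = {
    val scan = Scan("events")
    // Ordinary write with the feature on: the plan gets wrapped.
    println(planWrite(scan, isOptimize = false, optimizeWriteEnabled = true, Seq("date")))
    // OPTIMIZE job: the plan is left unchanged.
    println(planWrite(scan, isOptimize = true, optimizeWriteEnabled = true, Seq("date")))
  }
}
```

In this sketch the wrapper node is where an offloaded (native) implementation would be substituted, assuming the planner swaps the node for a transformer equivalent.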
How was this patch tested?
Tested by unit tests (UT).