Skip to content

High CPU utlization with all query operators/stages GPU based. #11963

Answered by revans2
MaxNevermind asked this question in Q&A
Discussion options

You must be logged in to vote

I first wanted to verify that I got similar results, because I was running in local mode with 12 CPU cores and 1 GPU instead of you using 4 GPUs with 12 CPU cores each. It is just a lot simple to profile thing in local mode.

The query has three stages. The first stage reads in the parquet data and does a partial aggregation to drop the duplicates. Te second stage finishes the deduplication and repartitions the data so that the window operation can happen. The last stage will sort the data, do the window operation, and write the results out.

For the first stage about 9 CPU cores were fully utilized the entire time. For the second stage I saw about 10 CPU cores being fully utilized. The fin…

Replies: 5 comments 8 replies

Comment options

You must be logged in to vote
0 replies
Comment options

You must be logged in to vote
2 replies
@revans2
Comment options

@revans2
Comment options

Comment options

You must be logged in to vote
0 replies
Comment options

You must be logged in to vote
1 reply
@MaxNevermind
Comment options

Answer selected by MaxNevermind
Comment options

You must be logged in to vote
5 replies
@MaxNevermind
Comment options

@revans2
Comment options

@revans2
Comment options

@MaxNevermind
Comment options

@sameerz
Comment options

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
Q&A
Labels
None yet
3 participants