
[Enhancement] optimize the performance for topn with large offset #55886

Merged: stdpain merged 1 commit into StarRocks:main from opt_topn_with_large_limit on Feb 27, 2025

Conversation

stdpain (Contributor) commented Feb 13, 2025

Why I'm doing:

SSB100G, dop=4, 1 BE

select lo_shipmode from lineorder order by lo_shipmode limit 50000000, 400

baseline: 1m42s, patched: 28s339ms

The reason baseline performance is so low:
1. A merge operation runs every 256 buffered chunks, but the total input is very large, so far too many merge operations are performed.

This PR adds max_buffer_size, derived from (offset + limit) / chunk_size, to reduce the frequency of merge operations, at the cost of potentially using more memory. It additionally optimizes memory usage when merging chunks, which reduces the peak memory of the merge.
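
To make the buffering policy concrete, here is a minimal, hypothetical sketch of how such a cap could be derived. The names (compute_max_buffer_size, kDefaultBufferedChunks) and the exact formula are illustrative assumptions, not the actual StarRocks code.

```cpp
#include <algorithm>
#include <cstddef>

// Baseline behavior: a merge pass runs every 256 buffered chunks.
constexpr size_t kDefaultBufferedChunks = 256;

// Hypothetical cap: a deep offset (e.g. LIMIT 50000000, 400) must carry the
// first offset + limit rows through every merge pass, so buffering roughly
// (offset + limit) / chunk_size chunks lets a single pass absorb far more
// input before merging.
size_t compute_max_buffer_size(size_t offset, size_t limit, size_t chunk_size) {
    size_t rows_needed = offset + limit;
    size_t chunks_needed = (rows_needed + chunk_size - 1) / chunk_size; // ceiling division
    return std::max(kDefaultBufferedChunks, chunks_needed); // never below the baseline
}
```

With chunk_size = 4096 and the query above, this sketch would buffer roughly 12,208 chunks per merge pass instead of 256, cutting the number of merge passes by about 48x, which is consistent in direction with the reported drop from 1m42s to 28s339ms.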

What I'm doing:

  1. Change max_buffered_size to chunk_size/4096 when the limit is greater than 65535.
  2. Avoid large permutations that take up too much memory.
  3. Reduce memory when merging large chunks (see the sketch after this list).
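
For points 2 and 3, the sketch below illustrates only the memory pattern, under stated assumptions: merged output is emitted in fixed-size slices and fully drained inputs are freed immediately, so no single permutation or result buffer ever covers all offset + limit rows. Chunk is a plain vector of ints standing in for a sorted column chunk; merge_sorted_runs and emit are hypothetical names, not StarRocks' real API.

```cpp
#include <cstddef>
#include <functional>
#include <memory>
#include <queue>
#include <utility>
#include <vector>

using Chunk = std::vector<int>; // stand-in for a sorted column chunk

// K-way merge of sorted runs that emits chunk_size-row slices instead of one
// huge result, and releases each input run the moment it is drained.
void merge_sorted_runs(std::vector<std::unique_ptr<Chunk>> runs, size_t chunk_size,
                       const std::function<void(Chunk&&)>& emit) {
    using Cursor = std::pair<size_t, size_t>; // (run index, position within run)
    auto greater = [&](const Cursor& a, const Cursor& b) {
        return (*runs[a.first])[a.second] > (*runs[b.first])[b.second];
    };
    std::priority_queue<Cursor, std::vector<Cursor>, decltype(greater)> heap(greater);
    for (size_t i = 0; i < runs.size(); ++i) {
        if (runs[i] && !runs[i]->empty()) heap.push({i, 0});
    }

    Chunk out;
    out.reserve(chunk_size);
    while (!heap.empty()) {
        auto [run, pos] = heap.top();
        heap.pop();
        out.push_back((*runs[run])[pos]);
        if (pos + 1 < runs[run]->size()) {
            heap.push({run, pos + 1});
        } else {
            runs[run].reset(); // run fully drained: free its memory right away
        }
        if (out.size() == chunk_size) {
            emit(std::move(out)); // hand off a full slice; peak memory stays flat
            out = Chunk();
            out.reserve(chunk_size);
        }
    }
    if (!out.empty()) emit(std::move(out)); // flush the final partial slice
}
```

Peak memory in this pattern stays near one output slice plus the still-live inputs, rather than growing with offset + limit the way a single materialized permutation would.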

Fixes #issue

What type of PR is this:

  • BugFix
  • Feature
  • Enhancement
  • Refactor
  • UT
  • Doc
  • Tool

Does this PR entail a change in behavior?

  • Yes, this PR will result in a change in behavior.
  • No, this PR will not result in a change in behavior.

If yes, please specify the type of change:

  • Interface/UI changes: syntax, type conversion, expression evaluation, display information
  • Parameter changes: default values, similar parameters but with different default values
  • Policy changes: use new policy to replace old one, functionality automatically enabled
  • Feature removed
  • Miscellaneous: upgrade & downgrade compatibility, etc.

Checklist:

  • I have added test cases for my bug fix or my new feature
  • This PR needs user documentation (for new or modified features or behaviors)
    • I have added documentation for my new feature or new function
  • This is a backport PR

Bugfix cherry-pick branch check:

  • I have checked the version labels which the PR will be auto-backported to the target branch
    • 3.4
    • 3.3
    • 3.2
    • 3.1
    • 3.0

@stdpain stdpain requested a review from a team as a code owner February 13, 2025 13:24
@stdpain stdpain force-pushed the opt_topn_with_large_limit branch from 8eb9d93 to 2bf3116 on February 14, 2025 06:44
satanson previously approved these changes Feb 14, 2025
@stdpain stdpain force-pushed the opt_topn_with_large_limit branch 2 times, most recently from 6735781 to d421e44 on February 17, 2025 02:10
satanson previously approved these changes Feb 24, 2025
@stdpain stdpain force-pushed the opt_topn_with_large_limit branch 4 times, most recently from 160e90d to e765fd7 on February 25, 2025 06:15
@stdpain stdpain force-pushed the opt_topn_with_large_limit branch from e765fd7 to 4262201 on February 25, 2025 08:11

[FE Incremental Coverage Report]

pass : 0 / 0 (0%)


[Java-Extensions Incremental Coverage Report]

pass : 0 / 0 (0%)


[BE Incremental Coverage Report]

pass : 296 / 344 (86.05%)

file detail

| path | covered_line | new_line | coverage | not_covered_line_detail |
| --- | --- | --- | --- | --- |
| 🔵 be/src/exec/topn_node.cpp | 0 | 2 | 00.00% | [234, 235] |
| 🔵 be/src/exec/chunks_sorter_topn.h | 9 | 13 | 69.23% | [81, 141, 145, 147] |
| 🔵 be/src/exec/sorting/merge.h | 29 | 39 | 74.36% | [152, 184, 185, 186, 188, 189, 190, 210, 211, 212] |
| 🔵 be/src/exec/chunks_sorter_topn.cpp | 163 | 194 | 84.02% | [87, 109, 174, 175, 176, 493, 497, 498, 499, 501, 503, 504, 505, 506, 507, 509, 510, 511, 512, 514, 515, 520, 521, 575, 732, 749, 762, 763, 772, 773, 774] |
| 🔵 be/src/exec/sorting/merge_cascade.cpp | 16 | 17 | 94.12% | [282] |
| 🔵 be/src/exec/sorting/merge.cpp | 47 | 47 | 100.00% | [] |
| 🔵 be/src/exec/sorting/merge_column.cpp | 30 | 30 | 100.00% | [] |
| 🔵 be/src/exec/pipeline/sort/partition_sort_sink_operator.cpp | 2 | 2 | 100.00% | [] |

@stdpain stdpain merged commit 11b98b0 into StarRocks:main Feb 27, 2025
52 checks passed

@Mergifyio backport branch-3.3

@github-actions github-actions bot removed the 3.3 label Feb 27, 2025
mergify bot (Contributor) commented Feb 27, 2025

backport branch-3.3

✅ Backports have been created


@Mergifyio backport branch-3.4

@github-actions github-actions bot removed the 3.4 label Feb 27, 2025
mergify bot (Contributor) commented Feb 27, 2025

backport branch-3.4

✅ Backports have been created

mergify bot pushed a commit that referenced this pull request Feb 27, 2025
[Enhancement] optimize the performance for topn with large offset (#55886)

1. Change max_buffered_size to chunk_size/4096 when the limit is greater than 65535.
2. Avoid large permutations that take up too much memory.
3. Reduce memory when merging large chunks.

SSB100G, dop=4, 1 BE
```
select lo_shipmode from lineorder order by lo_shipmode limit 50000000, 400
```
baseline: 1m42s, patched: 28s339ms

The reason baseline performance is so low:
1. A merge operation runs every 256 buffered chunks, but the total input is very large, so far too many merge operations are performed.

This PR adds max_buffer_size, derived from (offset + limit) / chunk_size, to reduce the frequency of merge operations, at the cost of potentially using more memory. It additionally optimizes memory usage when merging chunks, which reduces the peak memory of the merge.

Signed-off-by: stdpain <[email protected]>
(cherry picked from commit 11b98b0)

# Conflicts:
#	be/src/exec/chunks_sorter.cpp
#	be/src/exec/chunks_sorter.h
#	be/src/exec/chunks_sorter_topn.h
#	be/src/exec/pipeline/sort/local_partition_topn_context.cpp
#	be/src/exec/sorting/merge.h
mergify bot pushed a commit that referenced this pull request Feb 27, 2025
[Enhancement] optimize the performance for topn with large offset (#55886)


Signed-off-by: stdpain <[email protected]>
(cherry picked from commit 11b98b0)
wanpengfei-git pushed a commit that referenced this pull request Feb 27, 2025