Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

colocated aware random picker #2301

Merged
merged 97 commits into from
Feb 19, 2025
Merged

Conversation

makalaaneesh
Copy link
Collaborator

@makalaaneesh makalaaneesh commented Feb 6, 2025

Describe the changes in this pull request

Introduce a ColocatedAwareRandomTaskPicker which picks tasks based on weights/probabilities of the tables involved. This will essentially lead to multiple tables being ingested at the same time.

The max no. of tasks in parallel will be equal to the value of parallel-jobs (either default calculation of 0.25x cores in the case of adaptive parallelism, or the value provided by the user).

Describe if there are any user-facing changes

How was this pull request tested?

Does your PR have changes that can cause upgrade issues?

Component Breaking changes?
MetaDB Yes/No
Name registry json Yes/No
Data File Descriptor Json Yes/No
Export Snapshot Status Json Yes/No
Import Data State Yes/No
Export Status Json Yes/No
Data .sql files of tables Yes/No
Export and import data queue Yes/No
Schema Dump Yes/No
AssessmentDB Yes/No
Sizing DB Yes/No
Migration Assessment Report Json Yes/No
Callhome Json Yes/No
YugabyteD Tables Yes/No
TargetDB Metadata Tables Yes/No

@makalaaneesh makalaaneesh changed the title Aneesh/colocated aware random picker colocated aware random picker Feb 6, 2025
@makalaaneesh makalaaneesh marked this pull request as ready for review February 10, 2025 04:50
@makalaaneesh makalaaneesh marked this pull request as draft February 10, 2025 04:50
Copy link
Contributor

@priyanshi-yb priyanshi-yb left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Some initial comments

Base automatically changed from aneesh/reuse-worker-pool-sequential-task-picker to main February 11, 2025 17:38
@makalaaneesh makalaaneesh marked this pull request as ready for review February 17, 2025 06:45
Copy link
Contributor

@priyanshi-yb priyanshi-yb left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM!

if err != nil {
break
}
if task != pickedTask2 && task != pickedTask1 {
Copy link
Contributor

@priyanshi-yb priyanshi-yb Feb 18, 2025

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

how can the picked Task be equal to pickedTask1, it is already marked as done above right?

Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

You're right, it can't. Just wanted to exclude both task1 task2, to get task3.

@makalaaneesh makalaaneesh merged commit ceb675b into main Feb 19, 2025
66 checks passed
@makalaaneesh makalaaneesh deleted the aneesh/colocated-aware-random-picker branch February 19, 2025 04:54
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants