resource_manager: record the max RU per second #7936

Merged
merged 6 commits into from
Mar 21, 2024

Conversation

nolouch
Contributor

@nolouch nolouch commented Mar 18, 2024

What problem does this PR solve?

Issue Number: Close #7908

When I only run workload A:

sysbench --mysql-user=user1 --mysql-host=10.2.12.53 --mysql-port=32335 --table-size=10000000  oltp_read_only run --time=1000 --report-interval=1 --threads=4  --mysql-db=test --tables=32 --rate=2

RU Avg is OK because the workload is stable.
image

But when I manually run a large query such as:

select count(*) from test.sbtest1;

The monitoring, which reports only the average, was not very accurate, so I mistakenly believed that I was far from triggering resource control (RC).
image

But the slow query log shows otherwise:
image
The query cost 8000+ RU, which forced some queries to wait in the RC queue.
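
To make the gap between the averaged view and the per-second peak concrete, here is a minimal sketch in Go. The per-second numbers are invented for illustration (a steady baseline plus one burst of about 8000 RU, like the slow query above), and `avgAndPeak` is a hypothetical helper, not part of PD.

```go
package main

import "fmt"

// avgAndPeak returns the average and the largest value of a series of
// per-second RU samples. It returns (0, 0) for an empty series.
// Hypothetical helper for illustration only.
func avgAndPeak(perSecondRU []float64) (avg, peak float64) {
	if len(perSecondRU) == 0 {
		return 0, 0
	}
	sum := 0.0
	for _, v := range perSecondRU {
		sum += v
		if v > peak {
			peak = v
		}
	}
	return sum / float64(len(perSecondRU)), peak
}

func main() {
	// Assumed 20-second window: a quiet baseline of 100 RU/s
	// with a single one-second burst of 8000 RU.
	samples := make([]float64, 20)
	for i := range samples {
		samples[i] = 100
	}
	samples[7] = 8000

	avg, peak := avgAndPeak(samples)
	fmt.Printf("avg=%.0f RU/s peak=%.0f RU/s\n", avg, peak)
}
```

With these assumed samples the average stays far below the burst value, which is why an average-only panel can suggest plenty of headroom while individual seconds are already hitting the limit.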

What is changed and how does it work?

resource_manager: record the max RU per second 
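
As a rough illustration of what "record the max RU per second" could look like, the sketch below tracks the largest one-second RU delta within a reporting window. Only the constructor name `newMaxPerSecCostTracker` and the `Observe(rruSum, wruSum)` call shape are taken from the review snippet in this PR; the struct fields, the windowing logic, and the assumption that `Observe` receives cumulative sums about once per second are guesses made for this example, not the merged implementation.

```go
package main

import "fmt"

// maxPerSecCostTracker is an illustrative sketch, not the PD implementation.
// It assumes Observe is called roughly once per second with cumulative sums.
type maxPerSecCostTracker struct {
	name        string
	flushPeriod int     // observations per reporting window (assumed meaning)
	cnt         int     // observations seen in the current window
	lastRRUSum  float64 // cumulative read RU at the previous observation
	lastWRUSum  float64 // cumulative write RU at the previous observation
	maxRRU      float64 // largest read RU delta in the current window
	maxWRU      float64 // largest write RU delta in the current window
	reportedRRU float64 // maximum published for the last completed window
	reportedWRU float64
}

func newMaxPerSecCostTracker(name string, flushPeriod int) *maxPerSecCostTracker {
	return &maxPerSecCostTracker{name: name, flushPeriod: flushPeriod}
}

// Observe records the RU consumed since the previous call and, once the
// window is full, publishes the window maximum and starts a new window.
func (t *maxPerSecCostTracker) Observe(rruSum, wruSum float64) {
	if d := rruSum - t.lastRRUSum; d > t.maxRRU {
		t.maxRRU = d
	}
	if d := wruSum - t.lastWRUSum; d > t.maxWRU {
		t.maxWRU = d
	}
	t.lastRRUSum, t.lastWRUSum = rruSum, wruSum
	t.cnt++
	if t.cnt >= t.flushPeriod {
		// A real implementation would presumably update a metric here.
		t.reportedRRU, t.reportedWRU = t.maxRRU, t.maxWRU
		t.maxRRU, t.maxWRU, t.cnt = 0, 0, 0
	}
}

func main() {
	// Hypothetical group name and window length of 5 observations.
	tracker := newMaxPerSecCostTracker("default", 5)
	rruSums := []float64{100, 200, 8200, 8300, 8400}
	wruSums := []float64{0, 10, 20, 20, 20}
	for i := range rruSums {
		tracker.Observe(rruSums[i], wruSums[i])
	}
	fmt.Printf("%s: max RRU/s=%.0f, max WRU/s=%.0f\n",
		tracker.name, tracker.reportedRRU, tracker.reportedWRU)
}
```

One caveat of this sketch: the first delta is measured against zero, so a tracker created after a group has already consumed RU would report an inflated first value unless it is seeded with the current sums.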

Check List

Tests

  • Unit test
  • Integration test

Release note

None.

Contributor

ti-chi-bot bot commented Mar 18, 2024

[REVIEW NOTIFICATION]

This pull request has been approved by:

  • CabinfeverB
  • HuSharp


@ti-chi-bot ti-chi-bot bot added the release-note-none Denotes a PR that doesn't merit a release note. label Mar 18, 2024
@ti-chi-bot ti-chi-bot bot requested review from JmPotato and Yisaer March 18, 2024 12:12
@ti-chi-bot ti-chi-bot bot added the size/L Denotes a PR that changes 100-499 lines, ignoring generated files. label Mar 18, 2024
@nolouch nolouch requested review from CabinfeverB and HuSharp and removed request for JmPotato and Yisaer March 18, 2024 12:14

codecov bot commented Mar 18, 2024

Codecov Report

Merging #7936 (23db662) into master (955d30a) will increase coverage by 0.11%.
The diff coverage is 94.00%.

❗ Current head 23db662 differs from pull request most recent head 6ea61b4. Consider uploading reports for the commit 6ea61b4 to get more accurate results

Additional details and impacted files
@@            Coverage Diff             @@
##           master    #7936      +/-   ##
==========================================
+ Coverage   73.48%   73.60%   +0.11%     
==========================================
  Files         436      436              
  Lines       48376    48425      +49     
==========================================
+ Hits        35550    35644      +94     
+ Misses       9768     9721      -47     
- Partials     3058     3060       +2     
Flag Coverage Δ
unittests 73.60% <94.00%> (+0.11%) ⬆️


Signed-off-by: nolouch <[email protected]>
@ti-chi-bot ti-chi-bot bot added the status/LGT1 Indicates that a PR has LGTM 1. label Mar 20, 2024
@nolouch
Contributor Author

nolouch commented Mar 20, 2024

PTAL @glorv

Member

@HuSharp HuSharp left a comment

Do we need to update metrics.json?

Contributor

@glorv glorv left a comment

The rest LGTM.

Comment on lines 485 to 488
    if maxPerSecTrackers[name] == nil {
        maxPerSecTrackers[name] = newMaxPerSecCostTracker(name, defaultCollectIntervalSec)
    }
    maxPerSecTrackers[name].Observe(rruSum[name], wruSum[name])
Contributor

Suggested change
-   if maxPerSecTrackers[name] == nil {
-       maxPerSecTrackers[name] = newMaxPerSecCostTracker(name, defaultCollectIntervalSec)
-   }
-   maxPerSecTrackers[name].Observe(rruSum[name], wruSum[name])
+   if maxPerSecTrackers[name] == nil && (rruSum[name] + wruSum[name] > 0.0) {
+       maxPerSecTrackers[name] = newMaxPerSecCostTracker(name, defaultCollectIntervalSec)
+   }
+   if maxPerSecTrackers[name] != nil {
+       maxPerSecTrackers[name].Observe(rruSum[name], wruSum[name])
+   }

Maybe it is better to skip inactive groups here.

Contributor Author

A group may be inactive for a period of time and then active for a period of time. The cost of tracking it is acceptable in order to reflect this trend.

pkg/mcs/resourcemanager/server/manager.go Outdated Show resolved Hide resolved
@ti-chi-bot ti-chi-bot bot added status/LGT2 Indicates that a PR has LGTM 2. and removed status/LGT1 Indicates that a PR has LGTM 1. labels Mar 21, 2024
@nolouch
Contributor Author

nolouch commented Mar 21, 2024

ptal @glorv

Contributor

@glorv glorv left a comment

LGTM

@nolouch
Contributor Author

nolouch commented Mar 21, 2024

/merge

Contributor

ti-chi-bot bot commented Mar 21, 2024

@nolouch: It seems you want to merge this PR, I will help you trigger all the tests:

/run-all-tests

You only need to trigger /merge once, and if the CI test fails, you just re-trigger the test that failed and the bot will merge the PR for you after the CI passes.

Contributor

ti-chi-bot bot commented Mar 21, 2024

This pull request has been accepted and is ready to merge.

Commit hash: 23db662

@ti-chi-bot ti-chi-bot bot added the status/can-merge Indicates a PR has been approved by a committer. label Mar 21, 2024
@ti-chi-bot ti-chi-bot bot merged commit 52e8763 into tikv:master Mar 21, 2024
22 checks passed
@nolouch nolouch deleted the record-max branch March 21, 2024 16:26
@nolouch nolouch added the needs-cherry-pick-release-7.5 Should cherry pick this PR to release-7.5 branch. label Mar 21, 2024
@ti-chi-bot
Member

In response to a cherrypick label: new pull request created to branch release-7.5: #7968.

@nolouch nolouch added the needs-cherry-pick-release-7.1 Should cherry pick this PR to release-7.1 branch. label Apr 1, 2024
@nolouch
Contributor Author

nolouch commented Apr 1, 2024

/run-cherry-picker

@ti-chi-bot
Member

In response to a cherrypick label: new pull request created to branch release-7.1: #8011.

ti-chi-bot pushed a commit to ti-chi-bot/pd that referenced this pull request Apr 1, 2024
ti-chi-bot bot added a commit that referenced this pull request Apr 3, 2024
close #7908

resource_manager: record the max RU per second

Signed-off-by: nolouch <[email protected]>

Co-authored-by: nolouch <[email protected]>
Co-authored-by: ti-chi-bot[bot] <108142056+ti-chi-bot[bot]@users.noreply.github.com>
ti-chi-bot bot pushed a commit that referenced this pull request Apr 11, 2024
close #7908

resource_manager: record the max RU per second

Signed-off-by: ti-chi-bot <[email protected]>
Signed-off-by: nolouch <[email protected]>

Co-authored-by: ShuNing <[email protected]>
Co-authored-by: nolouch <[email protected]>
Labels
needs-cherry-pick-release-7.1 Should cherry pick this PR to release-7.1 branch. needs-cherry-pick-release-7.5 Should cherry pick this PR to release-7.5 branch. release-note-none Denotes a PR that doesn't merit a release note. size/L Denotes a PR that changes 100-499 lines, ignoring generated files. status/can-merge Indicates a PR has been approved by a committer. status/LGT2 Indicates that a PR has LGTM 2.
Development

Successfully merging this pull request may close these issues.

Monitors RU consumption to maximize short-term usage of MAX RU.
5 participants