Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[ISSUE-378][HugePartition][Part-3] Introduce more metrics about huge partition #494

Merged
merged 6 commits into from
Jan 22, 2023

Conversation

zuston
Copy link
Member

@zuston zuston commented Jan 17, 2023

What changes were proposed in this pull request?

Introduce more metrics about huge partition
counter

  1. total_require_buffer_failed_for_huge_partition
  2. total_require_buffer_failed_for_regular_partition
  3. total_app_num
  4. total_app_with_huge_partition_num
  5. total_partition_num
  6. total_huge_partition_num

Gauge

  1. huge_partition_num
  2. app_with_huge_partition_num

Why are the changes needed?

Having these metrics, we should observe the concrete influence from huge partition for regular partition and huge partition number in one shuffle-server

Does this PR introduce any user-facing change?

No

How was this patch tested?

  1. UTs

@codecov-commenter
Copy link

codecov-commenter commented Jan 17, 2023

Codecov Report

Merging #494 (866905e) into master (849993c) will increase coverage by 0.65%.
The diff coverage is 92.30%.

@@             Coverage Diff              @@
##             master     #494      +/-   ##
============================================
+ Coverage     58.83%   59.49%   +0.65%     
- Complexity     1707     1759      +52     
============================================
  Files           206      205       -1     
  Lines         11508    11539      +31     
  Branches       1030     1035       +5     
============================================
+ Hits           6771     6865      +94     
+ Misses         4323     4267      -56     
+ Partials        414      407       -7     
Impacted Files Coverage Δ
.../org/apache/uniffle/server/ShuffleTaskManager.java 76.76% <85.71%> (+0.16%) ⬆️
...ava/org/apache/uniffle/server/ShuffleTaskInfo.java 98.07% <94.73%> (-1.93%) ⬇️
...rg/apache/uniffle/server/ShuffleServerMetrics.java 97.32% <100.00%> (+0.26%) ⬆️
...he/uniffle/server/buffer/ShuffleBufferManager.java 83.33% <100.00%> (+0.11%) ⬆️
...java/org/apache/uniffle/common/util/Constants.java
.../org/apache/uniffle/common/util/UnitConverter.java 51.31% <0.00%> (+3.94%) ⬆️
.../org/apache/uniffle/common/config/ConfigUtils.java 95.55% <0.00%> (+61.11%) ⬆️

📣 We’re building smart automated test selection to slash your CI/CD build times. Learn more

@zuston zuston requested a review from advancedxy January 17, 2023 04:17
@advancedxy advancedxy self-assigned this Jan 17, 2023
@zuston zuston requested a review from advancedxy January 17, 2023 10:32
@zuston zuston requested a review from advancedxy January 19, 2023 03:13
}

public void markHugePartition(int shuffleId, int partitionId) {
if (!existHugePartition.get()) {
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Previously, I was thinking:

if (!existHugePartition.getAndSet(true)) {
        ShuffleServerMetrics.gaugeAppWithHugePartitionNum.inc();
        ShuffleServerMetrics.counterTotalAppWithHugePartitionNum.inc();
}

But maybe get is cheaper, and we should guide by it first.

@advancedxy
Copy link
Contributor

Generally lgtm, except two minor comments.

@advancedxy
Copy link
Contributor

LGTM, pending CI passes

@zuston zuston requested a review from advancedxy January 19, 2023 07:57
@zuston
Copy link
Member Author

zuston commented Jan 20, 2023

PTAL @advancedxy . CI passed

Copy link
Contributor

@advancedxy advancedxy left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

lgtm

@zuston zuston merged commit 116125c into apache:master Jan 22, 2023
zuston pushed a commit that referenced this pull request Apr 26, 2024
### What changes were proposed in this pull request?

Fix the metric huge_partition_num.

### Why are the changes needed?

A follow-up PR for: #494.

### Does this PR introduce _any_ user-facing change?

No.

### How was this patch tested?

Existing UTs.
jerqi pushed a commit that referenced this pull request Apr 30, 2024
### What changes were proposed in this pull request?

Fix the metric huge_partition_num.

### Why are the changes needed?

A follow-up PR for: #494.

### Does this PR introduce _any_ user-facing change?

No.

### How was this patch tested?

Existing UTs.
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants