[CELEBORN-1685] ShuffleFallbackPolicy supports ShuffleFallbackCount metric #2891

Closed
wants to merge 1 commit into from

@SteNicholas SteNicholas commented Nov 7, 2024

What changes were proposed in this pull request?

  1. ShuffleFallbackPolicy now supports the ShuffleFallbackCount metric, which reports the shuffle fallback count for each fallback policy.
  2. Introduce the ShuffleTotalCount metric to record the total number of shuffles.
  3. Fix Spark 2 not incrementing the shuffle count via LifecycleManager.
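
The bookkeeping described in the list above could be sketched roughly as follows. This is a minimal illustration, not the code in this PR: the class `FallbackCounters` and its methods are hypothetical names, and it assumes each fallback is attributed to a policy by a string key such as the policy's class name.

```java
import java.util.Map;
import java.util.concurrent.ConcurrentHashMap;
import java.util.concurrent.atomic.LongAdder;

// Hypothetical helper for illustration only; not a Celeborn class.
class FallbackCounters {
    // Counts every shuffle, whether or not it fell back.
    private final LongAdder shuffleTotalCount = new LongAdder();
    // Counts fallbacks per policy, keyed by an assumed policy name.
    private final Map<String, LongAdder> shuffleFallbackCounts = new ConcurrentHashMap<>();

    /** Records one shuffle; fallbackPolicy is null when the shuffle did not fall back. */
    void recordShuffle(String fallbackPolicy) {
        shuffleTotalCount.increment();
        if (fallbackPolicy != null) {
            shuffleFallbackCounts
                .computeIfAbsent(fallbackPolicy, k -> new LongAdder())
                .increment();
        }
    }

    long total() {
        return shuffleTotalCount.sum();
    }

    long fallbackCount(String policy) {
        LongAdder adder = shuffleFallbackCounts.get(policy);
        return adder == null ? 0L : adder.sum();
    }
}
```

Keeping the total separate from the per-policy map lets a dashboard derive a fallback ratio per policy without summing over all keys.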

Why are the changes needed?

The implementations of ShuffleFallbackPolicy do not currently support the ShuffleFallbackCount metric. Meanwhile, production practice at Bilibili needs a ShuffleFallbackCount for each ShuffleFallbackPolicy.

Does this PR introduce any user-facing change?

No.

How was this patch tested?

Cluster test.

@SteNicholas SteNicholas force-pushed the CELEBORN-1685 branch 2 times, most recently from 94b747b to 91ba974 Compare November 7, 2024 10:45
@SteNicholas

Ping @turboFei, @FMX, @RexXiong.

@SteNicholas SteNicholas force-pushed the CELEBORN-1685 branch 2 times, most recently from 35d9c97 to 2a50d74 Compare November 8, 2024 07:56

@turboFei turboFei left a comment

LGTM, thanks


@RexXiong RexXiong left a comment

LGTM, merge to main (v0.6.0)

@RexXiong RexXiong closed this in 169b6f6 Nov 11, 2024
RexXiong pushed a commit that referenced this pull request Nov 12, 2024
…compute shuffleFallbackCounts via class name of ShuffleFallbackPolicy implementation

### What changes were proposed in this pull request?

`CelebornShuffleFallbackPolicyRunner` should compute `shuffleFallbackCounts` via the class name of the `ShuffleFallbackPolicy` implementation.

Follow up #2891.

### Why are the changes needed?

`CelebornShuffleFallbackPolicyRunner` currently computes `shuffleFallbackCounts` via the class name of the `Option` wrapper instead of the class name of the `ShuffleFallbackPolicy` implementation it holds.
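
The pitfall described here can be shown in miniature. The sketch below is an assumption-laden illustration, not the runner's code: Java's `Optional` stands in for Scala's `Option`, and `PolicyNameDemo`, `FallbackPolicy`, `AlwaysFallback`, and `NeverFallback` are hypothetical names.

```java
import java.util.List;
import java.util.Optional;

// Hypothetical demo types; java.util.Optional stands in for Scala's Option.
class PolicyNameDemo {
    interface FallbackPolicy {
        boolean needFallback();
    }

    static final class AlwaysFallback implements FallbackPolicy {
        public boolean needFallback() { return true; }
    }

    static final class NeverFallback implements FallbackPolicy {
        public boolean needFallback() { return false; }
    }

    // Returns the first policy that asks for a fallback, if any.
    static Optional<FallbackPolicy> firstTriggered(List<FallbackPolicy> policies) {
        return policies.stream().filter(FallbackPolicy::needFallback).findFirst();
    }

    // Buggy key: names the wrapper, so every policy shares one counter.
    static String wrapperKey(Optional<FallbackPolicy> hit) {
        return hit.getClass().getSimpleName();
    }

    // Intended key: names the wrapped implementation; null when nothing triggered.
    static String policyKey(Optional<FallbackPolicy> hit) {
        return hit.map(p -> p.getClass().getSimpleName()).orElse(null);
    }
}
```

With the wrapper's class name as the key, all fallbacks collapse into a single bucket, which defeats the per-policy metric; unwrapping first yields one key per implementation.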

### Does this PR introduce _any_ user-facing change?

No.

### How was this patch tested?

CI.

Closes #2909 from SteNicholas/CELEBORN-1685.

Authored-by: SteNicholas <[email protected]>
Signed-off-by: Shuang <[email protected]>