-
Notifications
You must be signed in to change notification settings - Fork 719
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Enhance the region balance of the 1M tables imported by lightning #8424
Comments
i'm interesting in the issue. |
/assign @River2000i |
PD schedule scatter region base on pd/pkg/schedule/scatter/region_scatterer.go Line 346 in 13174b5
PD will compare region_count base on the group . For now, gourp define by table ID.(every table belong to a group )pd/pkg/schedule/scatter/region_scatterer.go Line 367 in 13174b5
Summary:
root cause: pd/pkg/schedule/scatter/region_scatterer.go Line 444 in 13174b5
If we want to schedule scatter region in cluster level, we can call ScatterRegion with the same group . It will be an options for caller.
|
Development Task
Background
balance-scheduler
.Balance-Region
will not schedule empty region, There is a hardcode inpd/pkg/schedule/filter/region_filters.go
Lines 150 to 153 in 7e18a69
Problems faced
For lightning importing 1 million tables(one table corresponds to one region), even though there are more than 3 stores, consecutive region keys will generate a lot of regions aggregations in the first 3 stores. And since regions are not scheduled, the three stores have a high probability of OOM.
The text was updated successfully, but these errors were encountered: