-
Notifications
You must be signed in to change notification settings - Fork 5.8k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
*: improve NULL count estimation for single column index #9474
Conversation
Codecov Report
@@ Coverage Diff @@
## master #9474 +/- ##
================================================
- Coverage 77.5011% 77.2189% -0.2823%
================================================
Files 404 405 +1
Lines 81591 81717 +126
================================================
- Hits 63234 63101 -133
- Misses 13664 13936 +272
+ Partials 4693 4680 -13 |
/run-all-tests |
/run-unit-test |
3358d85
to
597dbb0
Compare
597dbb0
to
3452f4d
Compare
3452f4d
to
fe2220b
Compare
/run-all-tests |
1 similar comment
/run-all-tests |
72fe923
to
8fbe7c8
Compare
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
rest lgtm
8fbe7c8
to
829c5c5
Compare
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM
/run-all-tests |
@eurekaka please cherry pick this PR to release-2.1 |
What problem does this PR solve?
An attempt to fix issue #8563.
When column or index contains a lot of null values, our row count estimation for conditions is quite inaccurate. This PR tries to improve the estimation for column stats or single-column index stats. Note that, in #8563, the query is on a multi-column index, and I haven't figured out a way to work in multi-column index scenarios under the current multi-column index framework.
What is changed and how it works?
Histogram
,Column
andIndex
;fix periodical panic ofsplit to executor: only show valid columns inshow stats_histograms
, I will split this to a separate PR later;stats_histogram
#9487Check List
Tests
Code changes
Side effects
Related changes
N/A