-
Notifications
You must be signed in to change notification settings - Fork 1.2k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Minor: Add comment explaining rationale for hash check #11750
Conversation
Seems duckdb has the similar check(but just use the u16 prefix), maybe we can mention? |
The linked PR is interesting: duckdb/duckdb#9575 |
I am not sure what we would mention 🤔 Maybe can you propose the wording ? Or maybe we can leave a comment to the duckdb art in one of the comments on #11718 ? |
Interesting, it seems the common point with #11718 is that we should consider the increasing collision when the hash table is filling up. |
In theory I think this is the kind of optimization that we are relying on |
Yes, as @2010YOUY01 mentioned in #11, hashbrown did a lot of design work about it. Actually, I am thinking that, maybe exist a threshold (like xx% of the bucket have been filled), and it is just reasonable to check the hash first when exceed it. |
It might be worth checking -- one thing that we have to be careful of is that the overhead of the check itself may end up large |
Make sense, will make more experiments about this when having free time. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
lgtm thanks @alamb for adding the tracker
Which issue does this PR close?
Follow on to #11718
Closes #.
Rationale for this change
In #11718 @Rachelint found that comparing hash values before group key values improved performance. The reason this could be faster was not obvious at first
What changes are included in this PR?
Add doc comments explaining the rationale and linking to #11718
Are these changes tested?
CI
Are there any user-facing changes?
No, just comments