-
Notifications
You must be signed in to change notification settings - Fork 1.2k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Optimize queries that contain IN
subqueries by rewriting as semi join
#488
Comments
@Dandandan I would like to work on this ticket. Some pointers would be great, so i will be in the right direction. 😎 |
Hey @msathis that would be great. Effectively it means rewriting queries from:
Could be written as (minus SQL syntax)
So the work will be
I think we can return an error in case the logical plan still contains a One complication I saw is that adding a |
@Dandandan I was debugging to see what happens with the current code. Seems like all |
The This currently results in an error if you try it (something like |
@Dandandan Got it. Now it's all clear 👍 |
Perhaps now duplicated by #1209 |
IN
to semi joinIN
subqueries by rewriting as semi join
Is your feature request related to a problem or challenge? Please describe what you are trying to do.
Currently
IN
in queries is not supported.The recently added semi hash join can be used for those queries.
Describe the solution you'd like
Add this to planner/optimizer.
Describe alternatives you've considered
Additional context
The text was updated successfully, but these errors were encountered: