Add an "in" filter predicate #993
Comments
@jaychia I would like to work on this as my first issue. Can you point me to the files where I need to implement this?
You can look at the implementation of Here are the files you'll want to look at and I linked the implementations for
Also feel free to chat with us on our Slack if you have any other questions!
I actually just did #1174 which implements this functionality for However, we still need a way to enable the Python I.e. it would be nice to go from: (before)
To (after)
Tried it out and I don't think overriding the See: https://stackoverflow.com/questions/53351829/is-it-possible-for-contains-to-return-non-boolean-value and https://docs.python.org/3.7/reference/expressions.html#membership-test-operations
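For context, the linked Python reference says that for a class defining `__contains__`, `x in y` evaluates to `True` if `y.__contains__(x)` returns a true value and `False` otherwise, so the operator cannot hand back an elementwise result. A minimal sketch of that behavior follows; the `Column` class and its `is_in` method are hypothetical stand-ins written for illustration, not Daft's API:

```python
class Column:
    """Hypothetical stand-in for a column/expression object (not Daft's API)."""

    def __init__(self, values):
        self.values = list(values)

    def __contains__(self, item):
        # Attempt to return an elementwise result instead of a single bool.
        return [v == item for v in self.values]

    def is_in(self, items):
        # An explicit method is free to return any object, e.g. a boolean mask.
        allowed = set(items)
        return [v in allowed for v in self.values]


col = Column([1, 2, 3])

via_operator = 2 in col   # the returned list is coerced to a bool
misleading = 5 in col     # [False, False, False] is a non-empty list, so truthy
via_method = col.is_in([2, 3])

print(via_operator)  # True
print(misleading)    # True, even though 5 is not in the column
print(via_method)    # [False, True, True]
```

Because the `in` operator collapses whatever `__contains__` returns into a single `bool`, an explicit method is the only way to get a per-element mask out of this kind of object.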
Closes #993

The `is_in` expression checks whether the values of a series are contained in a given list of items and produces a series of boolean values as the result of this membership test.

Changes:
- Added a Literal Series so that a Series can be passed into the expression
- Added the `is_in` expression and kernel
- Added tests
Is your feature request related to a problem? Please describe.
We should have an "in" filter predicate:
Currently, the fallback to this is to perform an inner join with another dataframe, but this is unnecessarily slow and requires a shuffle.
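To make the comparison concrete, here is a rough sketch in plain Python (not Daft code) of the two approaches: a membership filter that checks each row against a set, and the inner-join fallback against a single-column table of allowed values. The row data, column name, and both helper functions are made up for illustration; the sketch only shows that the two select the same rows, and does not model a distributed shuffle.

```python
rows = [
    {"id": 1, "city": "Oslo"},
    {"id": 2, "city": "Lima"},
    {"id": 3, "city": "Oslo"},
    {"id": 4, "city": "Kyiv"},
]
wanted = ["Oslo", "Kyiv"]


def filter_is_in(rows, column, items):
    # Membership filter: each row is checked locally against a set of items.
    lookup = set(items)
    return [row for row in rows if row[column] in lookup]


def filter_via_inner_join(rows, column, items):
    # Join fallback: build a second single-column "dataframe" and inner join on it.
    # Items are deduplicated so the join does not multiply matching rows.
    right = [{column: item} for item in dict.fromkeys(items)]
    return [left for left in rows for r in right if left[column] == r[column]]


by_filter = filter_is_in(rows, "city", wanted)
by_join = filter_via_inner_join(rows, "city", wanted)

print([row["id"] for row in by_filter])  # [1, 3, 4]
print(by_filter == by_join)              # True
```

The filter only needs the list of items to be available wherever the rows are, whereas the join treats the items as a second dataframe that has to be matched up with the first.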