[FEA] OneHotEncoder support in sklearn Pipeline #3053
Labels
Algorithm API Change
For tracking changes to algorithms that might effect the API
Cython / Python
Cython or Python issue
feature request
New feature or request
Is your feature request related to a problem? Please describe.
I would like to use cuml.preprocessing.OneHotEncoder as part of Pipeline so that I can assemble additional steps and multiple preprocessors. However, OneHotEncoder alone in a Pipeline fails. For example:
Describe the solution you'd like
OneHotEncoder works with Pipeline similar to sklearn's OneHotEncoder:
Describe alternatives you've considered
Omit using Pipeline with cuml OneHotEncoder.
Additional context
As a reference, I am using cuml version 0.16.0a+882.g5851f4140 and sklearn version 0.23.1.
@tfeher here's an example with a categorical transformer and sklearn's Pipeline. It would be great to have OneHotEncoder in a Pipeline, I don't see this error with some of the cuml experimental preprocessors.
The text was updated successfully, but these errors were encountered: