Grouping should be parallelized #1284

Closed
st-pasha opened this issue Sep 11, 2018 · 0 comments · Fixed by #1285
Labels:
groupby: Group-by functionality and Reducers
performance: Issues focused on the speed of execution of various datatable functions.

Comments

st-pasha (Contributor) commented Sep 11, 2018

The grouping calculation in datatable is still not parallelized, so running it in parallel should improve its performance. See the current benchmark results.
We should try to do this before the standup.

A similar feature request was also filed for data.table: Rdatatable/data.table#3042
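
The issue does not say how the grouping would be parallelized, so the following is only a minimal sketch of one common approach (split the rows into chunks, group each chunk independently, then merge the partial results). It is not datatable's implementation; the names `group_chunk` and `parallel_group` and the `n_workers` parameter are hypothetical and chosen for illustration. Note also that Python threads share the GIL, so this shows the structure of the technique rather than a real speedup.

```python
from collections import defaultdict
from concurrent.futures import ThreadPoolExecutor


def group_chunk(keys, start, stop):
    """Map each key in keys[start:stop] to the row indices where it occurs."""
    groups = defaultdict(list)
    for i in range(start, stop):
        groups[keys[i]].append(i)
    return groups


def parallel_group(keys, n_workers=4):
    """Group row indices by key (hypothetical helper, not a datatable API).

    Rows are split into contiguous chunks, each chunk is grouped by its own
    worker, and the partial results are merged in chunk order.
    """
    n = len(keys)
    if n == 0:
        return {}
    chunk = -(-n // n_workers)  # ceiling division
    bounds = [(s, min(s + chunk, n)) for s in range(0, n, chunk)]
    with ThreadPoolExecutor(max_workers=n_workers) as pool:
        # pool.map returns results in the same order as `bounds`
        partials = list(pool.map(lambda b: group_chunk(keys, *b), bounds))
    merged = defaultdict(list)
    for part in partials:
        for key, rows in part.items():
            merged[key].extend(rows)  # chunk order keeps indices ascending
    return dict(merged)


if __name__ == "__main__":
    print(parallel_group(["a", "b", "a", "c", "b", "a"], n_workers=2))
```

Because the chunks are contiguous and merged in order, the row indices within each group stay sorted, which matches what a sequential pass would produce.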

st-pasha self-assigned this Sep 11, 2018
st-pasha added the performance and groupby labels Sep 11, 2018
st-pasha added this to the Release 0.7.0 milestone Jan 29, 2020