-
Notifications
You must be signed in to change notification settings - Fork 5.8k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[<Ray component: Ray>] map_group doen't support actor #41406
Comments
in ray 2.4.0, the error for the map_group+ actor is same as this one |
after discussion we should change map_groups() to allow (and require) batch_size iff gpu num is set. @scottjlee to follow up on this change, |
I think it's not only a problem of the gpu + batch_size
it will failed with error.
it works for the function as the input
here are the whole codes
|
Ya, I think
|
@wingkitlee0 I see! Thanks. Based on my search results, the
|
@wingkitlee0 Also what is the motivation that we use |
[Not Ray team, but I used map_groups quite a lot lately] You may find previous discussions about
According to Ray team, the data are materialized, but not necessarily gathered into one node. (After all, sorting is distributed)
I believe this is simply because they haven't implemented for callable class yet. This should be a straight-forward PR unless there are some limitations (in that case Ray team knows better)
It's about the block boundary (in Ray) vs group boundary (in data). We want the data that goes into the mapping function contains the whole group. |
@wingkitlee0 Thanks a lot for you classification! |
I encountered this issue as well. I have some relatively expensive state to initialize in an actor. However, unlike the operators The workaround w/o using an actor means that I have to initialize such as state per call. Really look forward to ray 2.10 for this feature. |
@Bye-legumes - per our meeting last week; do you think you'd have the bandwidth in May to pick this up? We can help shepherd/review to get this merged but it'll be faster to resolve/mitigate this issue especially if it's impacting your scenario/use-case. |
See #41406 Signed-off-by: Balaji Veeramani <[email protected]>
What happened + What you expected to happen
What happened
What happened was that when I used the group_by operation with the batch_size parameter, I encountered an error with “multiple values”. However, when I tried to use the group_by operation without the batch_size parameter, I encountered an exception that required the batch_size parameter.”
for the code above, there is an error
If I delete the key_word batch_size, the error is
What I expected
Ray data can perform the groupby+map_groups for actors as mentioned in the doc. https://docs.ray.io/en/latest/data/api/doc/ray.data.grouped_data.GroupedData.map_groups.html
Versions / Dependencies
ray 2.8.0
Reproduction script
Issue Severity
Medium: It is a significant difficulty but I can work around it.
The text was updated successfully, but these errors were encountered: