Deprecate dims methods for iterator forms? #35292

ChrisRackauckas · 2020-03-28T22:19:55Z

It seems like the dims-based methods could be from another time. It seems that we now have a way to do this using iterators:

A = rand(4,4)
sum(A,dims=2)
sum(eachcol(A))

Could we make our APIs simpler by dropping the dims-based versions and instead just requiring that the user uses an appropriate iterator before the call? One thing that would have to be true when doing this is that it should be just as efficient, so I wonder if @maleadt could comment on whether there's any extra optimizations on things like GPUs that are available in sum(A,dims=2) vs sum(eachcol(A)). Otherwise, it might be interesting to drop in a Julia 2.0?

(One note is that we'd need iterators on higher dimensions as well, though that could be done)

The text was updated successfully, but these errors were encountered:

tkf · 2020-03-29T01:00:51Z

I think using eachcol/eachrow/eachslice is problematic in the sense that:

The reduced dimensions are dropped. For example, sum(A,dims=2) :: Matrix while sum(eachcol(A)) :: Vector when A :: Matrix.
It works only when the reducing function (e.g., +) is implicitly "broadcasted." For example, prod(eachcol(A :: Matrix)) throws ATM.

I proposed to add yet another lazy object type to solve these problems: #16606 (comment). See also #33130.

mcabbott · 2020-03-31T18:25:03Z

Bullet 2 points towards prod.(eachrow(A :: Matrix)) instead. Should this then fuse with other operations?

We could have variants which don't drop dimensions. StarSlice.jl is a sketch of this, writing A[:, *] for the getindex equivalent of eachcol, and A[!, *] for a variant with a 1×4 container, thus sum.(A[!, *]) == sum(A, dims=1). Plus another variant with 2×1 slices, sum(A[:, &]) == sum(A, dims=2).

Also ref EachSlice types of #32310, where I thought the point was to pretend you were passing in slices, while e.g. Distances.jl can still compute efficiently on the whole array.

tkf · 2020-04-01T02:12:33Z

Bullet 2 points towards prod.(eachrow(A :: Matrix)) instead.

We could have variants which don't drop dimensions. StarSlice.jl is a sketch of this

Thanks, that's a good point (my understating is that we can still keep array-of-arrays interface). I guess we can also make something like dropdims(A .- mean.(A[!, *])) work as well? I'm thinking to propagate "dropped dims" information by embedding in the wrapper type and returning it from the StarSlice getindex.

Should this then fuse with other operations?

I don't think this is required to be implemented from the get-go but I think (eventually) making things like dest .= sum.(eachrow(A)) non-allocating and locality-aware would be nice.

simonbyrne · 2021-02-15T03:33:50Z

It works only when the reducing function (e.g., +) is implicitly "broadcasted." For example, prod(eachcol(A :: Matrix)) throws ATM.

On the other hand it is equivalent to map(sum, eachrow(X)) (or mapslices(sum, X, dims=2)).

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Deprecate dims methods for iterator forms? #35292

Deprecate dims methods for iterator forms? #35292

ChrisRackauckas commented Mar 28, 2020 •

edited

Loading

tkf commented Mar 29, 2020 •

edited

Loading

mcabbott commented Mar 31, 2020

tkf commented Apr 1, 2020 •

edited

Loading

simonbyrne commented Feb 15, 2021 •

edited

Loading

Deprecate dims methods for iterator forms? #35292

Deprecate dims methods for iterator forms? #35292

Comments

ChrisRackauckas commented Mar 28, 2020 • edited Loading

tkf commented Mar 29, 2020 • edited Loading

mcabbott commented Mar 31, 2020

tkf commented Apr 1, 2020 • edited Loading

simonbyrne commented Feb 15, 2021 • edited Loading

ChrisRackauckas commented Mar 28, 2020 •

edited

Loading

tkf commented Mar 29, 2020 •

edited

Loading

tkf commented Apr 1, 2020 •

edited

Loading

simonbyrne commented Feb 15, 2021 •

edited

Loading