implement `@fill([0, 0], 2)` #41209

CameronBieganek · 2021-06-12T13:47:18Z

Original Title: Change fill([0, 0], 2) behavior for 2.0

The behavior of fill when its first argument is an array is to create an array of arrays, where each of the elements is a reference to the exact same array in memory. This is counterintuitive to many users (see Discourse links below), and it seems kind of pointless. I propose that in 2.0 fill with an array input should fill the resulting array with copies of the input array.

Below are some examples where this came up on Discourse. There may be more.

https://discourse.julialang.org/t/simple-question-about-assignment-to-a-vector-of-vectors/62744
https://discourse.julialang.org/t/fill-anarray-2-behaviour/22429
https://discourse.julialang.org/t/initialization-of-array-of-arrays-with-fill-ones-1-2-2-only-one-vector-is-created/48048
https://discourse.julialang.org/t/how-can-i-fill-an-array-with-empty-2d-arrays/60895

The text was updated successfully, but these errors were encountered:

simeonschaub · 2021-06-13T12:19:39Z

Copying by default seems even more confusing to me. This would cause lots of allocations any time an array is filled with mutable objects. There are many situations where you actually want to fill an array with references to the same mutable object, for example pretty much any time you don't plan to mutate the items in an array themselves, which is a frequent use case when working with DataFrames. Implicit copies can also lead to wrong results in some cases, for example Measurements.jl uses references to other measurements to keep track of correlations. It is also not clear how this should behave for recursively mutable objects: if it's just the default copy behavior, you would run into exactly the same issues when filling an array with arrays of arrays, you could use deepcopy, but that can be super slow and can easily blow up allocations for more complex objects.

I think we should just document the current behavior better. Once you have a basic understanding how memory and references work, the current behavior makes a lot of sense, so I don't think that needs to be changed. One thing we could do is add a copy keyword argument to fill though, so if someone wants the copying behavior, it's really easy to get.

JeffBezanson · 2021-06-14T17:24:25Z

Fully agree with Simeon. Calling copy internally is a hack. I think what you really want here is to evaluate the argument expression for each element, not just get a copy, e.g. consider something like fill(rand(), n). Then the copy behavior, if you want it, is an easy special case. The easiest way to do this now is with a comprehension, but the syntax is unfortunately very different from fill. Maybe a @fill(expr, ...) that does repeated evaluation?

simeonschaub · 2021-06-14T20:31:46Z

Maybe a @fill(expr, ...) that does repeated evaluation?

Does that really add something over just using an array comprehension though?

JeffBezanson · 2021-06-14T21:02:15Z

Not really, it's just shorter.

PaulSoderlind · 2021-06-15T15:10:05Z

I believe those discourse threads signal that the documentation could be improved. If x is an object reference, all elements will refer to the same object: is somewhat opaque. Adding the example (done in #35683) was a step in the right direction, but perhaps not enough.

In fact, the expression object reference does not seem to be widely used in the docs. Searching the pdf gives 4 hits: 2 on unsafe_pointer... and 2 on fill/fill!. Is there a better (simpler) terminology?

goretkin · 2021-06-15T15:19:25Z

Is it wrong to say if x is mutable (ismutable(x)) instead of if x is an object reference?
(yes, because of #30210 , but otherwise?)

PaulSoderlind · 2021-06-23T19:19:46Z

@goretkin

Could I ask you submit a PR - and then we see how this plays out?
Maybe we could try If x is mutable (ismutable(x)), for instance, an Array, then all elements will refer to the same object:?

simeonschaub · 2021-06-23T19:50:25Z

ismutable(x) here is not quite correct, since for example:

julia> ismutable(1 => Ref(2))
false

while this will still run into the same issues as mentioned above.

PaulSoderlind · 2021-06-23T19:58:56Z

Thanks. I (and several others, it seems) am still struggling with If x is an object reference, all elements will refer to the same object. I guess just changing to If x is an object reference (for instance, an array), all elements will refer to the same object would be a step forward, but maybe you have a better idea?

mbauman · 2021-06-23T20:06:08Z

There's no if. All elements always are the same object. The only time it's observable, though, is if there's something mutable.

CameronBieganek · 2021-06-23T20:09:03Z

I concede that there isn't really a sensible way to do this. We would need to use either a macro or a function myfill whose first argument is a function (the name of the function would have to be different from fill, since the first argument of fill is generic for the existing method definitions, unless we want to use the f::Union{Function, Type} hack). At that point it makes more sense to just use a comprehension or a generator.

I'm not too invested in this anyways---I just noticed that this question came up a lot on Discourse.

CameronBieganek · 2021-06-23T20:12:29Z

On second thought, could we branch on isbits? If an object is not isbits, then we do a deepcopy?

julia> myismutable = !isbits
#76 (generic function with 1 method)

julia> myismutable([1, 2])
true

julia> myismutable(1 => Ref(2))
true

julia> struct A
           x::Vector{Int}
       end

julia> myismutable(A([1, 2]))
true

simeonschaub · 2021-06-23T20:14:09Z

That wouldn't make any difference, since deepcopy is just a no-op for isbits objects.

CameronBieganek · 2021-06-23T20:17:04Z

Yeah, I just noticed that deepcopy has

isbitstype(typeof(x)) && return x

for its first line.

That wouldn't make any difference, since deepcopy is just a no-op for isbits objects.

But that's fine. We want it to be a no-op for isbits objects! The point is that its not a no-op if the object is not isbits.

simeonschaub · 2021-06-23T20:19:48Z

That wouldn't address any of the issues raised by Jeff and myself above though.

goretkin · 2021-06-23T20:22:38Z

ismutable(x) here is not quite correct, since for example:
julia> ismutable(1 => Ref(2))
false
while this will still run into the same issues as mentioned above.

@simeonschaub , right, good. Is 1 => Ref(2) a so-called "object reference"? Is there a programmatic to check this property, whatever we want to call it? A deepismutable, if you will. A non-executable definition that I suspect is nonetheless accurate, is that isequal(a, b) implies a === b

CameronBieganek · 2021-06-23T20:25:43Z

There's clearly some demand for the copying behavior. In order to keep the current fill behavior available, the new behavior could be a different function, like copyfill or fillcopy, or it could be available via a copy keyword argument as you suggested.

Though I don't have any data to back up this hypothesis, I would guess that most people who use fill([0, 0], 2) are looking for the copying behavior rather than the current behavior.

simeonschaub · 2021-06-23T20:30:15Z

Is 1 => Ref(2) a so-called "object reference"?

That's a good question. I don't think the current wording is very accurate, it should probably say "object contains any references" rather than "object is a reference". I also agree with Matt that "all elements will refer to the same object" is not really different between objects with and without references, we should clarify that the only difference is that this is directly observable with references, while you wouldn't be able to tell for immutable objects.

goretkin · 2021-06-23T20:33:31Z

the new behavior could be a different function, like copyfill or fillcopy

My inclination (as always) in these situations is to introduce a new type name and rely on multiple dispatch, as opposed to introducing a new function name. The call would then be something like fill(Copier([0, 0]), 2). The name Copier is just an example. Ideally this type would be beneficial in other interfaces too.

The benefit over fill(Copier(... over fillcopy is "orthogonality". A method can be written in terms of fill(x, ...), and if x comes from the caller of that method, then both options are possible, without making a commitment between fill and fillcopy.

A downside has something to do with a proliferation of wrapper types and not a lot of great idioms for dealing with them, especially when there are multiple wrappers.
[EDIT] another downside is that now you can't fill an array with Copier objects without introducing some sort of escape mechanism. I think an array-equivalent of ntuple that takes a function of an index is the way to go.

CameronBieganek · 2021-06-23T20:34:32Z

@goretkin

A deepismutable, if you will.

As far as I can tell, you could define deepismutable(x) = !isbits(x). But maybe I'm missing some corner cases.

CameronBieganek · 2021-06-23T20:39:24Z

I also think that just updating the documentation a bit would be fine. We could add a sentence that suggests using a comprehension if you want to make copies of a non-isbits object. Something like this:

Add this to `fill` docstring:

If you want to fill an array with copies of a non-isbits object, use map or a comprehension:

julia> struct A
           x::Vector{Int}
       end

julia> map(_ -> A([1, 2]), 1:3)
3-element Vector{A}:
 A([1, 2])
 A([1, 2])
 A([1, 2])

julia> [A([1, 2]) for _ in 1:3]
3-element Vector{A}:
 A([1, 2])
 A([1, 2])
 A([1, 2])

goretkin · 2021-06-23T20:40:05Z

@goretkin

A deepismutable, if you will.

As far as I can tell, you could define deepismutable(x) = !isbits(x). But maybe I'm missing some corner cases.

That seems right! I already mentioned strings earlier. I am pretty confused about strings with respect to mutability. (Because String is "semantically" immutable, but for implementation reasons are "internally" mutable, for now). I wonder if that's a corner case here too:

julia> isbits("hey")
false

though I don't think there is any non-unsafe way to wreak havoc if a user does e.g. fill("hey", 2)

Ref #41209

mbauman · 2021-06-23T21:19:30Z

See if the my suggested changes to fill in #41340 are any easier to understand. This is a very common conceptual misunderstanding and is fundamental to understanding Julia, but can be difficult to succinctly and accurately describe. That means, though that it's not just fill. Understand this, conquer ~~the world~~ Julia.

CameronBieganek · 2021-06-26T15:19:31Z

This is a very common conceptual misunderstanding and is fundamental to understanding Julia, but can be difficult to succinctly and accurately describe. That means, though that it's not just fill. Understand this, conquer ~~the world~~ Julia.

To be clear, I already understood that. I just noticed that the fill question came up a lot on Discourse. And filling an array with references to the same value did not seem very useful to me.

Ref #41209

BioTurboNick · 2021-12-14T02:53:59Z

I'll just put in a vote here for making a copying fill (just the top level, not a deep copy) the default in 2.0 and a non-copying fill the "advanced" version. Whether that's a kwarg switch or another function, could go either way.

Though if someone can show there's a really common case that would get slowed down and/or become annoying because they keep having to add a new argument to avoid copying, that would be a good reason to not do this.

KristofferC · 2021-12-14T10:51:37Z

This seems simple to me... One just introduces a new function, say fillf, with the desired semantics. It takes a function as the first argument and calls that function for every element.

fillf(rand, 5, 5)
fillf(() -> zero(2,2), 3, 3)

etc. And (which has been mentioned) there could be a @fill(expr, ...) that transforms into fillf(() -> expr, ...).

goretkin · 2021-12-14T17:01:57Z

e.g.

fillf(f, args...) = [f() for k in keys(fill(nothing, args...))]

It seems like you might as well pass the indices to f, analogous to ntuple, but then you have to decide how to pass it (a CartesianIndex is inconvenient) and if you want to ignore it, you'll have to include a dummy argument (like _) in the anonymous function.

mbauman · 2021-12-14T17:50:00Z

Though if someone can show there's a really common case that would get slowed down and/or become annoying because they keep having to add a new argument to avoid copying, that would be a good reason to not do this.

Simeon listed a number of these in this comment above: #41209 (comment). It's not about performance or annoyance — it's about semantics and what you should expect from the language. fill(x, size...) just puts that x everywhere.

At its heart, I think this is a natural language/computer language mismatch. I'm hopeful that the new documentation in v1.8 will help some. Maybe there'd be a different English-language verb that would more clearly express this behavior, but I'm doubtful we could find anything dramatically clearer; in the natural world we typically can't put the same thing in multiple places at once.

I don't think a higher order function like fillf or macro like @fill would be helpful to the folks that stumble on this.

PaulSoderlind · 2021-12-14T21:21:59Z

at the new documentation in v1.8

yes, that will help. Much appreciated.

I don't think a higher order function like fillf or macro like @fill would be helpful

I disagree here. Some sort of convenience function for pre-allocating an array of arrays could be very helpful.

goretkin · 2021-12-15T01:05:45Z

I disagree here. Some sort of convenience function for pre-allocating an array of arrays could be very helpful.

I think @mbauman is saying that if a naïve user knows that they can do fill(0, ...) to get an array of 0s, then they will also do fill([0], ...) to get an array of arrays. The existence of fillf or @fill won't automatically get them to use it if they don't already know better, which they don't, since they are stumbling upon this issue.

It could still be worth having a function like fillf to give an alternative spelling for an array comprehension. But really that is the job of map.

PaulSoderlind · 2021-12-15T12:09:48Z

OK, fair enough. I think there are two points of having a fillf: (1) it is convenient; (2) it highlights the potential pitfall of fill: the documentation of the latter could read like "To create an array of many independent inner arrays, use a comprehension or fillf()" .

StefanKarpinski · 2021-12-15T14:49:20Z

The main advantage I can see for @fill is that if you have a fill([], n) or whatever and you realize it's wrong, you can fix it very simply by changing it to @fill([], n) which is appealing. I still strongly feel that changing fill to copy its argument is a bad idea that just makes it harder for people to understand and use the language. With copying they do fill([], n) and it works, making them think that it has the same semantics as [[] for _ = 1:n]. Later they do fill(rand(), n) and find that it produces an array of n copies of the same random number. The copying behavior has misled them about the basic semantics of the language and caused confusion rather than really helping. The sooner someone understands that a function argument is only evaluated once, the better off they are.

mbauman · 2021-12-15T15:18:55Z

Yeah, my reasoning behind saying that fillf and @fill are unhelpful is that:

Higher order functions and macros are more advanced techniques, whereas this is often stumbled upon by language learners.
They're fiddly variations on the same name... and the name itself doesn't actually give any indication about why they're different to someone who doesn't understand what fill itself is doing. In the documentation for fill, we can say "if you don't want this behavior, use @fill", but I'd rather say "if you don't want this behavior use a comprehension." The former is magic, the latter is generalizable.

I'm not averse to a more succinct comprehension-like syntax, but I'd want something whose name is a little more clearly distinct from fill... because the behavior is quite distinct.

StefanKarpinski · 2022-02-20T20:57:46Z

I'm not that worried about beginners—they're going to be confused at some point and need to learn this either way and as you say, fill versus @fill isn't that enlightening unless you already understand this. The reason I think @fill would be good is for users who understand this to be able to easily switch between the behaviors. Yes, you can use a comprehension, but if you have already written fill([], n) and realize you need different arrays, changing it to @fill([], n) is much easier, and shorter than writing [[] for _ = 1:n].

CameronBieganek · 2022-04-02T00:09:56Z

Given the change in the name of this issue, can we remove the "breaking" label?

mbauman · 2024-07-17T21:09:25Z

I wonder if VSCode/Lint could help flag places where any value in a fill is mutated? It's probably always a mistake to do so!

oscardssmith added the breaking This change will break code label Jun 12, 2021

mbauman added a commit that referenced this issue Jun 23, 2021

Improve documentation for fill

373763b

Ref #41209

mbauman mentioned this issue Jun 23, 2021

Improve documentation for fill #41340

Merged

vtjnash pushed a commit that referenced this issue Jul 19, 2021

Improve documentation for fill (#41340)

e4a6f1d

Ref #41209

Seelengrab mentioned this issue Oct 16, 2021

Error when using Float64: ERROR: UndefRefError: access to undefined reference FluxML/NNlib.jl#490

Closed

StefanKarpinski changed the title ~~Change fill([0, 0], 2) behavior for 2.0~~ implement @fill([0, 0], 2) Feb 20, 2022

oscardssmith removed the breaking This change will break code label Apr 2, 2022

simeonschaub mentioned this issue Aug 10, 2022

Unclear that Vector{Vector}s behave differently when initialized differently #46305

Closed

brenhinkeller added breaking This change will break code feature Indicates new feature / enhancement requests and removed breaking This change will break code labels Nov 19, 2022

mbauman mentioned this issue Jul 17, 2024

[WONTFIX] fill() is a footgun #55158

Closed

implement @fill([0, 0], 2) #41209

implement @fill([0, 0], 2) #41209

Comments

CameronBieganek commented Jun 12, 2021 • edited by StefanKarpinski Loading

simeonschaub commented Jun 13, 2021 • edited Loading

JeffBezanson commented Jun 14, 2021

simeonschaub commented Jun 14, 2021 • edited Loading

JeffBezanson commented Jun 14, 2021

PaulSoderlind commented Jun 15, 2021

goretkin commented Jun 15, 2021 • edited Loading

PaulSoderlind commented Jun 23, 2021

simeonschaub commented Jun 23, 2021 • edited Loading

PaulSoderlind commented Jun 23, 2021

mbauman commented Jun 23, 2021 • edited Loading

CameronBieganek commented Jun 23, 2021

CameronBieganek commented Jun 23, 2021

simeonschaub commented Jun 23, 2021

CameronBieganek commented Jun 23, 2021 • edited Loading

simeonschaub commented Jun 23, 2021

goretkin commented Jun 23, 2021 • edited Loading

CameronBieganek commented Jun 23, 2021

simeonschaub commented Jun 23, 2021 • edited Loading

goretkin commented Jun 23, 2021 • edited Loading

CameronBieganek commented Jun 23, 2021

CameronBieganek commented Jun 23, 2021 • edited Loading

Add this to fill docstring:

goretkin commented Jun 23, 2021 • edited Loading

mbauman commented Jun 23, 2021 • edited Loading

CameronBieganek commented Jun 26, 2021

BioTurboNick commented Dec 14, 2021

KristofferC commented Dec 14, 2021 • edited Loading

goretkin commented Dec 14, 2021

mbauman commented Dec 14, 2021 • edited Loading

PaulSoderlind commented Dec 14, 2021 • edited Loading

goretkin commented Dec 15, 2021 • edited Loading

PaulSoderlind commented Dec 15, 2021

StefanKarpinski commented Dec 15, 2021

mbauman commented Dec 15, 2021 • edited Loading

StefanKarpinski commented Feb 20, 2022 • edited Loading

CameronBieganek commented Apr 2, 2022

mbauman commented Jul 17, 2024

implement `@fill([0, 0], 2)` #41209

implement `@fill([0, 0], 2)` #41209

CameronBieganek commented Jun 12, 2021 •

edited by StefanKarpinski

Loading

simeonschaub commented Jun 13, 2021 •

edited

Loading

simeonschaub commented Jun 14, 2021 •

edited

Loading

goretkin commented Jun 15, 2021 •

edited

Loading

simeonschaub commented Jun 23, 2021 •

edited

Loading

mbauman commented Jun 23, 2021 •

edited

Loading

CameronBieganek commented Jun 23, 2021 •

edited

Loading

goretkin commented Jun 23, 2021 •

edited

Loading

simeonschaub commented Jun 23, 2021 •

edited

Loading

goretkin commented Jun 23, 2021 •

edited

Loading

CameronBieganek commented Jun 23, 2021 •

edited

Loading

Add this to `fill` docstring:

goretkin commented Jun 23, 2021 •

edited

Loading

mbauman commented Jun 23, 2021 •

edited

Loading

KristofferC commented Dec 14, 2021 •

edited

Loading

mbauman commented Dec 14, 2021 •

edited

Loading

PaulSoderlind commented Dec 14, 2021 •

edited

Loading

goretkin commented Dec 15, 2021 •

edited

Loading

mbauman commented Dec 15, 2021 •

edited

Loading

StefanKarpinski commented Feb 20, 2022 •

edited

Loading