Dual leakage #31

MikeInnes · 2020-01-20T17:02:12Z

I can make the dual type leak out into a global variable:

julia> function f(x)
         global y = sin(x)
       end
f (generic function with 1 method)

julia> ForwardDiff2.D(f)(1)*1
0.5403023058681398

julia> y
(0.8414709848078965 + 0.5403023058681398ϵ₁)

Presumably this can happen any time a differentiated value escapes the program, e.g. when you have a global cache or similar.
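
To make the mechanism concrete without depending on ForwardDiff2 internals, here is a minimal sketch using a hand-rolled dual number. The `Dual` struct and the `derivative` helper are hypothetical stand-ins invented for this illustration, not ForwardDiff2's actual types or API; only `f` is taken from the report above.

```julia
# Hypothetical minimal dual number: a value plus one partial derivative.
# This is NOT ForwardDiff2's type, just enough to illustrate the leak.
struct Dual
    value::Float64
    partial::Float64
end

# Forward-mode rule for sin: d/dx sin(x) = cos(x).
Base.sin(d::Dual) = Dual(sin(d.value), cos(d.value) * d.partial)

# Hypothetical derivative operator: seed the input with partial 1.0
# and read the partial of the result.
derivative(f, x) = f(Dual(x, 1.0)).partial

function f(x)
    global y = sin(x)   # the Dual is stored in a global and outlives the call
end

dfdx = derivative(f, 1.0)   # derivative of sin at 1.0
leaked = y isa Dual         # true: the global still holds a Dual afterwards
```

Nothing in this sketch unwraps `y` once differentiation finishes, so any later code that reads `y` sees a dual number rather than a plain `Float64`.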

ChrisRackauckas · 2020-01-20T17:03:44Z

I assume that's going to be useful for defining mutable buffers.

MikeInnes · 2020-01-20T17:20:30Z

FWIW, this is not just academic since it can lead to bad gradients (effectively a form of perturbation confusion). For example:

julia> function f(x)
         global y = sin(x)
       end
f (generic function with 1 method)

julia> g(x) = x+y
g (generic function with 1 method)

julia> ForwardDiff2.D(f)(2)*1
-0.4161468365471424

julia> ForwardDiff2.D(g)(2)*1
0.5838531634528576

Compare Zygote:

julia> gradient(f, 2)[1]
-0.4161468365471424

julia> gradient(g, 2)[1]
1.0

This is a little contrived, but a function that updates and uses a global cache in some way could follow the same pattern and get silently incorrect gradients.
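
The value 0.5838531634528576 can be accounted for by dual arithmetic, assuming the leaked `y` carries the same perturbation $\epsilon$ that the second `D` call introduces (that assumption is what makes this look like perturbation confusion; it is an inference from the printed output, not something confirmed from ForwardDiff2's source):

$$
\begin{aligned}
y &= \sin 2 + (\cos 2)\,\epsilon \\
g(2 + \epsilon) &= (2 + \epsilon) + y = (2 + \sin 2) + (1 + \cos 2)\,\epsilon \\
1 + \cos 2 &= 1 - 0.4161468365471424 = 0.5838531634528576
\end{aligned}
$$

With `y` treated as a constant, as in the Zygote result, the $\epsilon$ coefficient is just $1$.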

MikeInnes mentioned this issue Jan 20, 2020

Hybrid overloading + SCT; not pure SCT? #24

Open

MikeInnes mentioned this issue Jan 24, 2020

Incorrect gradient + dual leakage #40

Open
