Build function efficient multiple returns #46

raphaelchinchilla · 2020-11-06T05:51:05Z

It is often the case (for instance in optimization) that one needs to calculate several functions, such as the gradient and the Hessian, that share part of their calculations. Is there a way in ModelingToolkit to use symbolic computation and build_function to generate a function that acts something like the following?

function gradhess!(grad, hess)
    aux = shared_calculations()  # placeholder: work common to both outputs
    if grad !== nothing
        grad .= compute_grad(aux)
    end
    if hess !== nothing
        hess .= compute_hess(aux)
    end
end


ChrisRackauckas · 2020-11-06T06:13:30Z

If you make an array of arrays of expressions, it'll output that array of arrays. I think we might want to try and let a tuple of expressions do that. @shashi what do you think on this one?

I think we'd just generate the full expressions and let CSE reduce it.

raphaelchinchilla · 2020-11-06T16:14:38Z

Thanks Chris,

But would just returning an array of arrays be enough to avoid repeating computation, and to compute only the gradient, for instance, when the Hessian is not needed?

To be clear, I want to know whether there is a way to symbolically and automatically generate something like what they mention in the Optim.jl documentation:

function fg!(F,G,x)
  # do common computations here
  # ...
  if G != nothing
    # code to compute gradient here
    # writing the result to the vector G
  end
  if F != nothing
    # value = ... code to compute objective function
    return value
  end
end

ChrisRackauckas · 2020-11-07T21:54:56Z

Oh a doubly-mutating function? Yeah, you could make a function _fg!(Y,x) where Y = [F,G] and then mutate that array of arrays.

raphaelchinchilla · 2020-12-03T03:38:19Z

Hi Chris,

Sorry for taking so long to answer; I wanted to have an MWE that reflected my questions. The matter is not exactly to have a doubly-mutating function. Consider the following example, inspired by the one in the documentation of DiffResults.jl:

using ModelingToolkit
@variables x[1:2]
f = prod(tan, x) * sum(sqrt, x)
g=ModelingToolkit.gradient(f,x)

One can see in the result below that many operations used in f can be reused in g:

f=((tan(x₁) * tan(x₂)) * (sqrt(x₁) + sqrt(x₂)))

and 

g=[(tan(x₂) * (sec(x₁) ^ 2) * (sqrt(x₁) + sqrt(x₂))) + (0.5 * tan(x₁) * tan(x₂) * (sqrt(x₁) ^ -1));
    (tan(x₁) * (sqrt(x₁) + sqrt(x₂)) * (sec(x₂) ^ 2)) + (0.5 * tan(x₁) * tan(x₂) * (sqrt(x₂) ^ -1))]

Suppose one is running a gradient descent algorithm with a line search. Given an input vector x, one will need functions to
i) compute only f
ii) compute only g
iii) compute both f and g
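
For concreteness, here is a hand-written sketch of what a single function covering all three cases could look like for this particular f. It is only an illustration of the desired shape, not something build_function generates: the name fg! and the argument convention are borrowed from the Optim.jl pattern quoted earlier, and the shared subexpressions were picked out by hand.

```julia
# Hand-written sketch for f(x) = tan(x1) * tan(x2) * (sqrt(x1) + sqrt(x2)).
# `fg!` is a hypothetical name following the Optim.jl convention:
# pass `nothing` for an output that is not wanted.
function fg!(F, G, x)
    # Shared subexpressions, computed once per call.
    t1, t2 = tan(x[1]), tan(x[2])
    s1, s2 = sqrt(x[1]), sqrt(x[2])
    p = t1 * t2      # product of tangents
    s = s1 + s2      # sum of square roots
    if G !== nothing
        # Gradient written into G, reusing p, s, t1, t2, s1, s2.
        G[1] = t2 * sec(x[1])^2 * s + 0.5 * p / s1
        G[2] = t1 * sec(x[2])^2 * s + 0.5 * p / s2
    end
    if F !== nothing
        return p * s
    end
    return nothing
end

x = [1.0, 2.0]
G = zeros(2)
fval  = fg!(true, G, x)        # iii) value and gradient
fonly = fg!(true, nothing, x)  # i)   value only
fg!(nothing, G, x)             # ii)  gradient only
```

The tangents and square roots are evaluated once regardless of which outputs are requested, which is the behavior the questions below ask build_function to produce automatically.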

Question 1: Is there a better way to do this (whether in-place or not) than the following?

builted_f=build_function(f,x)[1]
builted_g=build_function(g,x)[1]
builted_fg=build_function([f;g],x)[1]

Question 2: It seems to me when I run for instance

julia_fg=eval(builted_fg)
@code_llvm julia_fg([1.,2.])

that LLVM does not realize that there are multiple operations that could be reused. I might be wrong about that because I am not very familiar with LLVM. If indeed LLVM does not reuse the operations, is there a functionality that could be implemented on build_function itself that would allow for that?

Question 3: (This is actually probably a bug) When building the function builted_fg using [f;g], it returns a vector of size 3, which is not a problem for a simple example, but for more complex examples keeping track of the indices might be challenging. If I try to run build_function([(f,g),x)[1] I receive an error. If I try build_function([f , g],x)[1] (i.e. using " , " instead of " ; ") I do not get an error, but if I go on to run

julia_fg=eval(builted_fg)
julia_fg([1.,2.])

the output is

2-element Array{Any,1}:
 -8.21556383191817
   Expr[:((+)((*)((tan)(x₂), (^)((sec)(x₁), 2), (+)((sqrt)(x₁), (sqrt)(x₂))), (*)(0.5, (tan)(x₁), (tan)(x₂), (^)((inv)((sqrt)(x₁)), 1)))), :((+)((*)((tan)(x₁), (+)((sqrt)(x₁), (sqrt)(x₂)), (^)((sec)(x₂), 2)), (*)(0.5, (tan)(x₁), (tan)(x₂), (^)((inv)((sqrt)(x₂)), 1))))]

which seems like a bug, or I might be doing something wrong.

ChrisRackauckas · 2020-12-03T10:09:14Z

For Q1, we could in theory allow thunks.

For Q2, I would've thought LLVM would CSE it. @shashi @YingboMa comments on that?

For Q3, yes tuple outputs would solve this but it's not implemented yet.

shashi · 2020-12-05T00:04:46Z

We can do Q2 symbolically instead of relying on LLVM.

YingboMa · 2020-12-06T06:39:02Z

Also, doing CSE is essential for producing readable code after reduction, too.
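
As a rough illustration of what doing CSE symbolically means, the sketch below hoists every call subexpression of a plain Julia Expr into a temporary and reuses the temporary whenever the same subexpression shows up again. It uses Base only and is not how Symbolics or SymbolicUtils implement it; the function name cse is made up for this example, and it assumes every call is pure (no side effects, same inputs give same outputs).

```julia
# Minimal CSE sketch on plain Julia `Expr` trees (Base only, no packages).
# `cse` is a hypothetical helper name; all calls are assumed to be pure.
function cse(ex)
    seen = Dict{Any,Symbol}()   # rewritten call expression => temporary name
    assignments = Expr[]        # `tmp = call` statements in dependency order
    function visit(e)
        e isa Expr || return e  # symbols and literals are left untouched
        if e.head == :call
            # Rewrite the arguments first so nested repeats are found too.
            rewritten = Expr(:call, e.args[1], map(visit, e.args[2:end])...)
            return get!(seen, rewritten) do
                tmp = Symbol(:tmp, length(seen) + 1)
                push!(assignments, :($tmp = $rewritten))
                tmp
            end
        else
            # Containers such as `[a, b]` or `(a, b)`: recurse into the elements.
            return Expr(e.head, map(visit, e.args)...)
        end
    end
    result = visit(ex)
    return Expr(:block, assignments..., result)
end

# Two outputs that share tan(a), tan(b), their product, and sqrt(a).
ex = :([(tan(a) * tan(b)) * (sqrt(a) + sqrt(b)), (tan(a) * tan(b)) / sqrt(a)])
optimized = cse(ex)

# Turn both versions into callable functions for comparison.
f_plain = eval(:((a, b) -> $ex))
f_cse   = eval(:((a, b) -> $optimized))
```

Printing optimized shows each shared call assigned once and then referenced by name, so the generated code no longer depends on the compiler rediscovering the repeats.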

raphaelchinchilla · 2023-08-06T18:04:48Z

> For Q3, yes tuple outputs would solve this but it's not implemented yet.

I was looking into how to implement tuple returns in build_function. I discovered that if we include the dispatch

function toexpr(O::Tuple, st)
    :(($(toexpr.(O, (st,))...),))
end

in SymbolicUtils.Code, the functionality works.
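
To show what the splatting interpolation in that method produces, here is a Base-only sketch that builds a tuple expression from a tuple of subexpressions. It does not involve SymbolicUtils at all: parts stands in for the already-converted elements that toexpr.(O, (st,)) would return, and make_pair is a name made up for the example.

```julia
# Base-only illustration of the `:(($(...)...,))` idiom; no SymbolicUtils involved.
parts = (:(a + b), :(a * b))     # stand-ins for already-converted subexpressions
tuple_expr = :(($(parts...),))   # builds the expression `(a + b, a * b)`

# Wrap it in a function to check that evaluating it really yields a Tuple.
make_pair = eval(:((a, b) -> $tuple_expr))
```

Here tuple_expr is an Expr with head :tuple, so a function generated from it returns an actual Julia Tuple rather than a vector of outputs.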

I was about to do a PR to SymbolicUtils including this dispatch and a test. However, at the last minute, I realized that maybe this is the whole point of the type SymbolicUtils.Code.MakeTuple. Can someone confirm this for me? In that case I will figure out what needs to be changed in build_function to make this work.

shashi · 2023-08-20T10:51:20Z

Yeah, I think I didn't really have toexpr on Tuple in mind, just like toexpr is not defined on a vector. toexpr is defined on term-like types. We also have MakeArray and SetArray. I'm more conservative about adding methods because it changes the usefulness of toexpr.

shashi · 2023-08-20T11:00:32Z

You can definitely define something like

_toexpr(x) = toexpr(x); _toexpr(x::Tuple) = MakeTuple(x)

and use that in build_function.

raphaelchinchilla · 2023-08-20T13:58:17Z

Yeah, that could have been an idea. Why did you decide to merge the PR?

ChrisRackauckas transferred this issue from SciML/ModelingToolkit.jl Feb 26, 2021

raphaelchinchilla mentioned this issue Aug 19, 2023

Included dispatch to toexpr that creates Tuples correctly JuliaSymbolics/SymbolicUtils.jl#542

Merged

raphaelchinchilla closed this as completed Aug 21, 2023
