Canonicalize flow.tensor.clone to keep it more local to usage #5291
Labels
compiler/dialects
Relating to the IREE compiler dialects (flow, hal, vm)
performance ⚡
Performance/optimization related work across the compiler and runtime
Now that we are performing in-place operations there are
flow.tensor.clone
s appearing to preserve correctness. In cases of wide fan-out these cause some pathological behavior:Some simple code motion will help here: sinking the clones to immediately prior to their use will shorten the lifetime of these transient tensors.
The text was updated successfully, but these errors were encountered: