Convert remaining operators to use tensor pool #109

robertknight · 2024-04-23T08:14:19Z

#108 introduced a buffer pool to enable re-use of tensor buffers that are no longer needed, as outputs for future non-mutating operations. Only a subset of operators currently allocate from the pool. To get the full benefit the majority of ops need to be converted. Non-converted ops can be found by searching for operators which ignore their pool argument. This is indicated by having an underscore-prefixed _pool argument in their Operator::run implementation.

The text was updated successfully, but these errors were encountered:

robertknight · 2024-04-26T08:21:06Z

This is now done except for a few operators where the output size is data-dependent (NonZero, Range) and the RNN operators. For the RNN ops, I need to complete #95 first to avoid conflicts.

Even though I'm planning to significantly rework the internals of these operators soon [1], converting them to use the pool now allows #109 to be resolved. Also using the pool now only requires smaller code changes which won't lead to major merge conflicts. [1] #85

robertknight added the performance Issues that affect model inference or loading performance label Apr 23, 2024

This was referenced Apr 27, 2024

Convert RNN ops to use pool #126

Merged

Enable RTEN_USE_POOL by default #127

Merged

robertknight closed this as completed Apr 27, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Convert remaining operators to use tensor pool #109

Convert remaining operators to use tensor pool #109

robertknight commented Apr 23, 2024

robertknight commented Apr 26, 2024

Convert remaining operators to use tensor pool #109

Convert remaining operators to use tensor pool #109

Comments

robertknight commented Apr 23, 2024

robertknight commented Apr 26, 2024