
Use buffer pool for prepacked matrices #128

Closed
robertknight opened this issue Apr 27, 2024 · 0 comments · Fixed by #130
Labels
performance Issues that affect model inference or loading performance

Comments

robertknight (Owner)
A buffer pool was added in #108 to enable re-use of buffers for operator outputs and temporary tensors. This is now used by almost all operators and for most large temporary buffers, with one exception: prepacked matrix weights created by GemmExecutor::{prepack_a, prepack_b} are not allocated from the pool yet. They should be.
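To illustrate the pattern the issue asks for, here is a minimal sketch of a pool that hands out reusable `Vec<f32>` buffers and reclaims them automatically when the holder is dropped; a prepacked matrix would then be stored in pool-allocated memory rather than a freshly allocated `Vec`. The names (`BufferPool`, `PooledBuffer`) are illustrative assumptions, not rten's actual API.

```rust
use std::cell::RefCell;

/// Illustrative pool of reusable f32 buffers (not rten's actual pool type).
struct BufferPool {
    free: RefCell<Vec<Vec<f32>>>,
}

impl BufferPool {
    fn new() -> Self {
        BufferPool {
            free: RefCell::new(Vec::new()),
        }
    }

    /// Take a zero-filled buffer of `len` elements, reusing a freed
    /// allocation when one is available.
    fn alloc(&self, len: usize) -> PooledBuffer<'_> {
        let mut buf = self.free.borrow_mut().pop().unwrap_or_default();
        buf.clear();
        buf.resize(len, 0.0);
        PooledBuffer {
            pool: self,
            buf: Some(buf),
        }
    }

    /// Number of buffers currently sitting in the free list.
    fn free_count(&self) -> usize {
        self.free.borrow().len()
    }
}

/// Guard that returns its buffer to the pool when dropped, so callers
/// holding prepacked weights don't need to release them explicitly.
struct PooledBuffer<'a> {
    pool: &'a BufferPool,
    buf: Option<Vec<f32>>,
}

impl Drop for PooledBuffer<'_> {
    fn drop(&mut self) {
        if let Some(buf) = self.buf.take() {
            self.pool.free.borrow_mut().push(buf);
        }
    }
}

fn main() {
    let pool = BufferPool::new();
    {
        // Hypothetical prepacked-matrix storage allocated from the pool.
        let packed = pool.alloc(16);
        assert_eq!(packed.buf.as_ref().unwrap().len(), 16);
    } // `packed` dropped here; its buffer returns to the pool.
    assert_eq!(pool.free_count(), 1);
    println!("free buffers after drop: {}", pool.free_count());
}
```

The drop-guard design is what makes the "auto-return" behavior in the fix commits possible: the buffer flows back into the pool as soon as the prepacked matrix goes out of scope, with no explicit free call at each call site.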

@robertknight robertknight added the performance Issues that affect model inference or loading performance label Apr 27, 2024
robertknight added a commit that referenced this issue Apr 27, 2024
Allocate and auto-return prepacked matrices to the shared pool. There is an
exception for the GRU op, because I'm expecting to revise its internals shortly
as part of #85.

Fixes #128