Is there interest in `ggml_reduce` or `ggml_add_ext`? #868

balisujohn · 2024-06-22T18:02:58Z

I need to sum-reduce a 4D tensor along one dimension. I can either add ggml_add_ext, which would let you specify a dimension to reduce over, or add a new op, ggml_reduce, which takes a dimension and an op as arguments (maybe +, /, -, * to start) and reduces along that dimension with that op. Which of these would be preferable?

In the meantime, I can implement this in tortoise.cpp with view slices and a for loop, but I think a built-in reduction op would probably be faster.
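For reference, here is a minimal plain-C sketch of what the slice-and-loop workaround computes. It makes no ggml calls; `sum_dim_by_slices` is a hypothetical name, and the tensor is assumed to be a contiguous float buffer with `ne[0]` as the fastest-varying dimension, as in ggml. Each step of the outer loop adds one slice along `dim` into the accumulator, which is roughly what a loop over views plus an add would do.

```c
#include <assert.h>
#include <stddef.h>
#include <string.h>

// Hypothetical helper, not part of ggml.
// Sums a contiguous 4D float tensor along dimension `dim` (0..3).
// ne[0] is the fastest-varying dimension. dst must hold
// (total element count / ne[dim]) floats and receives the result
// laid out as the same shape with ne[dim] collapsed to 1.
static void sum_dim_by_slices(const float *src, const size_t ne[4], int dim, float *dst) {
    assert(dim >= 0 && dim < 4);

    size_t inner = 1; // elements spanned by the dimensions below `dim`
    size_t outer = 1; // elements spanned by the dimensions above `dim`
    for (int i = 0; i < dim; ++i)     inner *= ne[i];
    for (int i = dim + 1; i < 4; ++i) outer *= ne[i];
    const size_t n = ne[dim];

    memset(dst, 0, inner * outer * sizeof(float));

    // One "slice" per value of k: accumulate it into dst.
    for (size_t k = 0; k < n; ++k) {
        for (size_t o = 0; o < outer; ++o) {
            for (size_t i = 0; i < inner; ++i) {
                dst[o * inner + i] += src[(o * n + k) * inner + i];
            }
        }
    }
}
```

A reference like this can be useful for checking any graph-based version against known values.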

balisujohn · 2024-06-22T18:03:50Z

It occurs to me that ggml_add isn't a unary op, so I'd lean towards the ggml_reduce idea.

ggerganov · 2024-06-25T13:13:46Z

There is already ggml_sum_rows(). I think a ggml_sum_dim() should be possible to implement via ggml_permute() + ggml_sum_rows() + ggml_reshape(), without having to write new kernels.
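To illustrate why the permute + sum_rows + reshape route needs no new kernel, here is a small plain-C emulation of the idea. It does not call ggml: `view4`, `view_contiguous`, `move_dim_to_front`, `sum_rows` and `sum_dim` are hypothetical stand-ins, and strides are counted in elements rather than bytes. The "permute" only rearranges shape and stride metadata, the "sum rows" step reduces along dimension 0, and the contiguous output can then be reinterpreted (the "reshape") as the original shape with the reduced dimension set to 1. Whether a real ggml version also needs a ggml_cont() after the permute is something to verify against the backend in use.

```c
#include <assert.h>
#include <stddef.h>

// Hypothetical stand-in for a strided 4D tensor view (not a ggml type).
typedef struct {
    const float *data;
    size_t ne[4]; // number of elements per dimension
    size_t nb[4]; // stride per dimension, in elements (ggml uses bytes)
} view4;

// Describe a contiguous buffer where ne[0] varies fastest.
static view4 view_contiguous(const float *data, const size_t ne[4]) {
    view4 v;
    v.data = data;
    size_t stride = 1;
    for (int i = 0; i < 4; ++i) {
        v.ne[i] = ne[i];
        v.nb[i] = stride;
        stride *= ne[i];
    }
    return v;
}

// "Permute": move dimension `dim` to position 0 and keep the other
// dimensions in their original order. Only metadata changes.
static view4 move_dim_to_front(view4 v, int dim) {
    assert(dim >= 0 && dim < 4);
    view4 r = v; // dimensions above `dim` stay where they are
    r.ne[0] = v.ne[dim];
    r.nb[0] = v.nb[dim];
    for (int i = 0; i < dim; ++i) {
        r.ne[i + 1] = v.ne[i];
        r.nb[i + 1] = v.nb[i];
    }
    return r;
}

// "Sum rows": reduce along dimension 0 and write a contiguous
// result of shape [1, ne[1], ne[2], ne[3]].
static void sum_rows(view4 v, float *dst) {
    size_t out = 0;
    for (size_t i3 = 0; i3 < v.ne[3]; ++i3)
    for (size_t i2 = 0; i2 < v.ne[2]; ++i2)
    for (size_t i1 = 0; i1 < v.ne[1]; ++i1) {
        float acc = 0.0f;
        for (size_t i0 = 0; i0 < v.ne[0]; ++i0) {
            acc += v.data[i0 * v.nb[0] + i1 * v.nb[1] + i2 * v.nb[2] + i3 * v.nb[3]];
        }
        dst[out++] = acc;
    }
}

// Sum along `dim`. dst is contiguous and can be read as the original
// shape with ne[dim] == 1, which is the "reshape" step.
static void sum_dim(const float *src, const size_t ne[4], int dim, float *dst) {
    sum_rows(move_dim_to_front(view_contiguous(src, ne), dim), dst);
}
```

Moving the reduced dimension to the front, rather than swapping it with dimension 0, keeps the remaining dimensions in order, so the final step is a pure reinterpretation of the output buffer.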
