Reconsider min implementation as negative of max #904

ricardoV94 · 2024-07-08T15:06:42Z

Description

For historical reasons pt.min returns pt.neg(pt.max(pt.neg(x))). This seems to have been the case mostly to avoid having to define Min Op and its gradient. There is a later "uncanonicalize" phase that converts those expressions to min, suggesting we prefer them, but don't put it in place because of the lack of gradient.

We should reassess this as is adds some unwelcome complexity. The L_op implementation (cleaned up in #901) works directly for min. R_op, on the other hand uses Argmax (not sure why this is needed in the forward but not backward pass, CC @aseyboldt), so a similar Min.R_op may need to use Argmin. Similar to Min that's currently implemented as Argmax of negative of x, which is probably fine? We could also consider a direct Argmin but that is not as ubiquitous and hence less annoying.

The text was updated successfully, but these errors were encountered:

ricardoV94 added maintenance Op implementation labels Jul 8, 2024

ricardoV94 mentioned this issue Jul 8, 2024

Merge consecutive reduces #888

Open

9 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Reconsider min implementation as negative of max #904

Reconsider min implementation as negative of max #904

ricardoV94 commented Jul 8, 2024 •

edited

Loading

Reconsider min implementation as negative of max #904

Reconsider min implementation as negative of max #904

Comments

ricardoV94 commented Jul 8, 2024 • edited Loading

Description

ricardoV94 commented Jul 8, 2024 •

edited

Loading