+Add cuberoot function #538

Hallberg-NOAA · 2023-12-20T22:30:52Z

Added the new functions cuberoot() and intrinsic_functions_unit_tests() to the MOM_intrinsic_functions module, and call intrinsic_functions_unit_tests from unit_tests to confirm that this new function works as intended. Separately, cuberoot was tested by replacing expressions like A**(1./3.) with cuberoot(A) in MOM_energetic_PBL and verifying that the answers only change at roundoff, but that it can give bitwise identical results when the argument is scaled by an integer power of 8 and then unscaled by the corresponding integer power of 2, but that change will occur in a subsequent commit as it can change answers depending on an ANSWER_DATE flag. With this commit, cuberoot is not yet being used so all answers are bitwise identical, although there are new publicly visible routines.

codecov · 2023-12-20T22:34:03Z

Codecov Report

Attention: 7 lines in your changes are missing coverage. Please review.

Comparison is base (5137442) 37.23% compared to head (655f0d5) 37.20%.

Files	Patch %	Lines
src/framework/MOM_intrinsic_functions.F90	88.88%	3 Missing and 2 partials ⚠️
src/core/MOM_unit_tests.F90	0.00%	1 Missing and 1 partial ⚠️

Additional details and impacted files

@@             Coverage Diff              @@
##           dev/gfdl     #538      +/-   ##
============================================
- Coverage     37.23%   37.20%   -0.04%     
============================================
  Files           271      271              
  Lines         80170    80355     +185     
  Branches      14978    14985       +7     
============================================
+ Hits          29855    29894      +39     
- Misses        44761    44903     +142     
- Partials       5554     5558       +4

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

marshallward

I am seeing a small issue with the cuberoot function. If I evaluate cuberoot(0.125) then it gives me 1.0 rather than 0.5. Any other value that is a power of 0.125 (e.g. 2.**-6) shows the same problem.

I believe it is due to the elseif in this if-block:

    if (asx > 1.0) then
      asx = 0.125 * asx ; ex_3 = ex_3 + 1
    elseif (asx <= 0.125) then
      asx = 8.0 * asx
    endif

Values greater that 1.0 correct their exponent (ex_3 += 1) after scaling down, but values below 0.125 do not include this index correct after scaling up. It looks like the bug can be fixed with ex_3 = ex_3 - 1.

I looked at the behavior of the uncorrected asx function, scale(abs(x), -3 * (exponent(x) / 3)), and it appears to already be bounded between 0.125 and 4; only powers of 0.125 are evaluated to 0.125, so it's an extremely small number of values that would be affected by this missing term.

I considered removing the entire elseif block, but it seemed to introduce a minor roundoff that could not be removed by iteration. I don't know how important it is to reproduce the answer to machine precision, but something else to consider. (Are all values correct to machine precision?) (Edit: Nevermind, I had modified the 1e100/1e-100 convergence values. 0.125 converges to 0.5 without the elseif block.)

Hallberg-NOAA · 2024-01-04T14:24:31Z

Thank you for pointing out the bug with the (asx <= 0.125) case. I just fixed this in a force-push.

The Newton's method iterations do give solutions that are accurate to machine precision, but in some cases when we have arrived at the answer the last bit bounces back and forth around the exact (irrational) solution with added iterations. It is worth noting that the Newton's method convergence from above is monotonic (at least mathematically). If the first guess is an under-estimate, Newton's method will overshoot (a little) and then all subsequent iterations converge monotonically from above. So the case where the last bit is jumping around, it means that we are in a perpetual roundoff-induced two-step limit cycle.

Given this limiting behavior at effective convergence, we could choose to break the iterations when they are at the minimum of two successive solutions that are within roundoff of each other, but it would involve adding lines to store the previous value, and then to compare the solutions to make sure that they are not increasing, and this would add to the cost of the function without giving any meaningful increase in accuracy. Is it worth the (hopefully small) extra cost to avoid incorporating any arbitrary choices about the number of iterations to apply, even if it means that our version of the cube root would then always become an under-estimate of some irrational solutions?

ashao · 2024-01-05T17:32:34Z

What about checking the size of the Newton step as a stopping criterion if you're ok with sacrificing a little bit of extra accuracy in the solution? Something like:

if (ABS(update) < 2.0*10.0*EPSILON(root)) exit

(factor of 2.0 for the roundoff calculation and a factor of 10.0 to go one order of magnitude above roundoff).

marshallward · 2024-01-05T19:27:33Z

I have a suggestion to keep the value of asx bounded between 0.125 and 1. The current form has some odd behavior due to Fortran-isms related to integer division.

The current formula, asx = scale(abs(x), -3*(exponent(x) / 3)), looks something like this:

After which the if-blocks are required to bound the values above 1 to something within 0.125 and 1. (I did not look at the actual curve, but surely it will be some piecewise monstrosity.)

This is happening because Fortran integer division simply drops the decimal part of the number, rather than consistently rounding up or down. For example, 1.5 / 2 is zero, but -1.5 / 2 is also zero rather than -1. If we replace exponent(x) / 3 with ceiling(exponent(x) / 3.) and remove the if-blocks (which now do nothing), then we get a more consistent curve:

This new expression does not appear to change answers in your test.

Hallberg-NOAA · 2024-01-05T21:48:09Z

Thank you for all of this helpful feedback, @marshallward. I have incorporated all of this into the updated version of this PR, which is now algorithmically lean and thoroughly commented!

Hallberg-NOAA · 2024-01-05T23:39:23Z

@ashao, thank you for your suggestion with respect to checking the increment size, but in this case it turns out that we have another way to check for convergence and the extra iterations seem pretty cheap, so we can iterate to convergence and then take the better of two near-solutions if the iterations encounter a roundoff-level limit cycle.

marshallward

I've added what should be my final comments. It seems to work very well.

I have not checked whether it gives the same answers across compilers and hardware, but we should probably aspire to this at some point.

src/framework/MOM_intrinsic_functions.F90

Added the new functions cuberoot and intrinsic_functions_unit_tests to the MOM_intrinsic_functions module, and call intrinsic_functions_unit_tests from unit_tests to confirm that this new function works as intended. Separately, cuberoot was tested by replacing expressions like A**(1./3.) with cuberoot(A) in MOM_energetic_PBL and verifying that the answers only change at roundoff, but that it can give bitwise identical results when the argument is scaled by an integer power of 8 and then unscaled by the corresponding integer power of 2, but that change will occur in a subsequent commit as it can change answers depending on an ANSWER_DATE flag. With this commit, cuberoot is not yet being used so all answers are bitwise identical, although there are new publicly visible routines.

marshallward

Everything looks good!

marshallward · 2024-01-09T18:41:57Z

Gaea regression: https://gitlab.gfdl.noaa.gov/ogrp/MOM6/-/pipelines/21886 ✔️

Hallberg-NOAA added the enhancement New feature or request label Dec 20, 2023

Hallberg-NOAA force-pushed the add_cuberoot branch from 8c91c38 to c4ee310 Compare December 20, 2023 22:42

Hallberg-NOAA force-pushed the add_cuberoot branch from c4ee310 to a8c28e1 Compare December 30, 2023 04:44

marshallward requested changes Jan 3, 2024

View reviewed changes

Hallberg-NOAA force-pushed the add_cuberoot branch from a8c28e1 to bb71caa Compare January 4, 2024 14:15

Hallberg-NOAA force-pushed the add_cuberoot branch from bb71caa to 56432ee Compare January 5, 2024 19:27

Hallberg-NOAA force-pushed the add_cuberoot branch from 56432ee to dd740e8 Compare January 5, 2024 20:01

Hallberg-NOAA requested a review from marshallward January 6, 2024 16:02

marshallward requested changes Jan 8, 2024

View reviewed changes

Hallberg-NOAA force-pushed the add_cuberoot branch from dd740e8 to 751a2ce Compare January 8, 2024 23:45

marshallward reviewed Jan 9, 2024

View reviewed changes

src/framework/MOM_intrinsic_functions.F90 Show resolved Hide resolved

marshallward reviewed Jan 9, 2024

View reviewed changes

src/framework/MOM_intrinsic_functions.F90 Outdated Show resolved Hide resolved

Hallberg-NOAA force-pushed the add_cuberoot branch from 751a2ce to 655f0d5 Compare January 9, 2024 17:29

marshallward approved these changes Jan 9, 2024

View reviewed changes

marshallward merged commit a03bb95 into NOAA-GFDL:dev/gfdl Jan 9, 2024
12 checks passed

This was referenced May 16, 2024

GFDL to main (2024-05-16) mom-ocean/MOM6#1626

Closed

GFDL to main 2024-05-16 mom-ocean/MOM6#1627

Closed

marshallward mentioned this pull request May 31, 2024

GFDL to main (2024-05-31) mom-ocean/MOM6#1631

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

+Add cuberoot function #538

+Add cuberoot function #538

Hallberg-NOAA commented Dec 20, 2023

codecov bot commented Dec 20, 2023 •

edited

Loading

marshallward left a comment •

edited

Loading

Hallberg-NOAA commented Jan 4, 2024

ashao commented Jan 5, 2024

marshallward commented Jan 5, 2024

Hallberg-NOAA commented Jan 5, 2024

Hallberg-NOAA commented Jan 5, 2024

marshallward left a comment •

edited

Loading

marshallward left a comment

marshallward commented Jan 9, 2024

+Add cuberoot function #538

+Add cuberoot function #538

Conversation

Hallberg-NOAA commented Dec 20, 2023

codecov bot commented Dec 20, 2023 • edited Loading

Codecov Report

marshallward left a comment • edited Loading

Choose a reason for hiding this comment

Hallberg-NOAA commented Jan 4, 2024

ashao commented Jan 5, 2024

marshallward commented Jan 5, 2024

Hallberg-NOAA commented Jan 5, 2024

Hallberg-NOAA commented Jan 5, 2024

marshallward left a comment • edited Loading

Choose a reason for hiding this comment

marshallward left a comment

Choose a reason for hiding this comment

marshallward commented Jan 9, 2024

codecov bot commented Dec 20, 2023 •

edited

Loading

marshallward left a comment •

edited

Loading

marshallward left a comment •

edited

Loading