master: merge HWRF version of saSAS with GFS version #423
Conversation
This looks fine to me. I might have made different choices about repeating statements in both branches of `if (hwrf)`, because the duplication makes it a little harder to see what actually differs when the hurricane flag is on, but I imagine it is fine from a performance point of view.
There is no difference in performance between the current code and your suggested changes. I will make some changes after the new baseline has been created and before the final round of regression tests against the new baseline is started.
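To make the pattern under discussion concrete, here is a minimal sketch of the branching style in question. This is not the actual scheme code; the subroutine, variable names, and coefficients are all invented for illustration:

```fortran
! Minimal sketch of the if (hwrf) pattern discussed above; names and
! coefficients are illustrative, not taken from the actual scheme.
subroutine entrainment_sketch(im, hwrf_samfdeep, clam, xlamue)
  implicit none
  integer, intent(in)  :: im              ! horizontal loop extent
  logical, intent(in)  :: hwrf_samfdeep   ! hurricane-physics flag
  real,    intent(in)  :: clam            ! entrainment coefficient
  real,    intent(out) :: xlamue(im)
  integer :: i

  ! The scalar logical is tested once, outside the loops, so the GFS and
  ! HWRF formulations each stay self-contained, at the cost of some
  ! statements being repeated in both branches.
  if (hwrf_samfdeep) then
    do i = 1, im
      xlamue(i) = clam * 2.0e-4   ! HWRF-style formulation (illustrative)
    end do
  else
    do i = 1, im
      xlamue(i) = clam * 1.0e-4   ! GFS-style formulation (illustrative)
    end do
  end if
end subroutine entrainment_sketch
```

Because the flag is a scalar tested outside the inner loops, the branch costs effectively nothing at run time, which is consistent with the timing results quoted in the comments that follow.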
@SMoorthi-emc @JongilHan66 for your information. This PR should be merged shortly. It combines the GFS and HWRF versions of saSAS in one scheme with as little overhead as possible. This work is required for HAFS and other Hurricane Supplemental projects. Please let me know if you have any comments - thanks!
Dom,
Thanks for the heads-up. I hope this merge is done efficiently, without any penalty (either in CPU or in memory) for the original code. As Jongil is the original code owner, he may have more comments.
Moorthi
There are no differences in memory at all. With respect to runtime, my timing comparisons did not show any differences between the original code and the new code. The few additional if(hwrf...) statements have no impact on the runtime that can be measured within the run-to-run variation.
Hi Dom,
As long as the merge does not affect the GFS run result, I am ok.
Jongil
I had added a description of this to the ufs-weather-model PR when I started working on this a few months back, please see ufs-community/ufs-weather-model#94 (comment). I'll summarize it again here for the sake of completeness:

We tested this code in DEBUG, REPRO and PROD mode. Results are b4b identical for all tests in DEBUG and REPRO mode. In PROD mode, for all except one of the regression tests (fv3_ccpp_stretched_nest), the results are bit-for-bit identical between the original and new code, too. For fv3_ccpp_stretched_nest, I spent quite some time debugging the code in PROD mode. I identified the section of code that leads to differences in the last significant bits of one or two internal variables. Adding debug print statements before/after those lines makes the differences go away. Alternatively, changing the flag hwrf_sas... from an input argument (whose value is not known at compile time, but which is .false. for all tests except the newly added hwrfsas test) to a parameter set to .false. also makes the differences go away. This is therefore clearly a compiler optimization issue.

When looking closer at the regression test fv3_ccpp_stretched_nest, I realized that this test was using non-uniform blocksizes (which we should never do in PROD mode, because the AVX2 compiler flags create peel loops that can easily lead to b4b differences; Jun agrees). I therefore changed the regression test setup so that block sizes are now uniform. This modification led to a change in the results for this particular test (for each version of the code, old and new) larger than the differences between the old and the new code described above. You can look at the differences between the 20200512 baseline (old version of the code, non-uniform block sizes) and the 20200603 baseline (new version of the code, uniform block sizes; i.e., the differences you see are the accumulation of changes from old to new code and from non-uniform to uniform block sizes) for the test fv3_ccpp_stretched_nest at /scratch1/BMC/gmtb/Dom.Heinzeller/FV3_RT/TMP_DIFF_fv3_stretched_nest_ccpp_20200512_20200603 on hera. These are 48h integrations. After 48h, the maximum differences you see for the nest in 2m temperature (fv3_history2d.nest02.tile7.nc) are between -4K and +5K. Typically, we consider differences a butterfly effect when the maximum differences are between -4K and +4K within 24h of integration; here we have -4K and +5K after 48h of integration.

See also the screenshot attached here (https://user-images.githubusercontent.com/8006981/83771075-1a24ef00-a63f-11ea-8426-74566fac6d0a.png). In order to see "something", I am plotting the range -1K to +1K. Most of the grid points have zero differences.

*Please note again that for all other ~50 regression tests, the results are bit-for-bit identical between the old and new code, and that the differences described above are 100% due to the compiler optimization.*
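For illustration, the compile-time experiment described above looks roughly like the following sketch. This is hypothetical, minimal code, not the actual scheme (the real flag lives in the scheme's argument list); it only shows why the two variants can produce different machine code:

```fortran
! Hypothetical sketch of the two variants compared while debugging.

! Variant 1: flag is a dummy argument; its value is unknown at compile
! time, so the optimizer must generate and schedule code for both paths.
subroutine calc_runtime_flag(hwrf_flag, x)
  implicit none
  logical, intent(in)    :: hwrf_flag
  real,    intent(inout) :: x
  if (hwrf_flag) then
    x = 0.5  * x
  else
    x = 0.25 * x
  end if
end subroutine calc_runtime_flag

! Variant 2: flag is a compile-time parameter set to .false.; the
! compiler folds the branch away and emits only the .false. path.
subroutine calc_compile_time_flag(x)
  implicit none
  logical, parameter     :: hwrf_flag = .false.
  real,    intent(inout) :: x
  if (hwrf_flag) then
    x = 0.5  * x
  else
    x = 0.25 * x
  end if
end subroutine calc_compile_time_flag
```

If the last-bit differences vanish with variant 2 even though the flag evaluates to .false. in both variants, the differences are attributable to compiler optimization of the surrounding code rather than to the science changes themselves.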
Thanks for the detailed information on the change. -Jongil
Changes in this PR:
* add a new regression test that uses the HWRF versions of saSAS deep and shallow convection
* update how the sutils module is loaded on hera and jet
* contains @MinsukJi-NOAA's unit testing branch
* contains @DusanJovic-NOAA's butterfly effect branch (changes in GFDL_atmos_cubed_sphere only)

Note: The changes in the ccpp-physics PR NCAR/ccpp-physics#423 lead to different results for two existing regression tests in PROD mode: fv3_ccpp_regional_c768 and fv3_ccpp_stretched_nest. In a separate comment below, I will describe in detail the investigation that allowed me to conclude that this change is acceptable.

Co-authored-by: MinsukJi-NOAA <[email protected]>
Co-authored-by: Dusan Jovic <[email protected]>
- read 2 months of MERRA2 data instead of 12 months; this decreases memory usage by a factor of 6
This PR combines the HWRF version of saSAS for both deep and shallow convection with the GFS version. The default behavior is to run the GFS version; the HWRF version is activated via the namelist flags hwrf_samfdeep and hwrf_samfshal (see the namelist sketch after the PR list below). All this work was done by @mzhangw.

Associated PRs:
#423
NOAA-EMC/fv3atm#93
ufs-community/ufs-weather-model#94
For regression testing, see ufs-community/ufs-weather-model#94.
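As a usage sketch, the two flags would be enabled through the physics namelist, along the following lines. This is an assumed excerpt: &gfs_physics_nml is where GFS physics switches normally live in the UFS, but check the new hwrfsas regression test's input.nml for the authoritative settings:

```fortran
! Assumed input.nml excerpt: switch on the HWRF formulation of saSAS
! deep and shallow convection; both flags default to .false. (GFS behavior).
&gfs_physics_nml
  hwrf_samfdeep = .true.
  hwrf_samfshal = .true.
/
```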