Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Floating point exception in OM4-025 #1146

Closed
jiandewang opened this issue Jun 26, 2020 · 1 comment
Closed

Floating point exception in OM4-025 #1146

jiandewang opened this issue Jun 26, 2020 · 1 comment

Comments

@jiandewang
Copy link
Collaborator

jiandewang commented Jun 26, 2020

I am having Floating point exception when use MOM6-examples (dev/master eb2f3b4 Merge branch 'dev-master-candidate-2020-05-15' into dev/master) for OM4-025. Here are some details:
(1) two changes in MOM_input: dt-thermal=1800s, VERTEX_SHEAR=T (they are 7200s and F in original setting)
(2) this happens only in "DEBUG" mode, "REPRO" mode works fine.
(3) intel 18 compiler
(4) the crash happened in the beginning of run

Below is error message:
[Orion-03-64:400365:0:400365] Caught signal 8 (Floating point exception: floating-point invalid operation)
[Orion-03-61:246749:0:246749] Caught signal 8 (Floating point exception: floating-point invalid operation)
*** Error in /./MOM6': munmap_chunk(): invalid pointer: 0x000000001d426ab0 *** *** Error in ./MOM6': munmap_chunk(): invalid pointer: 0x000000001ded96b0 ***
======= Backtrace: =========
/lib64/libc.so.6(+0x7f754)[0x7f12e68a9754]
======= Backtrace: =========
/usr/lib64/libucs.so.0(+0x591a0)[0x7f11e56fe1a0]
..................................
......................................
MOM6 00000000058176DB Unknown Unknown Unknown
libpthread-2.17.s 00007F91427925D0 Unknown Unknown Unknown
MOM6 0000000001B31199 mom_diag_remap_mp 343 MOM_diag_remap.F90
MOM6 000000000116DCE7 mom_diag_mediator 3267 MOM_diag_mediator.F90
MOM6 0000000003CF4D62 mom_mp_step_mom_t 1174 MOM.F90
MOM6 0000000003CE4FF3 mom_mp_step_mom_ 774 MOM.F90
MOM6 0000000002B3B355 ocean_model_mod_m 579 ocean_model_MOM.F90
MOM6 0000000000DF91E7 MAIN__ 1082 coupler_main.F90
MOM6 0000000000410E22 Unknown Unknown Unknown
libc-2.17.so 00007F91423D8495 libc_start_main Unknown Unknown
MOM6 0000000000410D29 Unknown Unknown Unknown
forrtl: error (78): process killed (SIGTERM)
Image PC Routine Line Source
MOM6 00000000058176DB Unknown Unknown Unknown
libpthread-2.17.s 00007FC21AEA15D0 Unknown Unknown Unknown
MOM6 0000000003796060 mom_kappa_shear_m 482 MOM_kappa_shear.F90
MOM6 000000000313D0CC mom_set_diffusivi 354 MOM_set_diffusivity.F90
MOM6 0000000001FB6DA7 mom_diabatic_driv 604 MOM_diabatic_driver.F90
MOM6 0000000001FABFAB mom_diabatic_driv 372 MOM_diabatic_driver.F90
MOM6 0000000003CF7EC3 mom_mp_step_mom_t 1268 MOM.F90
MOM6 0000000003CE5BF2 mom_mp_step_mom
801 MOM.F90
MOM6 0000000002B3B355 ocean_model_mod_m 579 ocean_model_MOM.F90
MOM6 0000000000DF91E7 MAIN
_ 1082 coupler_main.F90

Hallberg-NOAA added a commit to Hallberg-NOAA/MOM6 that referenced this issue Jun 26, 2020
  Corrected halo size in density derivative calculations in smoothed_dRdT_dRdS
This fixes an i-parallelization problem that was recently introduced (as a part
of MOM6 PR#1089) when VERTEX_SHEAR is True, and closes MOM6 issue mom-ocean#1146.  All
answers in the existing MOM6-examples test suite are bitwise identical, but this
does change (correct) answers when VERTEX_SHEAR is true.
@Hallberg-NOAA
Copy link
Collaborator

This problem was corrected with https://github.com/NOAA-GFDL/MOM6/pull/1147.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants