-
Notifications
You must be signed in to change notification settings - Fork 542
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
ww3_ufs1.3 reproducibility issue #452
Comments
Should this be solved before @ricampos starts his tests? @aliabdolali or @ricampos have either of you looked at this yet? |
This sounds more critical/important than the other issue about netcdf (where in the worst scenario we could just pick netcdf/4.7.2), since it is generating different initial conditions. Let me try to check here... |
I agree that we should fix these two issues, but the issue with the partitions seems more important to me than restart reproducibility, as Ricardo's work does not need to meet operational requirements. |
He's running cycled tests which rely on restart files... so actually I would say restarts are important. |
Hi. I ran 2 independent tests with the same regtest ufs1.3 at 2 ww3 installations. Same number of cores (120) and I could see the restart files are slightly different: |
@ricampos using the vbindiff can you see which variables are different? |
Hi @ricampos, if this is the only difference, I believe you're in the clear. I think this is just saying the two ascii header strings are different. As a check, the first red hex number (top) 6C is ascii code for 'l', and bottom, 45 is 'E'. (ps, I didn't know about vbindiff, very cool!) |
@ricampos Is vbindiff available on Orion or do you need to compile it? This is very useful, thanks As another way to debug, we should see what is the difference between ww3_ufs1.3 (not identical) and one other similar case ww3_ufs1.2 b4b identical) . Both are using the same switch on three grids. We should compare the ww3_grid_glo_15m.inp with ww3_grid_gnh_10m.inp to see the differences. |
thank you for letting us know, Matthew. I use vbindiff on my personal laptop only, not on Orion yet. I sent an email to Helpdesk to see if could be possible to use it there too. |
Describe the bug
When running with the develop branch, the ww3_ufs1.3 test twice, the 20190830.030000.restart.glo_15m is not the same.
To Reproduce
Clone two copies of the develop branch (8/23/21) and run the reg test:
./bin/run_test -w work_a -m grdset_a -f -p mpirun -n 140 -t 4 -o all ../model ww3_ufs1.3
and then use matrix.comp to compare the output.
Expected behavior
All files should be the same.
FYI @ricampos @aliabdolali
The text was updated successfully, but these errors were encountered: