Changes in DALES

Version 4.4.2

Additional output, support for running RCEMIP experiments.

Bugs fixed

Fix typo in bulkmicro accessing uninitialized qcbase in place of qrbase. 6771d557f
#68, crash with warmstart and kappa scheme cde8e768
vtke in profiles.nc: include density and possibly variable dz. Fix unit. 8e9a228e
modtimestat: don't replace zero cloud fraction by missing value 680bfc6d8

Changes

the behavior of lpresgrad has changed. 47ac7fa6 lpressgrad (default=true) is now always respected, previously the flag had an effect only if one microphysics scheme was enabled, otherwise the calculations were done as if the flag was true.
require cmake >= 3.5

Improvements

add PHYSICS/lconstexner flag to keep the Exner function constant, experimental. 28e9cdd66d
add option uvdamprate to dampen horizontal mean wind to geowind in whole domain (for RCEMIP) a481010b4c
restart files: add a fourth digit in the hours field to enable longer runs 136a7747c
RRTMG: add trace gas concetration settings in namoptions file 8fae48a4b8 Options co2_fraction, ch4_fraction, n2o_fraction in NAMRADIATION namelist
Put sgs_surface_fix back in NAMSUBGRID namelist (default false) 3fe0063
add several optional output fields to fielddump: cli cloud ice clw cloud liquid water ta temperature plw precipitation, liquid pli precipitation, ice hus specific humidity hur relative humidity tntr temperature due to radiative heating tntrs temperature due to radiative heating, shortwave tntrl temperature due to radiative heating, longwave
RRTMG now handles ice clouds 9d33d8b
add module for frozen moist static energy budget (for RCEMIP) 4bfb69c1b3
add coldpool detection variables to CAPE output (e219585) From Rochetin et al, JAMES 2021, doi:10.1029/2020MS002402 hmix - the lowest height where thv > + 0.2K, where the average is calculated from ground to hmix umix - average of u from ground to hmix vmix - average of v from ground to hmix thetavmix - average of theta_v from ground to hmix hinvsrf - lowest height where dT/dz < 0

Optimization

convert more fields to optional single precision aec874e, 83f3274, 89ad0459, c558f1fbe
faster thermodynamics, enabled with PHYSICS/lfast_thermo (default false) 51f183c11c
vectorize modforces 47ac7fa6f, dbd2834b6

Version 4.4.1 - 2023-06-07

A minor release with fixes, improved error reporting and statistics output.

Bugs fixed

Fix unsafe boundary exchange of ustar, commits f2d07e82, 45e38ace1
Fix output units and long_name attribute, commits 3358143c, 0744378c0, c8fdb21
fix #79: dtmax < 0.1 leads to crash, commit 4df5e0e1f1

Improvements

2D radiation statistics, commit 7e58773660e
Add variance budget module, commit 4646ff8e
modgenstat: add per-level cloud fraction, cfrac, to profiles.nnn.nc, commit 8aee2cb63c47
Make lCnstAlbedo true by default for less surprising behavior, commit 989e13a33675
Allow multiple planes for crossxz, crossyz and in any MPI tile, commit a4640bd20b
Add namelist option RUN/loutdirs to save model output in separate directories for each row, commit 3576a917ef

Improved error checking

Stop with error message if any dzh or dzf is zero or negative, commit 5f9f67046c
Add error checking when opening *.inp.nnn files, commit f1a38f24d
Print MPI error string on error, catch and explain nprocx * nprocy != nprocs, commit b6c5887ad8
Add MPI error checking in initialization, commit 6060f13a2ad
Check FFTW plan creation result for errors, commit a2666269b6

Optimization

bulkmicrophysics: give more of the temporary 3D fields configurable precision (field_r), commit fe5f0ea9

Version 4.4 - 2022-06-20

This version introduces configurable floating point precision (by Victor Azizi, Jisk Attema, Fredrik Jansson). The main 3D fields can be either double (default) or single precision. The choice is made at compile time, and reduces the memory requirements and computation time by about 40%. See the Wiki. A new two-moment ice microphysics scheme has been introduced (Jan Chylik, Roel Neggers, with tuning by Jisk Attema). This version also introduces support and tuning for the Fugaku supercomputer, see DALES on Fugaku.

Improvements

Automatic testing of git commits using GitHub actions (Victor Azizi)
All function interfaces are now explicit, reducing the chance of mistakes with variable type
Add twp_bar, rwp_bar to tmser.nnn.nc, commit 7da93bdc
fielddump: time windowing, save fields only when tmin <= current time <= tmax, commit a65b03
Add namelist flags to select individual fields for 3D fielddump, commit c68814
Relative time units (s since date) in netCDF output (if xday, xyear are specified in namelist)
All module interfaces made explicit, reducing risk of type errors
Automatic testing with GitHub Actions, by Victor Azizi, commit 297f18
Add netCDF error checking, commits 031d5c6, ecfeb0ba

Optimization

Configurable floating point precision (by Victor Azizi, Jisk Attema, Fredrik Jansson)
Don't exchange halos of m-fields, commit e25a23
Optimize kappa advection scheme, commit fe60b5e
Optimized warm microphysics by Jisk Attema, commit 1758dcc0
Reduced memory allocation, commit 41c1589

Bugs fixed

Remove option -malign=double
Close the tmser.nnn.nc file at the end
Fix slabsum calls sv0(1,1,1,n) -> sv0(:,:,:,n), commit 79712743

Version 4.3 - 2021-03-13

This version introduces a library interface to DALES, defined in daleslib.f90. With this interface, DALES can be used with the OMUSE framework, which provides a Python interface to DALES. Another new feature is the support of iterative Poisson solvers, see the Wiki and also an alternative FFT-based solver using the FFTW library.

Improvements

Add option ntimedep to PHYSICS namelist - max number of time points for timedependent forcings. Commit 14a84deb
Add support for non-uniform grid in the kappa advection scheme. Commit d8e6c68d
Add a namelist option lsync in NAMNETCDFSTATS to synchronize netCDF output files after writing
Iterative Poisson solver support with the HYPRE library (by J. Attema).
Don't overwrite ascii output files on warm start. Commit 25fae048
Warn when moduser routines are invoked from a case, when a custom moduser.f90 has not been compiled in. Commit ae148a3bd3
Allow advection schemes 5th and 6th with non-uniform grid again, add experimental kappa scheme (77) with better non-uniform grid support. Commit 4ab9a7571b
Add missing rho in divergence diagnostic, show reason for dt limit. Commit da3a32f951b5
Add qsat field for statistics, and statistics for accumulated surface rain flux in simpleice2. Commit 6036c1a93b
Add a library interface to DALES, to enable interfacing with OMUSE. Commits 6e0e39f2, 9f91f2d, 4b97aca, 640977e

Optimization

modpois: don't exchange the pwp field, ghost cells in it are not needed. Commit c7f72b1b
If lcoriol is .false., skip coriolis calculations. Commit c343772
advec_5th: move i,j loops inside if k. Advection calls ~3 times faster. Commit 0badd3a124
Asynchronous MPI halo exchanges. E,W in parallel, then N,S. By J. Attema. Commits a54b9f8e, 6951323, 0c08a262

Bugs fixed

Use the 2D tskin field as input for RRTMG LW_out instead of the domain mean value of tskin. Commit 910fa438
Issue #58 Disable lsmcrossection if isurf/=1
Issue #52 making modtimedepsv allocations consistent with modtimedep and removing redundant broadcast. Commit e6dd89a3
kappa advection scheme: fix index error on lowest level. Commit 98d34d2e
radiation and restarts : #40
modsimpleice2 and initialization order, #49 . Commit be3f9711bb6
timedependent forcings, finding the correct interval for interpolation with time dependent surface variables. #48 . Commit d6f2225, 1ed4357
modbudget: remove extra factor rhobf in calculation of sbtke average. From S. de Roode.
modbudget: add if(myid==0) around netcdf profile writing. Commit b7afc418c5
Fix ibas_prf=3 initialization when zf(k) is exactly 11000 (1st value in the lapse rate table). Issue #41, Commit ee6230bc00
modradstat: handle sign conventions of different radiation schemes (S. de Roode). Commit f43012
radiation: initialize {sw,lw}{Up,Down}_slice arrays before rrtmg calls, since the calls are not always performed (S. de Roode). Commit d4e979de7d
netCDF staistics: fix for determining the index where writing is resumed at warm start. Issue #42, Commit 8ff2cced3
Fix closing of crosssection netcdf files. Commit e72701e08

Version 4.2 - 2019-06-05

This is a summary of changes in DALES 4.2. More details of the changes follow below.

Optimization of advection and subgrid schemes
Improvement surface scheme (phi functions) for stable conditions
Further testing RRTMG (introducing aerosol)
Extension from one-big leaf to two big-leaves (sunlit and shaded)
Arbitrary distribution of sources and sinks scalar
Heterogeneity land-surface to account for canopy dynamic effects
Option for more agressive optimization flags (try export SYST=gnu-fast)

Bugs fixed

Typo in advec_5th and advec_52, in the advecv routines. The 5th scheme has the typo only in the code for the boundaries, while 52 has it also for the bulk. Issue #37 Commit c1ac043a038f7
Obukhov length calculation: catch case Rib = 0 - set L=1e6. Can happen in the first step if the surface flux is 0. The Obukhov length is capped at 1e6 anyway, so we may as well cap it also when Rib == 0. Issue #24 Commit a3c5d1b
adaptive time step: limit dt by ekh, not only ekm - ekh may be larger. Issue #29 Commit a67cf8f, 9f8304d83fe4
Test cases arm_brown, arm_unstable, bomex : correcting namelist entry SUBGRID to NAMSUBGRID. These three test cases had an entry &SUBGRID in the name list. This entry is ignored, since the proper name is &NAMSUBGRID. All three cases also defined subgrid parameters in the name list which are not expected there. These were removed, as the values were set to the defaults. Commit 12d910a
Add kind=longint to floor and ceiling functions of time to prevent overflow in timeleft. Commit 39f75a78e

Improvements

Add alternative simpleice - imicro=6. Contains fixes and optimizations but less statistics. Note that the original simpleice is still preserved and can be used with imicro=5 as before. Commit 52ca4d5.
- snow ventilation parameters differ from references Issue #26
- many loops were fused -> need fewer arrays, better cache efficiency
- rsgratio was sometimes used uninitialized
- avoid some numerical fixes: rsgratio(i,j,k)+1.e-6, qrr/(qrr+1.e-9)
- performance gain: ~10% faster simulation in total, case-dependent.
Add support for non-equidistant grid to _52 and _62 advection schemes. For these two schemes, support can be added with copy-paste from the 2nd order scheme, since also the 52 and 62 schemes are 2nd order in the vertical direction. Commits 4534fb8c638b, bc9606e86b5d
Stop with error message if the chosen advection is incompatible with non-equidistant grid. Most advection routines do not support a non-equidistant grid. Issue: The list of schemes compatible with a non-equidistant grid was overly restrictive. Propose to enable some of them again in the next version. Commit b8ce2d0df24
Surface scheme: define phim and phih as functions. Add cap on phi for zeta > 1 to prevent crashes in stable conditions. Issue #28 Issue #30 Commit 8469307
Make sgs_surface_fix = .false. by default, and remove it from the namelist options. This fix is not needed after the changes to the phi and psi functions in very stable conditions. It causes non-physical "flares" of high tke at stable conditions.
Add tke field to the netCDF cross sections. Having the TKE field available for inspection is good for testing the SGS and surface schemes. Commit c9b43e4
Allocate large arrays instead of storing on the stack. There were only a few of these. Avoids problems with the stack size being too small -> crash. Commit 1ab3ea1, 1950e32
Store vertical netCDF cross sections only in the first tile in x and in y. Commit 22f5643
Expose more subgrid scheme parameters in namelist: ch1,ch2,cm,ce1,ce2. Add forgotten MPI_BCAST of courantp. We have been discussing whether the Prandtl number could be increased from ⅓ to 1 (Smagorinsky scheme), and correspondingly the ch2 coefficient could be decreased from 2 to 0 (SGS-TKE scheme). Adding ch2 and some other coefficients to the name list for easy testing of this. Commit 13c2464
Better restart logic: if trestart=0, write restart file only at the end. If < 0, don't write a restart file. Commit 8067376f9c
Add macros to custom doxygen latex header.tex, to work with doxygen 1.8.2 Commit 1104f3b
Add ECMWF build option. Cray fortran fix: avoid -1**real. Commit 9fd1524
Remove increment of uninitialized and unused ii variable, in the case of only one process. Tripped ifort's -check all. Commit 2669b1d

Optimizations

Add more aggressive compiler options for gfortran, selectable with SYST=gnu-fast. Enables -Ofast -march=native. Test on 1h of bomex case: gain 30% speed, results are close. Also adding another system, SYST=ECMWF-intel, for using ifortran on the ECMWF Cray. Commit 3434d3372c
Add a null advection scheme. A way to use simpleice microphysics without advecting the unused scalar. Saves some percents of simulation time for free. Commit b3d87987e4 or 826c7b5
Optimization of advec_52, advec_62 - moving if(k==1) out of the k-loop, for a performance gain of about 8% (full simulation). Commit 3b298b0d03, e18eb56163
Remove rhoputin from advec_2nd, advec_52, advec_62, advec_5th. rhoputin was a temporary array used to re-use the result of a multiplication. ~10% whole-simulation performance gain when removing this, due to fewer memory accesses. Commit afd62e8, 6b9a3e41b7c
Optimized advec_hybrid scheme. advec_hybrid.f90 is the original implementation (scheme 55), a new file advec_hybrid_f.f90 (scheme 555) has been added with a different implementation of the same advection scheme. The advection routine itself is about 4 times faster, a full simulation using it is about 30% faster. Commit e5e029bc4627a, 8d6fd70
Modradfull optimization - calculate exp only when needed in coefft0. Commit 5c712f3
Optimization of the subgrid scheme save some divisions and intermediate arrays. Commit 42897d8
tstep_integrate: vector math instead of do loops faster, slightly shorter. Commit ddcb29c

Known issues in DALES 4.2

Runs with radiation do not restart properly. Issue #40
Radiation tendencies are not initialized to 0.
Tendencies in the radiation statistics (output only) are sometimes calculated with the wrong sign.

Wishlist for future improvements

Make the MPI halo exchange less synchronous, to overlap communication and calculations.
Permit longer time steps. The CFL criterion is formulated for an equidistant grid, and unnecessarily restrictive. adaptive time step: longer steps possible when dx,dy,dz non-equal, as in microHH. see discussion for issue #29. When dx,dy,dz are non-equal, the diffusion criterion is stronger than strictly necessary. Relaxing it offsets the performance impact of commit a67cf8f. This has not been rigorously tested! As this permits longer time steps, it makes the simulation less stable. This may need compensating by specifying a slightly smaller peclet number e.g. in the namoptions file. According to Chiel, microHH does the same. (Commit 1151312)
Don't do halo exchange of the m-fields The m-fields ghost cells are seldom needed. Saves maybe 1 % for a single process run, and a large amount of MPI communication in a multi-process run. This was tried but reverted since it interfered with chemistry, which does use the m-field halos. (Commit 0c2cf59)
Surface statistics 2D fields LWP, RWP, TWP and accumulated rain, output as netCDF in the cross-section routine. The accumulated rain works only for imicro=6. TODO:
- make this optional, selectable by a flag in the namelist.
- output the rain field only if imicro==6
- check if duplicated in modlsmstat or modAGScross.
Implementation of Mean-state acceleration [Jones et al, JAMES 2015, https://doi.org/10.1002/2015MS000488] A simple change in the time step routine, accelerating the rate of change of the horizontal averages of prognostic quantities. Useful for speeding up simulations if there is a large difference in time scale for eddies and for horizontal averages. The acceleration factor can be set in namoptions. Default is 1, meaning no acceleration. Huug proposes: add an option to enable acceleration for some time, for spinup. Handle chemicals sv fields. Add namelist options, maybe a vector for all sv - accelerate or not.
Merged netCDF writing. It is tedious to stitch together the separate netCDF files produced - one file per MPI task. There are scripts in the Dales repository, but they are from the time of slab parallelization and only stitch in one direction. Consider the parallel netCDF API where many MPI tasks can write together into a single file.
Synchronize netCDF files periodically. When switching to compressed netCDF v4, the files can be unreadable while DALES is running or after DALES crashes or is stopped in the middle of the run.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

CHANGELOG.md

CHANGELOG.md

Changes in DALES

Version 4.4.2

Bugs fixed

Changes

Improvements

Optimization

Version 4.4.1 - 2023-06-07

Bugs fixed

Improvements

Improved error checking

Optimization

Version 4.4 - 2022-06-20

Improvements

Optimization

Bugs fixed

Version 4.3 - 2021-03-13

Improvements

Optimization

Bugs fixed

Version 4.2 - 2019-06-05

Bugs fixed

Improvements

Optimizations

Known issues in DALES 4.2

Wishlist for future improvements

Files

CHANGELOG.md

Latest commit

History

CHANGELOG.md

File metadata and controls

Changes in DALES

Version 4.4.2

Bugs fixed

Changes

Improvements

Optimization

Version 4.4.1 - 2023-06-07

Bugs fixed

Improvements

Improved error checking

Optimization

Version 4.4 - 2022-06-20

Improvements

Optimization

Bugs fixed

Version 4.3 - 2021-03-13

Improvements

Optimization

Bugs fixed

Version 4.2 - 2019-06-05

Bugs fixed

Improvements

Optimizations

Known issues in DALES 4.2

Wishlist for future improvements