
Atmospheric forcing #41

Merged
merged 40 commits into from
Jul 9, 2024

Conversation

NoraLoose
Collaborator

Completes step 4 in #1.

This adds three classes: AtmosphericForcing, SWRCorrection, and ForcingDataset, where the first two are the ones the user will interact with. I added an example notebook to the documentation that illustrates how the classes can be used in practice.

ToDo

There are two things left to do, and I will open two separate issues to address these:

  • Test performance and profile memory on different machines and clusters, and make recommendations to the user about what kind of dask cluster to use: (1) thread-based vs. process-based, and (2) how many workers for a given total amount of memory (making sure every worker has enough memory).
  • Change the interpolation scheme for zonal wind, meridional wind, and the SWR correction to mimic the MATLAB results more closely; see below.
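The cluster-choice trade-off in the first item can be sketched with dask's built-in schedulers. This is a minimal illustration on a hypothetical random array (not the real forcing data); `scheduler` and `num_workers` are the knobs that the profiling would tune:

```python
import dask
import dask.array as da

# Hypothetical array standing in for a forcing field; real ERA5 data is far larger.
x = da.random.random((1_000, 1_000), chunks=(250, 250))

# Thread-based scheduling: low overhead, shared memory; works well when the
# underlying work (numpy, netCDF I/O) releases the GIL.
with dask.config.set(scheduler="threads", num_workers=4):
    total_threads = x.sum().compute()

# Process-based scheduling: sidesteps the GIL but pays serialization costs;
# budget memory so that (total memory / num_workers) comfortably exceeds
# the peak working set of a single task.
with dask.config.set(scheduler="processes", num_workers=2):
    total_processes = x.sum().compute()
```

Recomputing the same dask graph is deterministic, so both schedulers return the same sum; only the execution strategy differs.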

Python vs. MATLAB-based results

Both roms-tools and the MATLAB-based scripts create the 7 atmospheric forcing fields that are necessary to run a ROMS simulation. To create these fields, both follow the same two-step procedure:

  1. Fill in NaNs over land by spreading values from the ocean into land through diffusion.
  2. Interpolate from the ERA5 grid to the ROMS grid.
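Step 1 corresponds to the lateral_fill routine imported in the code below. As a rough illustration of the idea only (a Jacobi-style sketch on a hypothetical 3×3 field; the actual roms-tools implementation may differ, e.g. by solving a sparse system directly):

```python
import numpy as np

def diffusive_fill(field, mask, n_iter=200):
    """Fill NaNs (land) by iteratively diffusing ocean values inland.

    field: 2D array with NaNs over land; mask: True over ocean.
    Ocean points stay fixed; land points relax toward the average
    of their four neighbours until the field is smooth over land.
    """
    filled = np.where(mask, field, 0.0)
    for _ in range(n_iter):
        # neighbour average with zero-gradient (edge-replicated) boundaries
        padded = np.pad(filled, 1, mode="edge")
        nbr = 0.25 * (padded[:-2, 1:-1] + padded[2:, 1:-1]
                      + padded[1:-1, :-2] + padded[1:-1, 2:])
        filled = np.where(mask, field, nbr)
    return filled

# hypothetical tiny field: NaNs mark land
ocean = np.array([[1.0, 1.0, np.nan],
                  [1.0, np.nan, np.nan],
                  [1.0, 1.0, 2.0]])
mask = ~np.isnan(ocean)
filled = diffusive_fill(ocean, mask)
```

After convergence the land values lie between the minimum and maximum of the surrounding ocean values, while the ocean values are untouched.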

Here is a comparison for all 7 fields for a randomly chosen time step. For each atmospheric field, there are 4 panels.

  • The lower-left panel shows the difference between Python and MATLAB over both land and ocean. There are significant differences over land because the Python and MATLAB implementations use different boundary conditions for their diffusion-based NaN filling.
  • Since we only care about differences over the ocean, the lower-right panel masks out land. This is the important panel to look at.

We make the following observations:

  • For Tair, qair, rain, and lwrad, roms-tools and the MATLAB-based scripts produce very similar results over the ocean, because both use linear interpolation for these fields.
  • There are significant differences for uwnd, vwnd, and swrad because, for the time being, roms-tools uses linear interpolation for these fields while the MATLAB-based scripts use a modified Akima scheme.
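A minimal 1-D sketch of why the two schemes diverge near sharp gradients, on hypothetical data (SciPy's Akima1DInterpolator implements the original Akima scheme, a close cousin of MATLAB's modified "makima" variant):

```python
import numpy as np
from scipy.interpolate import Akima1DInterpolator

# hypothetical step-like signal, e.g. a wind jet edge
x = np.arange(10, dtype=float)
y = np.array([0., 0., 0., 0., 1., 1., 1., 0., 0., 0.])
xi = np.linspace(0.0, 9.0, 91)

linear = np.interp(xi, x, y)           # what roms-tools currently does
akima = Akima1DInterpolator(x, y)(xi)  # piecewise-cubic Akima scheme

# Both pass through the data points exactly, but between points the cubic
# Akima interpolant flattens into the jump while linear ramps straight
# through it, so the two fields differ wherever gradients are sharp.
```

The interpolants agree at the nodes and differ in between, which is exactly the pattern seen in the uwnd, vwnd, and swrad comparison panels.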

[Comparison figures for Tair, qair, rain, lwrad, swrad, uwnd, and vwnd]

This was referenced Jun 18, 2024
@NoraLoose
Collaborator Author

@TomNicholas pinging you in case you have any thoughts on this PR. But no rush if you are busy with other stuff.

@TomNicholas (Member) left a comment

Some minor comments for now

roms_tools/_version.py
Comment on lines 1 to 13
import xarray as xr
import dask
from dataclasses import dataclass, field
from roms_tools.setup.grid import Grid
from datetime import datetime, timedelta
import glob
import numpy as np
from typing import Optional, Dict, Union
from scipy.sparse import spdiags, coo_matrix
from scipy.sparse.linalg import spsolve
from roms_tools.setup.fill import lateral_fill
import warnings
import calendar
Member

We want to be running the linting tool isort on this repo, which would automatically organize these according to which are builtins vs external dependencies vs internal imports.

Collaborator Author

Would it be useful to use pre-commit to run isort and other linting tools? https://pycqa.github.io/isort/docs/configuration/pre-commit.html

Member

Definitely! ruff does a lot of the formatting as one tool now though, e.g.

https://github.com/zarr-developers/VirtualiZarr/blob/main/.pre-commit-config.yaml
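A minimal .pre-commit-config.yaml along the lines discussed here might look like the following sketch (the `rev` below is a placeholder; pin it to the latest ruff-pre-commit release):

```yaml
repos:
  - repo: https://github.com/astral-sh/ruff-pre-commit
    rev: v0.4.10  # placeholder; pin to the current release
    hooks:
      - id: ruff          # linting, including import sorting (isort-style "I" rules)
        args: [--fix]
      - id: ruff-format   # formatting, replacing black
```

With this in place, `pre-commit run --all-files` applies both hooks locally, and the same checks can run in CI.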

Collaborator Author

Ok, I have run some linting tools on the main branch (PR #46) as well as this branch (commit 5b558e3). Pre-commit now also runs some linting tools as part of the test suite (see checks below). Thanks for this suggestion - this repo really needed a clean-up!

Member

Excellent! @dafyddstephenson you might want to steal what Nora has done here for the C-Star repo.

# Load the dataset
with dask.config.set(**{'array.slicing.split_large_chunks': False}):
    # initially, we want a time chunk size of 1 to enable quick .nan_check() and .plot() methods for AtmosphericForcing
    ds = xr.open_mfdataset(self.filename, combine='nested', concat_dim=self.dim_names["time"], chunks={self.dim_names["time"]: 1})
Member

You probably want coords='minimal', compat='override' here - see pydata/xarray#8778

Collaborator Author

Thanks! Done via commit 0556bf1.
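With those arguments, the call might look like the following sketch (hypothetical tiny stand-in files written with the scipy backend; the real inputs are monthly ERA5 files):

```python
import os
import tempfile

import numpy as np
import xarray as xr

# Write two tiny stand-in "ERA5" files (hypothetical data, not real forcing).
tmpdir = tempfile.mkdtemp()
paths = []
for i in range(2):
    ds = xr.Dataset(
        {"Tair": (("time", "lat"), np.random.rand(3, 4))},
        coords={"time": np.arange(3, dtype="float64") + 3 * i,
                "lat": np.linspace(-1.0, 1.0, 4)},
    )
    path = os.path.join(tmpdir, f"era5_{i}.nc")
    ds.to_netcdf(path, engine="scipy")
    paths.append(path)

# coords="minimal" and compat="override" skip the eager comparison of
# coordinate values across all files that the defaults perform, so opening
# many files stays cheap and lazy.
combined = xr.open_mfdataset(
    paths,
    engine="scipy",
    combine="nested",
    concat_dim="time",
    coords="minimal",
    compat="override",
    chunks={"time": 1},  # time chunk size of 1, as in the snippet under review
)
```

The result is a single lazy dataset concatenated along time, with each time step in its own dask chunk.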

@NoraLoose
Collaborator Author

@TomNicholas do we want to wait until the dask deployment issue is fixed to merge this PR? Maybe we should defer this to another PR?

@TomNicholas
Member

@TomNicholas do we want to wait until the dask deployment issue is fixed to merge this PR? Maybe we should defer this to another PR?

As long as this code runs successfully for some dask setup somewhere, we can separate out the problem of deploying dask on other machines. So if this runs on Casper/your laptop that's good enough to merge this IMO.

Generally merging something which works first, then creating separate issues to track getting it to run in other setups or improving performance is a good idea.

@NoraLoose
Collaborator Author

Sounds good! In my latest test, it actually ran on Perlmutter as well (with the default LocalCluster()), but there are still some weird things happening if other dask deployment strategies are used. I will try to document those in a separate issue.

@NoraLoose NoraLoose merged commit fbbe244 into CWorthy-ocean:main Jul 9, 2024
8 checks passed