Fantastic4casters

2021 BU EE585 team project: EFI/NEON terrestrial carbon challenge

0. Contact Information

1. Pulling and visualizing data

The R script "Data_download.R" pulls the NEON measurements (NEE, LE, and soil moisture) and NOAA weather forecasts (NOAA's Global Ensemble Forecast System, GEFS) available on the date it is run for the four NEON sites, and plots time series of the NEON history and the NOAA projections.

Before running, manually set the variable "base_dir" in Data_download.R. It defines the working directory, where the data are temporarily stored and the output graphs are saved. The user should also manually create the following directories on the local machine: the working directory itself, plus "data", "graph", and "drives" under it. To run the code daily, add the following entries to your cron table (cron is required and is available only on Unix-based operating systems):



# [In a terminal] run `crontab -e`, press `i` to enter insert mode, then paste the lines below

# set up the terrestrial data script to run at 5:00 AM
MAILTO="[email protected],[email protected],[email protected],[email protected]"
00 05 * * * /usr/local/bin/Rscript "PATH/Data_download.R"

Of the data being pulled, the NEON measurements are updated monthly, with each update releasing new daily data for the past month. The NEON historical time series plotted by the daily runs therefore extend only to the latest NEON release. The NOAA weather forecasts are 35-day ensemble projections made up of 31 ensemble members (forecasts from separate model runs), released every six hours at a 1-hour forecast resolution.
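To give a sense of the size of one NOAA release and how its members can be collapsed for plotting, here is a minimal sketch. It assumes a 35-day horizon at hourly steps gives 35 × 24 = 840 time steps, and it fills the matrix with synthetic placeholder values rather than real GEFS output; the names `ens`, `ens_mean`, and `ens_ci` are illustrative and do not come from Data_download.R.

```r
set.seed(1)

n_members <- 31        # ensemble members per release
n_steps   <- 35 * 24   # assumed hourly steps over a 35-day horizon

# Synthetic placeholder "air temperature" values: one row per time step,
# one column per ensemble member. Not real GEFS data.
ens <- matrix(rnorm(n_steps * n_members, mean = 15, sd = 3),
              nrow = n_steps, ncol = n_members)

# Ensemble mean and a 95% interval at each time step
ens_mean <- rowMeans(ens)
ens_ci   <- apply(ens, 1, quantile, probs = c(0.025, 0.975))  # 2 x n_steps matrix
```

Plotting `ens_mean` with `ens_ci[1, ]` and `ens_ci[2, ]` as a ribbon is one common way to show ensemble spread over the forecast horizon.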

The time series plots are exported to the "graph" sub-directory under the working directory.
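The directory layout that the script expects can also be created from R instead of by hand. The sketch below is one possible way to do it; the path assigned to `base_dir` is a hypothetical placeholder (a temporary folder) and should be replaced with the same value used in Data_download.R.

```r
# Hypothetical location; use the same value as base_dir in Data_download.R
base_dir <- file.path(tempdir(), "Fantastic4casters")

sub_dirs <- c("data", "graph", "drives")

for (d in file.path(base_dir, sub_dirs)) {
  # recursive = TRUE also creates base_dir itself;
  # showWarnings = FALSE keeps reruns quiet when the folders already exist
  dir.create(d, recursive = TRUE, showWarnings = FALSE)
}

# TRUE for each sub-directory that now exists
created <- dir.exists(file.path(base_dir, sub_dirs))
```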

2. Historical time-series fit

Before generating forecasts, use the scripts named "XXX" to fit the historical data. For this historical fit of NEE, LE, and soil moisture, we built a joint state-space dynamic linear model that consists of data models and a process model. The data models are simple Gaussian distributions,

$\begin{aligned}NEE_{obs}[t] & \sim N(NEE[t], \tau_{NEE_{obs}})\\LE_{obs}[t] & \sim N(LE[t], \tau_{LE_{obs}}) \\SM_{obs}[t] & \sim N(SM[t], \tau_{SM_{obs}})\end{aligned}$

where $NEE$, $LE$, and $SM$ are the targets of our forecast, $t$ represents time, and the $\tau$'s (given gamma priors, see below) represent the uncertainties arising during observation and/or data collection. The subscript $obs$ marks the observed value of each variable.
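As an illustration of what one of these data models implies, the sketch below draws synthetic observations around a latent series. It assumes that $\tau$ is a precision (inverse variance), as in the JAGS `dnorm(mu, tau)` parameterization; the latent series and the value of `tau_obs` are made-up placeholders, not fitted values.

```r
set.seed(42)

n <- 100
# Synthetic latent NEE series (placeholder, not NEON data)
NEE_latent <- sin(seq(0, 4 * pi, length.out = n))

# Hypothetical observation precision; under the precision assumption
# the standard deviation is 1 / sqrt(tau)
tau_obs <- 4
sd_obs  <- 1 / sqrt(tau_obs)   # 0.5

# Data model: NEE_obs[t] ~ N(NEE[t], tau_obs)
NEE_obs <- rnorm(n, mean = NEE_latent, sd = sd_obs)
```

Note that R's `rnorm` takes a standard deviation while JAGS takes a precision, so the conversion step matters whenever values are moved between the two.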

The process model uses shortwave radiation, longwave radiation, air temperature, and precipitation as covariates. It also couples NEE, LE, and soil moisture: the mean of each variable depends on the previous-step values of all three.

$\begin{aligned}NEE[t] &\sim N(\mu_{NEE}[t],\tau_{NEE_{add}}) \\LE[t] &\sim N(\mu_{LE}[t],\tau_{LE_{add}}) \\SM[t] &\sim N(\mu_{SM}[t],\tau_{SM_{add}}) \\\mu_{NEE}[t] &= \beta_{NEE}\cdot NEE[t-1] + \beta_{NEE,LE}\cdot LE[t-1] + \\& \beta_{NEE,SM}\cdot SM[t-1] + \beta_{NEEI}\cdot XfI[t,1] + \\& \beta_{NEE,sw}\cdot XfC[t,1] + \beta_{temp}\cdot XfC[t,3] \\\mu_{LE}[t] &= \beta_{LE}\cdot LE[t-1] + \beta_{LE,NEE}\cdot NEE[t-1] + \\& \beta_{LE,SM}\cdot SM[t-1] + \beta_{LEI}\cdot XfI[t,2] + \\& \beta_{LE,sw}\cdot XfC[t,1] + \beta_{lw}\cdot XfC[t,2] \\\mu_{SM}[t] &= \beta_{SM}\cdot SM[t-1] + \beta_{SM,NEE}\cdot NEE[t-1] + \\& \beta_{SM,LE}\cdot LE[t-1] + \beta_{SMI}\cdot XfI[t,3] + \\& \beta_{precip}\cdot XfC[t,4] \\XfI[t,i] &\sim N(\mu_{XfI}[i],\tau_{XfI}[i]) \\XfC[t,i] &\sim N(\mu_{XfC}[i],\tau_{XfC}[i])\end{aligned}$

where the $\mu$'s are the means of the normal distributions and the $\tau$'s define the uncertainties, with the subscript $add$ denoting the process error added at each time step $t$. In each $\mu$, the $\beta$'s are coefficients on the previous-step value of the variable itself, the previous-step values of the other two variables, the intercept $XfI$, and the corresponding covariates $XfC$. Incoming shortwave radiation ($sw$, $XfC[:,1]$) and temperature ($temp$, $XfC[:,3]$) are the covariates for NEE, $sw$ and incoming longwave radiation ($lw$, $XfC[:,2]$) for LE, and precipitation ($precip$, $XfC[:,4]$) for SM.
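To make the structure of the process mean concrete, here is a minimal sketch of the $\mu_{NEE}[t]$ equation as an R function. The function name `mu_NEE`, the names inside `beta`, and all numeric values are hypothetical placeholders chosen only so the arithmetic is easy to follow; they are not fitted coefficients and this is not the project's own code.

```r
# One-step process mean for NEE, following the mu_NEE[t] equation:
# previous-step NEE, LE, SM, the intercept term XfI[t, 1],
# and the covariates XfC[t, 1] (shortwave) and XfC[t, 3] (temperature).
mu_NEE <- function(NEE_prev, LE_prev, SM_prev, XfI_t, XfC_t, beta) {
  beta[["NEE"]]    * NEE_prev +
  beta[["NEE_LE"]] * LE_prev  +
  beta[["NEE_SM"]] * SM_prev  +
  beta[["NEEI"]]   * XfI_t[1] +
  beta[["NEE_sw"]] * XfC_t[1] +
  beta[["temp"]]   * XfC_t[3]
}

# Placeholder coefficients (illustrative only)
beta <- c(NEE = 0.8, NEE_LE = 0.01, NEE_SM = -0.5,
          NEEI = 1, NEE_sw = -0.002, temp = 0.05)

mu <- mu_NEE(NEE_prev = -2, LE_prev = 100, SM_prev = 0.3,
             XfI_t = c(0.1, 0, 0),          # intercept terms for NEE, LE, SM
             XfC_t = c(500, 350, 20, 0),    # sw, lw, temp, precip
             beta  = beta)
# -1.6 + 1 - 0.15 + 0.1 - 1 + 1 = -0.65
```

The means for LE and SM follow the same pattern with their own coefficients, intercept columns, and covariate columns.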

Priors used for the data models and the process model are

$\begin{aligned}NEE[1] &\sim N(0,0.00001) \\LE[1] &\sim N(0,0.00001) \\SM[1] &\sim N(0,0.00001) \\\end{aligned}$

$\begin{aligned}\tau_{NEE_{obs}} &\sim \Gamma(3,1) \\\tau_{LE_{obs}} &\sim \Gamma(0.5,1) \\\tau_{SM_{obs}} &\sim \Gamma(0.1,0.1) \\\tau_{NEE_{add}} &\sim \Gamma(3,1) \\\tau_{LE_{add}} &\sim \Gamma(0.1,0.1) \\\tau_{SM_{add}} &\sim \Gamma(0.1,0.1) \\\end{aligned}$

$\begin{aligned}\beta_{all} & \sim N(0,0.001) \\\mu_{XfI}[i] &\sim N(0,0.001) \\\mu_{XfC}[i] &\sim N(0,0.001) \\\tau_{XfI}[i] &\sim N(0.01,0.01) \\\tau_{XfC}[i] &\sim N(0.01,0.01)\end{aligned}$
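To read these priors on a more familiar scale, the sketch below converts a few of them. It assumes the second argument of $N(\cdot,\cdot)$ is a precision and that $\Gamma(\cdot,\cdot)$ is parameterized by shape and rate, which is how JAGS defines `dnorm` and `dgamma`; the helper names are illustrative.

```r
# Precision -> standard deviation (assumes N(mean, precision))
prec_to_sd <- function(tau) 1 / sqrt(tau)

sd_beta  <- prec_to_sd(0.001)     # about 31.6: a wide prior on the coefficients
sd_state <- prec_to_sd(0.00001)   # about 316: a very wide prior on the initial states

# Mean and variance of a Gamma(shape, rate) prior
gamma_mean <- function(shape, rate) shape / rate
gamma_var  <- function(shape, rate) shape / rate^2

m_31   <- gamma_mean(3, 1)        # 3
m_0101 <- gamma_mean(0.1, 0.1)    # 1
v_0101 <- gamma_var(0.1, 0.1)     # 10
```

Under these assumptions, the normal priors are weakly informative, and $\Gamma(0.1, 0.1)$ is much more diffuse than $\Gamma(3, 1)$.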

The model was run with JAGS (Just Another Gibbs Sampler), a software package for Bayesian analysis using Markov chain Monte Carlo (MCMC) simulation, for 20,000 iterations with three chains. The first 500 iterations are treated as burn-in and removed from subsequent analyses.
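The burn-in step can be illustrated with base R. The sketch below uses random placeholder draws for a single parameter in place of real JAGS output, and assumes the 500-iteration burn-in is dropped from each of the three chains before they are pooled.

```r
set.seed(7)

n_iter   <- 20000
n_chains <- 3
burn_in  <- 500

# Placeholder draws for one parameter; real draws would come from JAGS
chains <- replicate(n_chains, rnorm(n_iter), simplify = FALSE)

# Drop the first 500 iterations of each chain, then pool the rest
kept      <- lapply(chains, function(x) x[(burn_in + 1):n_iter])
posterior <- unlist(kept)

n_kept <- length(posterior)   # 3 * (20000 - 500) = 58500
```

If the samples are stored as a coda `mcmc.list`, `window(samples, start = burn_in + 1)` is a common alternative that does the same trimming on all chains at once.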
