A better fitting interface #22

kmnhan · 2024-04-15T20:03:24Z

We removed `guess_fit` because fitting is not magic, and users should be aware of what the initial parameters are. “Explicit is better than implicit.”

Arguably, this makes the fitting workflow slightly less convenient, since we now need to specify the independent variables.

Another shortcoming: the PyARPES approach of creating xarray objects that contain ModelResults has its advantages, but placing non-picklable objects into NetCDF-like structures is counterintuitive and could be misleading. I can’t think of a better alternative…

kmnhan · 2024-04-16T10:12:38Z

Maybe add a callable accessor named `lmfit` or `qfit` that closely follows the `DataArray.curvefit` syntax but takes an lmfit model. I think the most Pythonic approach would be to use `apply_ufunc`, but we'll have to see how well it performs when running fits in parallel.

Add a `Dataset.modelfit` and `DataArray.modelfit` accessor with similar syntax and output as `Dataset.curvefit`. Closes #22

kmnhan · 2024-04-17T12:36:11Z

An initial version of a callable accessor based on `apply_ufunc` has been added in e06982d as `modelfit`. It is slower than joblib parallelization but faster than expected: it should return in a few seconds for a couple hundred well-conditioned fits.

It is very versatile but not as easy to use as I thought. It returns the best-fit coefficients, their standard errors, and goodness-of-fit statistics. I tried to make it also return the covariance matrix, the number of variables, and the initial parameters, but this is difficult since they may differ for each fit. One idea is to write them in terms of params and leave the unrelated entries as NaN... I should see how ambiguous this is to the user.

On the other hand, storing the y-values should be feasible; I'll try to implement that. Maybe `apply_ufunc` has a nice way of handling it.
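The NaN-padding idea for the covariance matrix could look something like the sketch below. It assumes the fit reports a covariance matrix only for the parameters that were actually varied, together with their names in matching order (as lmfit does with `covar` and `var_names`). The function name `pad_covariance` and the example numbers are made up for illustration.

```python
import numpy as np


def pad_covariance(covar, var_names, all_names):
    """Embed the covariance of the varied parameters into a NaN-filled
    square matrix indexed by every parameter name."""
    full = np.full((len(all_names), len(all_names)), np.nan)
    idx = [all_names.index(name) for name in var_names]
    # np.ix_ selects the sub-block of rows and columns for the varied parameters.
    full[np.ix_(idx, idx)] = covar
    return full


all_names = ["amplitude", "center", "sigma"]
# Hypothetical case: `center` was held fixed, so it has no covariance entries.
covar = np.array([[4.0, 0.1], [0.1, 9.0]])
padded = pad_covariance(covar, ["amplitude", "sigma"], all_names)
print(padded)
```

Every fit then yields a matrix of the same shape, which is what a fixed output dimension needs. The cost is that a NaN entry can mean either "not varied" or "could not be estimated", which is the ambiguity mentioned above.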

kmnhan · 2024-04-20T03:47:47Z

Should close with 0f7a1e0.
Added the covariance matrix and, optionally, the ModelResult object to the output. Parallelization is implemented by converting to a Dataset and parallelizing over `data_vars`; it may not be the most efficient way, but it works!
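The pattern of converting to a Dataset and parallelizing over `data_vars` can be illustrated with a minimal sketch. Two things here are assumptions rather than the actual implementation: a standard-library thread pool stands in for whatever parallel backend is really used, and a straight-line `np.polyfit` stands in for the model fit. `fit_line` and `fit_over_data_vars` are hypothetical names.

```python
from concurrent.futures import ThreadPoolExecutor

import numpy as np
import xarray as xr


def fit_line(darr):
    """Stand-in for a single fit: least-squares straight line along `x`."""
    slope, intercept = np.polyfit(darr["x"].values, darr.values, deg=1)
    return xr.DataArray(
        [slope, intercept],
        dims="param",
        coords={"param": ["slope", "intercept"]},
    )


def fit_over_data_vars(ds, max_workers=2):
    """Fit every data variable of `ds` independently and collect the results."""
    names = list(ds.data_vars)
    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        results = list(pool.map(lambda name: fit_line(ds[name]), names))
    return xr.Dataset(dict(zip(names, results)))


x = np.linspace(0.0, 1.0, 11)
slopes = [1.0, 2.0, 3.0]
darr = xr.DataArray(
    np.outer(slopes, x) + 0.5,
    dims=("k", "x"),
    coords={"k": ["a", "b", "c"], "x": x},
)
# Split the array along `k` so that each slice becomes its own data variable.
ds = darr.to_dataset(dim="k")
fits = fit_over_data_vars(ds)
print(fits)
```

Each data variable is fitted independently, so the work is easy to distribute, although the granularity is fixed by how the data happen to be split into variables.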

Added new interface for fitting, see #22 for discussions. Made loader argument optional for `erlab.io.loader_context` so it can be used to just change the data directory. Momentum conversion has been rewritten using `xarray.apply_ufunc`, and is now dask-compatible. It also automatically determines the current energy axis (kinetic or binding).

kmnhan self-assigned this Apr 15, 2024

kmnhan added the enhancement New feature or request label Apr 16, 2024

kmnhan mentioned this issue Apr 16, 2024

2.3.0 Update #23

Merged

4 tasks

kmnhan added a commit that referenced this issue Apr 17, 2024

feat: add callable fit accessor using apply_ufunc

e06982d

Add a `Dataset.modelfit` and `DataArray.modelfit` accessor with similar syntax and output as `Dataset.curvefit`. Closes #22

kmnhan added a commit that referenced this issue Apr 18, 2024

feat: add callable fit accessor using apply_ufunc

11e3546

Add a `Dataset.modelfit` and `DataArray.modelfit` accessor with similar syntax and output as `Dataset.curvefit`. Closes #22

kmnhan linked a pull request Apr 20, 2024 that will close this issue

2.3.0 Update #23

Merged

4 tasks

kmnhan closed this as completed in #23 Apr 22, 2024
