Refitting `model_final` and nuisance averaging #360

vsyrgkanis · 2021-01-02T04:57:03Z

Support refitting only final model in DML after changing estimator parameters
Add support for monte carlo nuisance estimation, with multiple k-fold draws.
added rlearner residuals_ property that returns fitted residuals for training data (fixes plotting the residuals T & Y from the model #350 )
fixed flaky cate interpreter test
added refit example in the dml notebook

This reverts commit deda008.

fixed small issue with _d_t storage enabled refitting in drlearner enabled refit in ortho_iv enabled monte_carlo_iterations in ortho_iv added cache_values param to ortho_iv fixed docstrings to remove the model parameters and added dosctrings to the abstract methods. Removed doctest examples from ortholearner and rlearner as these classes can no longer be used as standalone and are abstract classes

linting fixed failing tests fixed notebooks fixed ortholearner docstring reverted to allowing n_crossfit_splits and raising deprecation warning. fixed ortholearner doctest fixed bugs for failing tests fixed failing test bugs added extra kwargs to _strata old sklearn crashes with new scipy. we can revert once we enforce new sklearn disallowing bootstrap inference when refitting added many refit tests. fixed some leftover bugs based on new tests typo in tests changed monte_carlo_iterations to mc_iters and added mc_agg parameter to change the aggregation method {mean, median} fixed nuisance aggregation when nuisances have different dimensions removed _models_nuisance initilaization at init. added rlearner residuals_ property fixed flaky cate interpreter test

… the final stage is estimated.

kbattocchi

This is great! Left a few minor suggestions.

econml/_ortho_learner.py

econml/dml.py

econml/drlearner.py

setup.cfg

heimengqi

Looks good. I am a bit confused by the attribute name changes with heading and trailing underscore, but I checked the test code coverage, it's all being tested so should be fine.

econml/_ortho_learner.py

econml/cate_estimator.py

econml/drlearner.py

econml/cate_estimator.py

heimengqi · 2021-01-09T01:06:06Z

econml/dml.py

+
+    @property
+    def bias_part_of_coef(self):
+        return self.rlearner_model_final._fit_cate_intercept


Same here, why we need this attribute if it's same as fit_cate_intercept ? Another corner case I think we are missing now is from the parse_final_model_params, what if the user sets fit_cate_intercept=False but input the final model with fit_intercept=True, we should update our fit_cate_intercept depends on whether there is an intercept in the user defined final model?

fit_cate_intercept is different than the intercept of the final model. The fit_cate_intercept is aobut whether we augment X with an constant of 1's. The fit intercept of the final model is about fitting an extr aoffset in the regresstion not a cate offset.

Also observe that bias_part_of_coef, returns the fit_cate_intercept based on the fit time. See the implementation in of bias_part_of_coef in dml. It doesn't read the fit_cate_intercept, but it reads the parameter that was stored when fit was called.
So there wont be an inconsitency here. At least that was a corner case that I tried to address.

Co-authored-by: Keith Battocchi <[email protected]>

…lis/refit

kbattocchi

Looks good, just one small question.

econml/dml/dml.py

kbattocchi and others added 7 commits December 22, 2020 09:26

Support refitting in DML

a3adef1

Add support for monte carlo nuisance estimation

d3520ee

Address PR feedback

b4a6bdf

Address monte carlo feedback

7b98d46

Refit test fixes

f636dab

added RScorer

deda008

Revert "added RScorer"

b7470f2

This reverts commit deda008.

vsyrgkanis requested a review from kbattocchi January 2, 2021 04:57

vsyrgkanis force-pushed the vasilis/refit branch 5 times, most recently from acd4d11 to 39634db Compare January 2, 2021 22:00

vsyrgkanis force-pushed the vasilis/refit branch from 39634db to cf7122a Compare January 2, 2021 23:58

Merge branch 'master' into vasilis/refit

72a297a

vsyrgkanis force-pushed the vasilis/refit branch 3 times, most recently from b315e2e to cc2ae24 Compare January 4, 2021 02:36

vsyrgkanis force-pushed the vasilis/refit branch from cc2ae24 to ba0dbf0 Compare January 4, 2021 03:23

vasilismsr added 2 commits January 3, 2021 22:56

changed refit to refit_final to make sure that it's obvious that only…

3b37a4c

… the final stage is estimated.

added refit example in the dml notebook

d29f8a3

vsyrgkanis marked this pull request as ready for review January 4, 2021 21:19

vsyrgkanis requested review from heimengqi and moprescu January 4, 2021 21:19

vsyrgkanis added the enhancement New feature or request label Jan 4, 2021

vsyrgkanis changed the title ~~Vasilis/refit~~ Refitting model_final and nuisance averaging Jan 6, 2021

kbattocchi mentioned this pull request Jan 6, 2021

Add support for refitting and nuisance averaging #343

Closed

kbattocchi requested changes Jan 8, 2021

View reviewed changes

heimengqi approved these changes Jan 9, 2021

View reviewed changes

vsyrgkanis and others added 15 commits January 8, 2021 21:03

Update econml/dml.py

ad27125

Co-authored-by: Keith Battocchi <[email protected]>

addressed review comments. Deprecated positional arguments at init.

031ffb0

Merge branch 'vasilis/refit' of github.com:microsoft/EconML into vasi…

12c5518

…lis/refit

linting

8a49619

docstring fixes

bad9db9

docstring fixes

1951a31

fixed failing tests due to deprecation of positional

b523ef0

fixed failing notebook due to deprecation of positional

d913aae

fixed failing test due to positional deprecation

dafa116

merged with master

b52408f

fixed relative import. fixed drlearner test featurizer access

c553795

fixed merge bugs

e35305f

fixed bugs from merge

e9f3f34

fixed docstring

222d297

added refit re-implementation in causalforest dml

7f437c1

kbattocchi approved these changes Jan 10, 2021

View reviewed changes

econml/dml/dml.py Outdated Show resolved Hide resolved

vsyrgkanis merged commit 9a96875 into master Jan 10, 2021

vsyrgkanis deleted the vasilis/refit branch January 10, 2021 22:26

kbattocchi mentioned this pull request Jan 22, 2021

Add a Monte Carlo wrapper to OrthoLearner #238

Closed

kbattocchi mentioned this pull request Feb 18, 2021

Get first stages residuals #321

Closed

kbattocchi mentioned this pull request Mar 28, 2022

We might want to consider running Monte-Carlo over the folds #311

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Refitting `model_final` and nuisance averaging #360

Refitting `model_final` and nuisance averaging #360

vsyrgkanis commented Jan 2, 2021 •

edited

Loading

kbattocchi left a comment

heimengqi left a comment

heimengqi Jan 9, 2021

vsyrgkanis Jan 9, 2021

vsyrgkanis Jan 9, 2021

kbattocchi left a comment

Refitting model_final and nuisance averaging #360

Refitting model_final and nuisance averaging #360

Conversation

vsyrgkanis commented Jan 2, 2021 • edited Loading

kbattocchi left a comment

Choose a reason for hiding this comment

heimengqi left a comment

Choose a reason for hiding this comment

heimengqi Jan 9, 2021

Choose a reason for hiding this comment

vsyrgkanis Jan 9, 2021

Choose a reason for hiding this comment

vsyrgkanis Jan 9, 2021

Choose a reason for hiding this comment

kbattocchi left a comment

Choose a reason for hiding this comment

Refitting `model_final` and nuisance averaging #360

Refitting `model_final` and nuisance averaging #360

vsyrgkanis commented Jan 2, 2021 •

edited

Loading