SMC + FIVO implementation #16

andrewwarrington · 2021-12-06T22:09:38Z

Main components:

SMC implementation:
- Self-contained code in ssm -> inference -> smc.py.
- Implements both BPFs and SMC.
- To use proposals, the proposal must be passed in as a callable with a fixed set of arguments.
- Returns an object of type SMCPosterior that contains all sweep information.
- Can be vmaped pretty easily to do repeated sweeps or over different trials.
- Accompanying notebook in ssm -> notebooks -> smc-lds.ipynb
- PyTest scripts in tests -> inference -> smc.py
  - One short test that basically tests that everything compiles, runs, and returns the right shape.
  - One longer test that tests that the estimated marginal likelihood is correct. (Marked by @PyTest.mark.slow)
Conditional generators and proposals:
- ssm -> inference -> conditional_generators.py contains a template implementation for a flexible implementations of conditional distributions.
- The example included is for a conditional independent multivariate Gaussian.
- The structure is a bit wild, but you call the build_independent_gaussian_generator which then builds a linen neural network module with the prescribed trunk function, and mean and variance head functions.
- There are sometimes some problems with jitting this because of the way I build it, but it is sufficient for now.
- ssm -> inference -> proposals.py then wraps a call to a conditional generator with the prescribed trunk function, and mean and variance head functions, and also contains functions for formatting the input and output to the proposal. These input and output wrappers would need to be changed for different proposal structures or input templates.
- There can either be a single proposal, or, multiple stacked proposals. To use multiple proposals (indicated by proposal_type = 'INDEPENDENT'), there must be as many proposals as there are time steps, and then the timestep indexes the proposal to use.
FIVO:
- ssm -> inference -> fivo.py contains some helper functions for running FIVO on a model.
- FIVO uses the SMC sweep to compute a biased estimate of the expected log marginal likelihood.
- The amount of boilerplate code for implementing FIVO is fairly minimal, with a lot of model-specific configuration stuff to be implemented by the user. There is therefore a template FIVO implementation in the accompanying notebook ssm-> notebooks -> fivo-lds.ipynb.
- For this code I have introduced a ._parameters paradigm. Using boilerplate code this will capture the default calling arguments when the model is initialised. This then allows for a shallow tree-flatten and un-flatten to be performed, but using the named and interpretable calling arguments.
  - Requires that the inputs are unconstrained parameters. Also silently requires that the inputs are "leaf-like-variables/nodes", which if not satisfied may cause it to silently fail.
  - This will need to be updated to something more thorough, but it is good enough for the time being.
- The model parameters to be learned are designated using the string values (which will then pull the values out of ._parameters, or inject the values into a new instantiation of the model.
- Resampling gradients are currently commented out (and will throw a loud NotImplementedError).
Added a few little bells and whistles into utils.py and started a utility file for neural networks stuff nn_util.py.

Obviously the FIVO code only works for Gaussian LDS's at the moment. The SMC code should work for everything. There are only independent Gaussian proposals defined at the moment, we should look to add more types of proposal.

I am reasonably confident in this current implementation. But we should sit down and do as thorough-er code review as you like.

A

… isnt it.;

… is resolved by reducing the emission covariance, which suggests that maybe it isnt actually a shift by one, but is a shift by some parameteric amount) and why the evidence approximations converge to the wrong value

… i expected though. it consistently overestimates the evidence for higher initial and emission covariances, which is kind of weird. but squashing these down and increasing the number of particles and the evidence approximations converge.

…y to present some of the analysis

…lly is

…lotting code in here for the time being.

…t does appear to work though.

…he functions so that it can be notebook-ized and does not require duplicating a ton of code

… Snax

…dity that the bound with proposals cant become tight. '

…icient jitting

github-actions · 2021-12-06T22:11:03Z

Unit Test Results

  1 files ±  0   1 suites ±0 10m 16s ⏱️ - 20m 47s
38 tests - 34 38 ✔️ - 34 0 💤 ±0 0 ❌ ±0

Results for commit 0111b9e. ± Comparison against base commit d84b3fe.

This pull request removes 72 and adds 38 tests. Note that renamed tests count towards both.

tests.timing_comparisons.test_time_hmm.TestGaussianARHMM ‑ test_arhmm_em_fit_emissions_dim[10]
tests.timing_comparisons.test_time_hmm.TestGaussianARHMM ‑ test_arhmm_em_fit_emissions_dim[12]
tests.timing_comparisons.test_time_hmm.TestGaussianARHMM ‑ test_arhmm_em_fit_emissions_dim[2]
tests.timing_comparisons.test_time_hmm.TestGaussianARHMM ‑ test_arhmm_em_fit_emissions_dim[4]
tests.timing_comparisons.test_time_hmm.TestGaussianARHMM ‑ test_arhmm_em_fit_emissions_dim[6]
tests.timing_comparisons.test_time_hmm.TestGaussianARHMM ‑ test_arhmm_em_fit_emissions_dim[8]
tests.timing_comparisons.test_time_hmm.TestGaussianARHMM ‑ test_arhmm_em_fit_latent_dim[10]
tests.timing_comparisons.test_time_hmm.TestGaussianARHMM ‑ test_arhmm_em_fit_latent_dim[12]
tests.timing_comparisons.test_time_hmm.TestGaussianARHMM ‑ test_arhmm_em_fit_latent_dim[2]
tests.timing_comparisons.test_time_hmm.TestGaussianARHMM ‑ test_arhmm_em_fit_latent_dim[4]
…

tests.arhmm.test_arhmm ‑ test_gaussian_arhmm_em_fit
tests.arhmm.test_arhmm ‑ test_gaussian_arhmm_jit
tests.arhmm.test_arhmm ‑ test_gaussian_arhmm_sample
tests.arhmm.test_arhmm ‑ test_gaussian_arhmm_sample_is_consistent
tests.hmm.test_hmm ‑ test_bernoulli_hmm_em_fit
tests.hmm.test_hmm ‑ test_bernoulli_hmm_jit
tests.hmm.test_hmm ‑ test_bernoulli_hmm_sample
tests.hmm.test_hmm ‑ test_bernoulli_hmm_sample_is_consistent
tests.hmm.test_hmm ‑ test_gaussian_hmm_em_fit
tests.hmm.test_hmm ‑ test_gaussian_hmm_jit
…

♻️ This comment has been updated with latest results.

…est script. also added the flexibility for smc posterior to handle states without a latent dimension. also added some of the infrastructure for using arbitrary pytrees in the proposal. i figure that if this is the case then the user has created the _front-end_ for handling such a pytree as output

…machines, so this will need disabling

andrewwarrington · 2021-12-08T22:50:46Z

Coolio, so, I've added some extra tests (FIVO and SMC in some discrete models), and I've fixed up some of the interface and tools stuff Collin and I spoke about. Holla and let me know :)

… a couple more tests/checks

…put shape

andrew warrington added 30 commits November 9, 2021 15:57

adding initial skeleton of SMC code.

ef53462

implemented systematic resampling which is a barrel of fucking laughs…

7fcfa37

… isnt it.;

beginning to make notebook actually useful

effd304

stripped out a lot of the test code from the code SMC code

aba5a84

caught bug numero uno

c2ede0e

touching up the notebook a little bit. not 100pc sure on the right wa…

59b1c97

…y to present some of the analysis

reorganise

b0b3aad

as ever, the bug was not statistical, it was user error 🤦

214162e

adding sgr flag for future use

019ff0c

starting to work through issues raised in the PR

57c7f59

drafted test, but not too sure what the best metric for passing actua…

7235cf8

…lly is

refactoring a little, adding a bunch of comments

c9850f5

minor

a3b9813

minor

3122b02

removing errant file

14d7b79

refactored the proposal definition a little bit, and also slung the p…

4f7c5bd

…lotting code in here for the time being.

adding possibly the most convoluted implementation of FIVO to date. i…

ff04356

…t does appear to work though.

minor

04d25dd

rejigged some stuff. nothing major

20fd3a7

adding number of particles as a static argnum

55649e2

fivo seems to be working okay. proposal is very slow to learn

e995477

going to refactor a little bit. need to be able to separate out all t…

bd97c70

…he functions so that it can be notebook-ized and does not require duplicating a ton of code

refactored so we can put it into a notebook moreeffectively

6dbb90e

adding first pass of a jax-ssm-fivo-notebook

3332cdd

minor refactoring

5b881ad

built nones throughout the computation

c3be8c5

minor

b92eca1

going to do a bit of refactoring to use Linen as a backend instead of…

a23b7b2

… Snax

andrew warrington added 9 commits December 3, 2021 11:55

just keep rollin' (rollin' rollin') changes forward. there is some od…

cba2524

…dity that the bound with proposals cant become tight. '

minor refactors. explicitly adding num datasets to allow for more eff…

96fe012

…icient jitting

add clocking function

dbc9b88

bring notebook into line

0d8ebad

sort plotting out

775e67e

bringing tests up to speed

9d01b04

really hiding the resampling grad stuff

ace8dd3

strip out old test code

a50ffae

adding ._parameters attribute to otherm model types

71b0b61

adding flax

a61c96e

schlagercollin requested review from schlagercollin and slinderman December 7, 2021 00:34

andrew warrington added 8 commits December 7, 2021 13:06

semi merge

3bec692

adding fivo test. the convergence test is slow, especially on slower …

b7a9b49

…machines, so this will need disabling

speed up the test

69da5b7

thatll do it...

56997df

rename to pytest finds them

dc5f936

bumping some stock stuff off to a utils file.

1531d26

Merge branch 'fivo-aux' into smc

92de3ad

andrew warrington and others added 8 commits December 9, 2021 13:04

moving posterior into smc and reconciling some changes

5c3fce9

rolling forward some final formatting for the SMCPosterior and adding…

0111b9e

… a couple more tests/checks

minor cleanup in smc

7662194

removing num_datasets. it will statically compile based off of the in…

aa4f032

…put shape

Merge branch 'smc' of github.com:lindermanlab/ssm-jax-refactor into smc

323ff7d

moved smc plotting into plots.py

18c19f5

Merge branch 'main' into smc

6064282

replace vectorize pytree with jax flatten op

ab97563

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

SMC + FIVO implementation #16

SMC + FIVO implementation #16

andrewwarrington commented Dec 6, 2021

github-actions bot commented Dec 6, 2021 •

edited

Loading

andrewwarrington commented Dec 8, 2021

SMC + FIVO implementation #16

Are you sure you want to change the base?

SMC + FIVO implementation #16

Conversation

andrewwarrington commented Dec 6, 2021

github-actions bot commented Dec 6, 2021 • edited Loading

Unit Test Results

andrewwarrington commented Dec 8, 2021

github-actions bot commented Dec 6, 2021 •

edited

Loading