Use biased autocorrelation estimate by default #1785

jonny-so · 2024-04-19T12:42:54Z

I was noticing some very erratic and unexpected behaviour from the effective_sample_size diagnostic, which was due to some extreme values in the far right tail of the autocorrelation function. This can be reproduced pretty easily by plotting the autocorrelation of a sequence of 1000 IID Gaussians.

This behaviour seems to be due to the following line

numpyro/numpyro/diagnostics.py

Line 130 in 2f1bccd

autocorr = autocorr / np.arange(N, 0.0, -1)

which I believe is there to make the estimate unbiased. Stan however uses the biased estimate of Geyer (1992) (see https://github.com/stan-dev/stan/blob/634034deb3abd6314d980c1aab083f64269f4019/src/stan/analyze/mcmc/autocovariance.hpp#L60), presumably to stop the issue of the variance in the right tail exploding.

I have local changes which (optionally) use the biased estimate. I think this should be the default, partly to be consistent with Stan, but also because it seems too erratic to be useful with the current behaviour. It seems to be an intentional departure from the Stan implementation however, so I thought best to open an issue here to discuss.

The text was updated successfully, but these errors were encountered:

fehiepsi · 2024-04-29T21:59:37Z

Wow, this is subtle. It would be great if you could contribute a PR, @jonny-so! Could you also add some tests to illustrate that the biased estimate behaves better in some cases? Thanks!

jonny-so · 2024-10-05T20:19:08Z

I created a PR. Also, just to correct the record having read some more -- the biased estimator was discussed, but not proposed by Geyer (1992). It is also discussed in depth by Priestley (1981, Section 5.3), who cites Parzen (1961) and Schaerf (1964). I was also wrong to say that the variance explodes in the tail, it just remains O(1) as the chain length grows, leading to "wild" tails (Priestly, 1981).

fehiepsi added the enhancement New feature or request label Apr 29, 2024

fehiepsi mentioned this issue Jul 4, 2024

autocorrelation function #1828

Closed

fehiepsi added the help wanted Extra attention is needed label Aug 1, 2024

jonny-so added a commit to jonny-so/numpyro that referenced this issue Oct 5, 2024

use biased autocorrelation estimate by default (resolves pyro-ppl#1785)

5344700

fehiepsi closed this as completed in e462292 Oct 6, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Use biased autocorrelation estimate by default #1785

Use biased autocorrelation estimate by default #1785

jonny-so commented Apr 19, 2024 •

edited

Loading

fehiepsi commented Apr 29, 2024

jonny-so commented Oct 5, 2024

Use biased autocorrelation estimate by default #1785

Use biased autocorrelation estimate by default #1785

Comments

jonny-so commented Apr 19, 2024 • edited Loading

fehiepsi commented Apr 29, 2024

jonny-so commented Oct 5, 2024

jonny-so commented Apr 19, 2024 •

edited

Loading