
Alternative clamping for NegativeBinomialMeanClust #237

Closed
wants to merge 5 commits

Conversation

@damonbayer commented May 29, 2024

As requested, porting the clamping from https://github.com/damonbayer/immunity_semi_parametric_model/blob/d54162e1bda24950efdb5ce60da686cc47e2b36b/src/immunity_semi_parametric_model.jl#L9.

Some of the math in the existing code seemed a bit odd to me, so I made some simplifications:

`_μ^2 / ex_σ² = _μ^2 / (_α * _μ^2) = 1 / _α`

Perhaps I am missing something.
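For concreteness, a minimal before/after sketch of the cancellation (assuming, as in the existing code, that `ex_σ²` is computed as `_α * _μ^2`; the values are illustrative):

```julia
_μ, _α = 10.0, 0.5  # illustrative values

# before: compute the excess variance explicitly, then divide it back out
ex_σ² = _α * _μ^2
r = _μ^2 / ex_σ²

# after: the _μ^2 factors cancel algebraically
r = 1 / _α
```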

@SamuelBrand1 (Collaborator)

Hey @damonbayer !

Thanks for the contribution!

The underlying parametrisation we're using is determined by the variance-to-mean relationship:

$$\sigma^2 = \mu + \alpha \mu^2$$

And I was taught the $\alpha \mu^2$ term as the "excess" variance compared to a Poisson, so the code basically just reflects my year 1 probability lecturer...
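For the record, matching the moments of $\text{NegativeBinomial}(r, p)$ (mean $r(1-p)/p$, variance $r(1-p)/p^2$) against this relationship gives the parametrisation used in the code (a routine derivation, not quoted from the package docs):

$$p = \frac{\mu}{\sigma^2} = \frac{1}{1 + \alpha \mu}, \qquad r = \frac{\mu^2}{\sigma^2 - \mu} = \frac{1}{\alpha},$$

which is exactly the cancellation in the PR description.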

Your commit looks good and is maths-equivalent so just waiting on the CI.

@SamuelBrand1 self-requested a review May 29, 2024 09:15
@SamuelBrand1 (Collaborator) left a comment

Could you check the failure of the unit test here?

This could just be because the committed approach is better/auto-safe!

```
@@ -33,11 +33,8 @@ function NegativeBinomialMeanClust(μ, α)
    if isnan(μ) || isnan(α)
```
@SamuelBrand1 (Collaborator):

Out of interest, does `nextfloat` etc. mean we can get rid of the logic branching here? That would be good from the PoV of a compilable tape for reverse diff.

@damonbayer (Author):

Is your desire to replace the DiscreteUniform(0, 1_000_000) with a very wide NegativeBinomial? I'm not sure of the right way to do that, but the clamp method will not play nice with NaNs.
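A quick illustration of why the NaN branch can't simply be dropped (standard Julia behaviour, easy to check in a REPL): `clamp` passes NaN straight through, because NaN comparisons are always false.

```julia
julia> clamp(NaN, nextfloat(0.0), 1.0)  # NaN is neither < lo nor > hi
NaN
```

So a NaN mean would propagate into `NegativeBinomial` rather than being caught by the clamp.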

@seabbs (Collaborator) commented May 29, 2024

❤️

I think prior to merging we should add @damonbayer to the authors list (

```toml
authors = ["Samuel Abbott <[email protected]>", "Samuel Brand <[email protected]>", "Zachary Susswein <[email protected]>"]
```

plus anywhere else we have one).

Longer term we should have some kind of contributing policy for this, but the above practice is my usual default, so I think we should follow it until we have said policy in place.

@damonbayer (Author)

> Could you check the failure of the unit test here?
>
> This could just be because the committed approach is better/auto-safe!

Actually, it seems that the existing approach is safer. For the values in the test, the `clamp` does nothing, but the `rand` produces an `InexactError`.

```julia
julia> using Distributions  # needed for NegativeBinomial

julia> big_mu = 1e30
       alpha = 0.5
       α = alpha
       μ = big_mu
       r = clamp(1 / α, nextfloat(zero(α)), prevfloat(typemax(α)))
       p = clamp(1 / (1 + α * μ), nextfloat(zero(μ)), one(μ))
       rand(NegativeBinomial(r, p))
ERROR: InexactError: trunc(Int64, 1.3967586116300517e30)
```

Maybe it's just better to keep it as is.

@seabbs (Collaborator) commented May 29, 2024

Just noting I am really surprised that this is the case (and slightly sad). It's not entirely clear to me what action we should take here. Promote to an issue for investigation, or is there anything more to be done here?

@damonbayer (Author)

I have a related, longstanding issue in Distributions.jl: JuliaStats/Distributions.jl#1512

@SamuelBrand1 (Collaborator) commented May 30, 2024

So, burrowing into this: the place that throws the error is the call to `rand`, e.g.

```julia
using Distributions

big_mu = 1e30
alpha = 0.5

p = 1 / (1 + alpha * big_mu)  # mean-cluster parametrisation: p = 1 / (1 + αμ)
r = 1 / alpha
nb = NegativeBinomial(r, p)

rand(nb)
# ERROR: InexactError: trunc(Int64, 1.0211481865897263e30)
```

This is because Distributions generates a negative binomial draw as a Gamma–Poisson mixture (sketched below):

  1. Draw from a Gamma distribution with mean 1.
  2. Multiply the negative binomial mean by that Gamma draw.
  3. Draw from a Poisson with the resulting compound mean.
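A minimal sketch of that mixture representation, using only the textbook equivalence (the identity, not Distributions.jl's literal internal code path):

```julia
using Distributions

# NegativeBinomial(r, p) is a Gamma–Poisson mixture:
# λ ~ Gamma(r, (1 - p) / p) has mean r(1 - p)/p, the NB mean;
# Y | λ ~ Poisson(λ) then has the NegativeBinomial(r, p) marginal.
r, p = 2.0, 0.3
λ = rand(Gamma(r, (1 - p) / p))  # compound mean: huge whenever the NB mean is huge
y = rand(Poisson(λ))             # the draw that can hit trunc(Int, ...) and overflow
```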

Poisson sampling branches into a bunch of special cases for efficiency; in particular, for μ sufficiently large (i.e. >= 10.0), it uses:

J.H. Ahrens, U. Dieter (1982). "Computer Generation of Poisson Deviates from Modified Normal Distributions". ACM Transactions on Mathematical Software, 8(2):163-179.

This algorithm requires finding a value

`L = floor(Int, μ - 1.1484)`

and it's that `floor` which has the problem, because it uses `trunc` rather than `unsafe_trunc` or indeed `BigInt`.
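To see the failure mode in isolation (plain Julia, no Distributions involved; note that `1e30 - 1.1484` rounds back to `1e30` in Float64):

```julia
julia> floor(Int, 1e30 - 1.1484)     # trunc to Int64 overflows
ERROR: InexactError: trunc(Int64, 1.0e30)

julia> floor(BigInt, 1e30 - 1.1484)  # arbitrary precision: floors without error
1000000000000000019884624838656
```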

@SamuelBrand1 (Collaborator)

Note that `logpdf` is not affected, e.g.

```julia
logpdf(nb, 1000)
# -129.86005643920763
```

@SamuelBrand1 (Collaborator)

So the options here are:

  • Overload the rand function in some way (e.g. the sketch below),
  • Raise an issue with Distributions.jl,
  • Keep as is.
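A hypothetical sketch of the first option (the name `SafeNegativeBinomial` and the saturation threshold are illustrative, not existing EpiAware or Distributions.jl code):

```julia
using Distributions, Random

# Hypothetical wrapper: sample via the Gamma–Poisson mixture, saturating the
# compound mean before it can overflow trunc(Int, ...) in the large-μ Poisson sampler.
struct SafeNegativeBinomial{T <: Real} <: DiscreteUnivariateDistribution
    r::T
    p::T
end

function Random.rand(rng::Random.AbstractRNG, d::SafeNegativeBinomial)
    λ = rand(rng, Gamma(d.r, (1 - d.p) / d.p))  # compound Poisson mean
    return rand(rng, Poisson(min(λ, 1.0e15)))   # illustrative cap, well below typemax(Int64)
end

rand(SafeNegativeBinomial(2.0, 1e-18))  # no InexactError; extreme draws saturate instead
```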

@SamuelBrand1 (Collaborator) commented May 30, 2024

Another more radical possibility is that we accept that models that can sample > 1e17 infections (which is easier to hit in unbounded populations than one might expect, due to the RW $\log R_t$ and the magic of exponential growth) are bad, and enforce a max population size. Then these warm-up-phase problems should disappear in a more principled way.
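A hypothetical sketch of what that enforcement could look like (the bound `N_max` and all names here are illustrative, not EpiAware code):

```julia
# A random walk on log R_t compounds multiplicatively, so warm-up trajectories
# can reach absurd infection counts; a population bound stops this at the source.
N_max = 1e9                  # illustrative maximum population size
log_I = 10.0 + 100 * 0.4     # latent log-infections after a runaway RW warm-up
I_unbounded = exp(log_I)     # ≈ 5e21 — already past safe Int64 sampling territory
I = min(I_unbounded, N_max)  # bounded: the downstream NB mean can never explode
```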

@seabbs (Collaborator) commented May 30, 2024

> Another more radical possibility is that we accept that models that can sample > 1e17 infections (which is easier to hit in unbounded populations than one might expect, due to the RW $\log R_t$ and the magic of exponential growth) are bad, and enforce a max population size. Then these warm-up-phase problems should disappear in a more principled way.

So for this, see my comment in the issue; I think generally we should move discussion there.

Nice investigation.

> Raise an issue with Distributions.jl

This is my preference, I think, and for now we work around it?

@seabbs (Collaborator) commented Jun 11, 2024

My read of this is that we plan not to merge, so closing. @damonbayer it would be a joy to have another contribution from you :)

@seabbs closed this Jun 11, 2024