Poisson: Fix undefined behavior and support f64 output #795

vks · 2019-05-15T12:39:27Z

No description provided.

Before, we might trigger undefined behaviour if the sample gets too large.

Internally, we generate `f64`, so it makes sense to let the user access that result directly, instead of forcing a conversion from `u64`.

dhardy

Thanks.

Now the only question is whether to bother with an f32 version. I don't see a lot of point, but it would still be a breaking change to introduce later (because Poission becomes Poisson<f64>).

dhardy · 2019-05-15T13:10:50Z

rand_distr/src/poisson.rs

+impl Distribution<u64> for Poisson {
+    fn sample<R: Rng + ?Sized>(&self, rng: &mut R) -> u64 {
+        let result: f64 = self.sample(rng);
+        assert!(result >= 0.);


This is already guaranteed by the algorithm (see loop break condition), though the check is cheap enough relative to the algorithm that it doesn't matter.

Maybe I should add #[inline], so the check can be optimized away?

I think it makes sense to have the check to avoid undefined behavior, in case the the implementation somehow is wrong and returns negative numbers or nan.

It now supports `f32` as well.

vks · 2019-05-15T14:16:17Z

I added f32 sampling. We could probably have a specialized version of log_gamma for f32 that uses less precision.

rand_distr/src/poisson.rs

rand_distr/src/utils.rs

dhardy · 2019-05-15T14:27:02Z

rand_distr/src/utils.rs

@@ -91,33 +111,33 @@ impl Float for f64 {
 /// `Ag(z)` is an infinite series with coefficients that can be calculated
 /// ahead of time - we use just the first 6 terms, which is good enough
 /// for most purposes.
-pub(crate) fn log_gamma(x: f64) -> f64 {
+pub(crate) fn log_gamma<N: Float>(x: N) -> N {


This is not an optimal f32 implementation and I'm not sure it's even an acceptable one (e.g. the a parameter below rounds to 1). I would suggest just using f64 arithmetic and converting at function start/end, though there's very little point to supporting Poisson<f32> in this case (only really for the type deduction).

Additionally, this function prototype is okay for internal use but not if we wished to expose log_gamma since we could not add correct implementations for other float types without specialization. A trait would be better, though clunky since we only have one method.

Statrs's Gamma function implementations are likely better.

It essentially evaluates a polynomial, so I think it should be fine, even if the coefficients have less precision. I thinks it is more likely that too many coefficients are used for the target precision.

though there's very little point to supporting Poisson in this case (only really for the type deduction).

It might help for sampling from the uniform distribution.

How does this help with the uniform distribution?

We are doing some rejection sampling for the Poisson distribution, sampling f32 instead of f64 in the loop might help.

Ah, you mean speed up usage of uniform, not help implement uniform. Yes, and when we get a specialisation of log_gamma it may speed things up for any(?) situation where f32 is faster than f64.

dhardy · 2019-05-15T14:30:32Z

rand_distr/src/poisson.rs

@@ -137,29 +137,41 @@ mod test {
    fn test_poisson_10() {
        let poisson = Poisson::new(10.0).unwrap();
        let mut rng = crate::test::rng(123);
-        let mut sum = 0;
+        let mut sum_u64 = 0;
+        let mut sum_f64 = 0.;


What is the goal of this test? We know both values should be the same since (a) both should be non-negative integers and (b) we should not be going anywhere close to the limits/accuracy of either type.

The goal is to test that the code path works as expected, not the precision.

It's the same code path, aside from the extra cast.

I would like to trigger the path with the cast, making sure it works. I think it is important because of the potential undefined behavior.

vks · 2019-05-15T15:22:01Z

I changed log_gamma to always use f64. I think the previous implementation was okay though.

rand_distr/src/poisson.rs

dhardy · 2019-05-16T13:28:29Z

rand_distr/src/poisson.rs

@@ -67,6 +67,7 @@ where Standard: Distribution<N>
 impl<N: Float> Distribution<N> for Poisson<N>
 where Standard: Distribution<N>
 {
+    #[inline]


This is a large function — I don't think #[inline] makes any sense here.

This is not #[inline(always)], so the compiler is still free to make that choice. It might make sense to inline it into the u64 sampling, so that some bound checks can be eliminated.

vks added 2 commits May 15, 2019 14:27

rand_distr: Check bounds before returning Poisson sample

31aad14

Before, we might trigger undefined behaviour if the sample gets too large.

rand_distr: Add support for f64 to Poisson

c03e2c8

Internally, we generate `f64`, so it makes sense to let the user access that result directly, instead of forcing a conversion from `u64`.

dhardy mentioned this pull request May 15, 2019

Tracker: Rand 0.7 #715

Closed

22 tasks

vks added this to the 0.7 release milestone May 15, 2019

dhardy approved these changes May 15, 2019

View reviewed changes

vks added 3 commits May 15, 2019 15:57

rand_distr: Make Poisson generic

638b6be

It now supports `f32` as well.

rand_distr: Add tests for f64 sampling from Poisson

eec9bba

rand_distr: Add tests for f32 sampling from Poisson

90833ca

rand_distr: Encourage inlining of Float methods

2553cb5

dhardy reviewed May 15, 2019

View reviewed changes

vks added 2 commits May 15, 2019 17:02

Address review feedback

15b9a39

Always calculate log_gamma using f64

84d89a7

dhardy approved these changes May 15, 2019

View reviewed changes

dhardy reviewed May 16, 2019

View reviewed changes

rand_distr/src/poisson.rs Show resolved Hide resolved

rand_distr: Inline Poisson sampling

ec99801

dhardy reviewed May 16, 2019

View reviewed changes

dhardy merged commit c0b8722 into rust-random:master May 16, 2019

vks deleted the fp-poisson branch June 3, 2019 10:08

dhardy mentioned this pull request Jun 18, 2020

Migrate rand_distr to num-traits for no_std support #987

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Poisson: Fix undefined behavior and support f64 output #795

Poisson: Fix undefined behavior and support f64 output #795

vks commented May 15, 2019

dhardy left a comment

dhardy May 15, 2019

vks May 15, 2019

vks commented May 15, 2019

dhardy May 15, 2019

vks May 15, 2019

vks May 15, 2019

dhardy May 16, 2019

vks May 16, 2019

dhardy May 16, 2019

dhardy May 15, 2019

vks May 15, 2019

dhardy May 15, 2019

vks May 16, 2019

vks commented May 15, 2019

dhardy May 16, 2019

vks May 16, 2019

Poisson: Fix undefined behavior and support f64 output #795

Poisson: Fix undefined behavior and support f64 output #795

Conversation

vks commented May 15, 2019

dhardy left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

vks commented May 15, 2019

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

vks commented May 15, 2019

Choose a reason for hiding this comment

Choose a reason for hiding this comment