
use float32 for images, and validate #1329

Merged: 10 commits merged into cta-observatory:master from fix/large_image_sizes on May 18, 2020

Conversation

@kosack (Contributor) commented May 14, 2020

  • Changes Containers for images to specify they should be float32
  • Call validate() when making the schema in HDF5TableWriter

So far this doesn't change the computation of the images; we still need to ensure ImageExtractor creates float32s.
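As a rough illustration of the first point, a hedged sketch (not the actual diff; it assumes Field's dtype/ndim arguments and uses a simplified example container):

import numpy as np
from ctapipe.core import Container, Field


class ExampleDL1CameraContainer(Container):
    """Illustrative only; the real containers live in ctapipe/containers.py."""

    # declaring the dtype lets the writer build a float32 column and lets
    # validate() reject images stored with the wrong precision
    image = Field(None, "per-pixel image in photoelectrons", dtype=np.float32, ndim=1)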

@kosack requested a review from watsonjj as a code owner May 15, 2020 11:38
@maxnoe (Member) commented May 15, 2020

@kosack I added a test that checks, for every container defined in ctapipe/containers.py, whether the default instance can be written using the HDF5 writer; this turned up some more missing units / bad defaults.

I also changed three time ranges into two fields, min / max, to avoid needless non-scalar fields.
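An illustrative sketch of such a test (the real one is in ctapipe/io/tests/test_hdf5.py; the collection logic here is an assumption):

import inspect

import pytest

import ctapipe.containers
from ctapipe.core import Container
from ctapipe.io import HDF5TableWriter

# collect every Container subclass visible in ctapipe/containers.py
all_containers = [
    cls
    for _, cls in inspect.getmembers(ctapipe.containers, inspect.isclass)
    if issubclass(cls, Container) and cls is not Container
]


@pytest.mark.parametrize("container_class", all_containers)
def test_default_container_is_writable(container_class, tmp_path):
    # writing the default instance fails if a field lacks units or has a bad default
    with HDF5TableWriter(str(tmp_path / "containers.h5"), group_name="data") as writer:
        writer.write("table", container_class())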

@vuillaut previously approved these changes May 15, 2020
@vuillaut self-requested a review May 15, 2020 11:53
@codecov bot commented May 15, 2020

Codecov Report

Merging #1329 into master will increase coverage by 0.13%.
The diff coverage is 98.90%.

Impacted file tree graph

@@            Coverage Diff             @@
##           master    #1329      +/-   ##
==========================================
+ Coverage   90.92%   91.05%   +0.13%     
==========================================
  Files         179      179              
  Lines       12164    12192      +28     
==========================================
+ Hits        11060    11102      +42     
+ Misses       1104     1090      -14     
Impacted Files Coverage Δ
ctapipe/calib/camera/flatfield.py 95.04% <ø> (ø)
ctapipe/core/container.py 88.35% <ø> (+8.90%) ⬆️
ctapipe/image/extractor.py 86.86% <ø> (ø)
ctapipe/io/tests/test_hdf5.py 97.79% <94.73%> (-0.20%) ⬇️
ctapipe/calib/camera/pedestals.py 94.94% <100.00%> (ø)
ctapipe/calib/camera/tests/test_calibrator.py 100.00% <100.00%> (ø)
ctapipe/containers.py 100.00% <100.00%> (ø)
ctapipe/core/tests/test_traits.py 100.00% <100.00%> (ø)
ctapipe/image/muon/intensity_fitter.py 93.00% <100.00%> (ø)
ctapipe/image/muon/ring_fitter.py 100.00% <100.00%> (ø)
... and 12 more

Continue to review full report at Codecov.

Legend - Click here to learn more
Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update c9eb246...5588fe4. Read the comment docs.

@kosack (Contributor, Author) commented May 15, 2020

Thanks, this looks good. I'll just update the ImageExtractor then to produce the right data type.

@kosack (Contributor, Author) commented May 15, 2020

I think the fix is just to change extract_around_peak, which currently returns float64s, to return float32s instead. @watsonjj is that enough?

An alternative would be to down-sample it afterward, but that is slower and probably not a good idea. If we rely on any operations being on 64-bit versions, then the answer would change when we re-read DL1 data stored in 32-bits and re-compute things, which is not a good design.
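A simplified sketch of that idea (the real extract_around_peak is a numba guvectorize kernel with a different signature; this only illustrates returning float32):

import numpy as np
from numba import njit


@njit(cache=True)
def extract_around_peak_f32(waveform, peak_index, width, shift):
    start = peak_index - shift
    end = start + width
    total = 0.0
    weighted = 0.0
    for i in range(start, end):
        if 0 <= i < waveform.size:   # guard against windows at the waveform edge
            total += waveform[i]
            weighted += waveform[i] * i
    peak_time = weighted / total if total > 0 else float(peak_index)
    # downcast once, so downstream code sees the same dtype that will be stored in DL1
    return np.float32(total), np.float32(peak_time)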

@kosack requested a review from HealthyPear as a code owner May 15, 2020 13:07
@HealthyPear (Member) commented:

> I think the fix is just to change extract_around_peak, which currently returns float64s, to return float32s instead. @watsonjj is that enough?
>
> An alternative would be to down-sample it afterward, but that is slower and probably not a good idea. If we rely on any operations being on 64-bit versions, then the answer would change when we re-read DL1 data stored in 32-bits and re-compute things, which is not a good design.

In TwoPassWindowSum I had to force the conversion (line 903); is this still needed? I was getting an error if I didn't convert to 32.

@kosack (Contributor, Author) commented May 15, 2020

> In TwoPassWindowSum I had to force the conversion (line 903); is this still needed? I was getting an error if I didn't convert to 32.

Do you recall what error you were getting? I don't see anything that would have forbidden 64 bits, in ctapipe at least (if it was in the prototype, you have your own IO system, and may have defined it as 32 bits).

@maxnoe (Member) commented May 15, 2020

@kosack I had to require an rtol of only 1e-4 to make that test pass with float32.

I guess 1e-4 on waveforms is still plenty, but are you sure we don't want to use float64 internally?

@HealthyPear (Member) commented:

> In TwoPassWindowSum I had to force the conversion (line 903); is this still needed? I was getting an error if I didn't convert to 32.

> Do you recall what error you were getting? I don't see anything that would have forbidden 64 bits, in ctapipe at least (if it was in the prototype, you have your own IO system, and may have defined it as 32 bits).

Ah yes, it is possible that was a ctapipe 0.7 issue... probably that conversion is not needed anymore.

As soon as we merge this, I will try taking it out to check that nothing happens (as I expect).

@kosack (Contributor, Author) commented May 15, 2020

> I guess 1e-4 on waveforms is still plenty, but are you sure we don't want to use float64 internally?

Perhaps, or else we need to think about restructuring operations to minimize the growth of floating-point error, which you can normally ignore in 64 bits, but not in 32.

@kosack (Contributor, Author) commented May 15, 2020

The worry I have is that if we store a 64-bit version in the Container (and down-sample in the writer), then running the analysis without writing a DL1 file would do all operations in 64 bits, while writing and re-reading would change them to 32 bits and thus give different results. So no matter what, we should write 32 bits to the container in memory, so further algorithms don't expect a higher bit depth.

@maxnoe (Member) commented May 15, 2020

I'll try to improve the numerical stability of the extractor.

@maxnoe (Member) commented May 15, 2020

When using numpy.sum in extract_around_peak, the relative error is one order of magnitude smaller (numpy.sum uses a middle-ground implementation between floating-point accuracy and speed, instead of a plain loop sum).

This increases runtime from 430 µs to 450 µs in the test I did. I think this is acceptable, right?

Should I push this?
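A small, illustrative way to see that effect (not from the PR; the printed magnitudes are typical, not exact):

import numpy as np

rng = np.random.default_rng(42)
samples = rng.uniform(0, 100, size=100_000).astype(np.float32)

reference = np.sum(samples, dtype=np.float64)  # effectively exact for this data

naive = np.float32(0.0)
for s in samples:
    naive += s                      # plain accumulating loop, rounds at every step

pairwise = np.sum(samples)          # float32 pairwise summation

print(abs(naive - reference) / reference)     # roughly 1e-5 to 1e-4
print(abs(pairwise - reference) / reference)  # roughly an order of magnitude smaller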

@kosack (Contributor, Author) commented May 15, 2020

> Should I push this?

Sounds good, I say go ahead. You might want to add a comment explaining why it is used, in case somebody changes it back later.

@watsonjj (Contributor) commented May 15, 2020

> When using numpy.sum in extract_around_peak, the relative error is one order of magnitude smaller (numpy.sum uses a middle-ground implementation between floating-point accuracy and speed, instead of a plain loop sum).
>
> This increases runtime from 430 µs to 450 µs in the test I did. I think this is acceptable, right?
>
> Should I push this?

I'm confused: what does using numpy.sum solve? I specifically didn't use it so that I have more control over the sum, i.e. to correctly sum samples when the window is at the edge of the waveform, and to not include negative values in the pulse_time calculation.

@kosack (Contributor, Author) commented May 15, 2020

It's just that now that we've switched to 32-bit images, we have to worry more about numerical rounding error, so the way things are summed is important.

It's not clear which algorithm the Numba version of np.sum uses, but I guess it's more stable, based on @maxnoe's test. A naive sum (loop over and accumulate) is usually not the ideal way to do it. What we could do instead is leave extract_around_peak() in 64 bit to avoid this, and always call it with

image, peak = extract_around_peak()
image = image.astype("float32")
peak = peak.astype("float32")

but then each extractor must do that, and there is some speed overhead.

@kosack (Contributor, Author) commented May 15, 2020

You can see that now, in 32 bit, the tests fail, since the difference is around 1e-3 while the tolerance is 1e-7:

>       np.testing.assert_allclose(event.dl1.tel[telid].image, y.sum(1))
E       AssertionError:
E       Not equal to tolerance rtol=1e-07, atol=0
E
E       Mismatched elements: 1613 / 2048 (78.8%)
E       Max absolute difference: 0.0012207
E       Max relative difference: 6.832399e-05
E        x: array([402.1624 , 694.62103, 981.598  , ..., 525.88116, 796.07776,
E              855.5047 ], dtype=float32)
E        y: array([402.16254, 694.6211 , 981.59845, ..., 525.881  , 796.07776,
E              855.50446], dtype=float32)

Or we just keep it with rtol=1e-4, as already implemented. It's up to you! @watsonjj you can decide, since you have to review this anyhow ;)

@watsonjj (Contributor) commented:

I see. I'm surprised at the difference!

@kosack (Contributor, Author) commented May 15, 2020

> I see. I'm surprised at the difference!

It was so much more difficult back in the days of 32-bit (or even 16-bit!) computers and low RAM! Using doubles used to have a large speed and memory impact, so we had to use 32-bit floats for everything and always think about numerical stability.

Anyhow, I think 1e-4 PE is still pretty reasonable, so maybe we don't do anything for now.

@maxnoe (Member) commented May 15, 2020

@watsonjj numpy sum is only used for the image sum, not the weighted average.

numpy.sum uses an algorithm optimized as a tradeoff between speed and floating-point accuracy, which reduces rounding errors.

The other extreme would be to use math.fsum, which keeps track of the rounding errors exactly and produces the sum without any rounding error.
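For illustration (not part of the PR), the difference is easy to check:

import math

import numpy as np

x = np.random.default_rng(1).uniform(0, 100, 10_000).astype(np.float32)
exact = math.fsum(x)                      # exactly rounded sum (apart from the final rounding)
print(abs(np.sum(x) - exact) / exact)     # small but nonzero for float32 input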

@kosack (Contributor, Author) commented May 15, 2020

See e.g. https://en.wikipedia.org/wiki/Kahan_summation_algorithm or https://code.activestate.com/recipes/393090/

I guess fsum is slow, and we don't really need that level of precision, so either stick with low precision (assuming we don't add too many more samples), or use np.sum, which I assume uses something like the Kahan algorithm [edit: see below].

From the numpy manual:

> For floating point numbers the numerical precision of sum (and np.add.reduce) is in general limited by directly adding each number individually to the result causing rounding errors in every step. However, often numpy will use a numerically better approach (partial pairwise summation) leading to improved precision in many use-cases. This improved precision is always provided when no axis is given. When axis is given, it will depend on which axis is summed. Technically, to provide the best speed possible, the improved precision is only used when the summation is along the fast axis in memory. Note that the exact precision may vary depending on other parameters. In contrast to NumPy, Python's math.fsum function uses a slower but more precise approach to summation. Especially when summing a large number of lower precision floating point numbers, such as float32, numerical errors can become significant. In such cases it can be advisable to use dtype="float64" to use a higher precision for the output.
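For reference, a minimal Kahan (compensated) summation looks like this (illustrative only; this PR went with np.sum's pairwise summation instead):

import numpy as np


def kahan_sum(values):
    total = np.float32(0.0)
    compensation = np.float32(0.0)        # running estimate of the lost low-order bits
    for value in values:
        y = np.float32(value) - compensation
        t = total + y                     # low-order bits of y are lost here...
        compensation = (t - total) - y    # ...and recovered here for the next step
        total = t
    return total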

@maxnoe (Member) commented May 15, 2020

So what should we do here?

I would say this is the last blocker for 0.8. Should we revert the numpy sum and increase the rtol or leave it like it is now?

@kosack @watsonjj

Would be nice to get it out today, pending the review of #1163

@watsonjj (Contributor) commented:

I think an rtol of 1e-4 is still fine for our purpose. There was no particular reason I previously used 1e-7.

@maxnoe (Member) commented May 16, 2020

We could also use numba.double to make the accumulator a double before assigning to the 32-bit array.
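A hypothetical sketch of that suggestion (not what was merged):

import numpy as np
from numba import njit


@njit(cache=True)
def sum_window(waveform, start, end):
    acc = np.float64(0.0)        # 64-bit accumulator inside the kernel
    for i in range(start, end):
        acc += waveform[i]       # waveform may be float32; accumulate in double
    return np.float32(acc)       # downcast only once, when storing the result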

@maxnoe (Member) commented May 18, 2020

@watsonjj why does the loop go over all samples and then do branching to check if 0 < i < n_samples?

I replaced this with min / max before the loop and it's now twice as fast.
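Roughly, the change being described (illustrative, not the exact diff):

import numpy as np


def window_sum_branching(waveform, start, end):
    """Original style: loop over the full window and branch on every sample."""
    total = np.float32(0.0)
    for i in range(start, end):
        if 0 <= i < waveform.size:
            total += waveform[i]
    return total


def window_sum_clipped(waveform, start, end):
    """Clip the bounds once before the loop; the inner loop stays branch-free."""
    total = np.float32(0.0)
    for i in range(max(0, start), min(waveform.size, end)):
        total += waveform[i]
    return total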

@watsonjj (Contributor) left a review comment:

Great, works for me.

@kosack merged commit 10b058f into cta-observatory:master May 18, 2020
@kosack deleted the fix/large_image_sizes branch October 2, 2020 09:57