Ucts fix to account for wrongly tagged pedestal events due to skipping events #497

moralejo · 2020-07-31T11:33:04Z

This introduces an offline fix for the shifting of the UCTS info by 1-event jumps.

It also makes the DL1 output contain all events, not just those with valid image parameters. I think this is much better, since it allows studying e.g. interleaved events in the standard DL1 files.

moralejo · 2020-07-31T11:34:22Z

By the way, in order to work, the fix needs to have the proper dragon timestamps, so the r0 to dl1 step has to be launched with the command-line options to build them.

contrera · 2020-07-31T11:51:43Z

Is this correction compatible with running the subruns in parallel ?

DirkHoffmann · 2020-07-31T13:06:42Z

> Is this correction compatible with running the subruns in parallel ?

No, the comparisons and corrections need a strictly sequential event order, N → N+1 (and later N → N+2 etc., if the skip happens multiple times in the same run).

Unless someone has found a brilliant solution for that.

moralejo · 2020-07-31T13:11:48Z

The correction is indeed compatible with running the subruns in parallel. But it needs the same input that is required to build a proper dragon time stamp: some synchronization info passed through the command line. This is now done automatically in the onsite analysis, so no problem there.

DirkHoffmann · 2020-07-31T13:25:47Z

Sorry, first of all I must admit that I mis-read "subruns" and understood streams (sub-files 1-4).

But splitting a run into several parts still seems not obvious to me:

> The correction is indeed compatible with running the subruns in parallel.

How do you decide to shift the event fragments of later subruns (and by how much), if you have not yet processed all previous subruns of the same run, in order to know if there was a shift (or more)?

moralejo · 2020-07-31T13:29:52Z

> Sorry, first of all I must admit that I mis-read "subruns" and understood streams (sub-files 1-4).
>
> But splitting a run into several parts still seems not obvious to me:
>
> > The correction is indeed compatible with running the subruns in parallel.
>
> How do you decide to shift the event fragments of later subruns (and by how much), if you have not yet processed all previous subruns of the same run, in order to know if there was a shift (or more)?

The trick is that the process which launches each subrun gets information from the daily check which has already gone through the whole run. It somehow identifies in each subrun how to build the absolute dragon timestamp for the first event in the subrun (or the first with valid ucts info to be precise), and this is passed to the r0_to_dl1 process through two command-line arguments. With valid dragon time stamps we can compare the times with ucts and search for the jumps.
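As a rough illustration of the comparison described above (a toy sketch, not the code in this PR): assume each jump loses one UCTS record, so that from then on the value stored with an event belongs to a later event and runs ahead of that event's Dragon time. The function name `realign_ucts`, the `tolerance` parameter and the direction of the shift are all assumptions made for this example.

```python
from collections import deque


def realign_ucts(dragon_times, ucts_times, tolerance=1e-6):
    """Realign UCTS times to events using the Dragon times as reference.

    Toy model (assumption): each jump loses one UCTS record, so from then
    on the value stored with an event belongs to a later event and runs
    ahead of that event's Dragon time. Events whose own UCTS record was
    lost get None.
    """
    pending = deque()  # stored UCTS values not yet assigned to an event
    realigned = []
    for dragon, ucts in zip(dragon_times, ucts_times):
        pending.append(ucts)
        if pending[0] - dragon > tolerance:
            # Oldest unassigned value is ahead of this event: a new jump.
            realigned.append(None)
        else:
            realigned.append(pending.popleft())
    return realigned


# Seven events, one per second; the UCTS records of events 2 and 5 were lost.
dragon = [0.0, 1.0, 2.0, 3.0, 4.0, 5.0, 6.0]
stored = [0.0, 1.0, 3.0, 4.0, 6.0, 7.0, 8.0]
print(realign_ucts(dragon, stored))
# -> [0.0, 1.0, None, 3.0, 4.0, None, 6.0]
```

The buffer grows by one entry per jump, which is why the comparison has to see the events in order within the stretch of data being processed.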

moralejo · 2020-07-31T13:40:43Z

Ok, tests fail because I now write out all events to the DL1 output, and it is comparing one so-produced dl1 file, with a fixed test one which contains only survivors of the image parametrization...

contrera · 2020-07-31T13:44:22Z

OK. I think right now the onsite analysis is not yet ready for this, we only consider one correction per run. Adapting the code to it may take time.

moralejo · 2020-07-31T13:46:35Z

> OK. I think right now the onsite analysis is not yet ready for this, we only consider one correction per run. Adapting the code to it may take time.

No, I think it is ready. The dragon time stamps are built with one correction per subrun, I believe, at least Isidro provides them like that. And I think we could not have built at all the dragon timestamp (and hence detect the Crab pulsar) without that.

moralejo · 2020-07-31T14:53:54Z

> No, I think it is ready. The dragon time stamps are built with one correction per subrun, I believe, at least Isidro provides them like that. And I think we could not have built at all the dragon timestamp (and hence detect the Crab pulsar) without that.

After discussion with @contrera: it is not correct that it is 'one correction per subrun'; since the Dragon counter is reliable, we only need one fixed point in the whole run which connects Dragon counter and absolute time. And so the same fixed point (from subrun 0) is used for all subruns.

This however does not mean the UCTS event shift correction introduced in this PR will not work on a subrun-wise basis: only the first N events of a subrun cannot be corrected, with N being the number of jumps in the UCTS info that have happened in previous subruns. Since at least in current data these jumps are rare, I think this is a very minor limitation.
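The size of this limitation can be worked out with a few lines of arithmetic. The sketch below is illustrative only: the function name, the fixed number of events per subrun and the jump positions are hypothetical values, not taken from real data.

```python
def uncorrectable_per_subrun(jump_event_indices, events_per_subrun, n_subruns):
    """For each subrun, count the jumps that happened in earlier subruns.

    That count is the number of events at the start of the subrun that
    cannot be corrected when subruns are processed independently.
    """
    counts = []
    for subrun in range(n_subruns):
        first_event = subrun * events_per_subrun
        counts.append(sum(1 for j in jump_event_indices if j < first_event))
    return counts


# Hypothetical run: 4 subruns of 1000 events, jumps at events 120 and 2500.
print(uncorrectable_per_subrun([120, 2500], 1000, 4))
# -> [0, 1, 1, 2]
```

With rare jumps, the loss is a handful of events per subrun at most.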

moralejo · 2020-07-31T15:31:09Z

> Ok, tests fail because I now write out all events to the DL1 output, and it is comparing one so-produced dl1 file, with a fixed test one which contains only survivors of the image parametrization...

The test that fails is actually not against a fixed file, but against a file produced with lstchain_mc_dl1ab (i.e. opening the DL1 file and recalculating image parameters).

…mtel, with those in one produced from the very same DL1 (redoing the cleaning and parametrization) is now done only for events in which intensity is not nan, since the former file may now contain events in which the Hillas parametrization was unsuccessful
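A comparison restricted to events with a valid intensity could look roughly like the sketch below. It uses plain Python lists of dictionaries rather than the DL1 tables the real test reads, and the function name, key names and tolerance are assumptions for illustration.

```python
import math


def parametrized_events_match(reference, recomputed,
                              keys=("intensity", "width", "length"),
                              rel_tol=1e-9):
    """Compare two event lists, skipping events without valid intensity."""
    if len(reference) != len(recomputed):
        return False
    for ref, new in zip(reference, recomputed):
        if math.isnan(ref["intensity"]):
            continue  # Hillas parametrization failed: nothing to compare
        for key in keys:
            if not math.isclose(ref[key], new[key], rel_tol=rel_tol):
                return False
    return True


reference = [
    {"intensity": 250.0, "width": 0.04, "length": 0.10},
    {"intensity": math.nan, "width": math.nan, "length": math.nan},
]
recomputed = [
    {"intensity": 250.0, "width": 0.04, "length": 0.10},
    {"intensity": math.nan, "width": math.nan, "length": math.nan},
]
print(parametrized_events_match(reference, recomputed))
```

Skipping on nan is needed because nan never compares equal to anything, including itself, so an unmasked comparison would always fail on unparametrized events.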

contrera · 2020-07-31T16:19:17Z

> > No, I think it is ready. The dragon time stamps are built with one correction per subrun, I believe, at least Isidro provides them like that. And I think we could not have built at all the dragon timestamp (and hence detect the Crab pulsar) without that.
>
> After discussion with @contrera: it is not correct that it is 'one correction per subrun'; since the Dragon counter is reliable, we only need one fixed point in the whole run which connects Dragon counter and absolute time. And so the same fixed point (from subrun 0) is used for all subruns.
>
> This however does not mean the UCTS event shift correction introduced in this PR will not work on a subrun-wise basis: only the first N events of a subrun cannot be corrected, with N being the number of jumps in the UCTS info that have happened in previous subruns. Since at least in current data these jumps are rare, I think this is a very minor limitation.

You are right. Good. Then it can run in parallel and modifications are minimal. :-)

lstchain/reco/r0_to_dl1.py

…ning and parametrization to the DL1 file is too difficult because the (deprecated) DL1ParametersContainer is a mess. Basically, a reset() leaves it in a state (initialized with None's, no units) which is incompatible with writing it later. And I do not think it is worth to fill manually all the values for empty events, given the container is deprecated.

moralejo · 2020-07-31T19:09:12Z

It turns out that writing the events which do not survive cleaning and parametrization to the DL1 file is pretty difficult, because the (deprecated) DL1ParametersContainer is a mess. Basically, a reset() leaves it in a state (initialized with Nones, no units) that is incompatible with writing it later. That seems to me an ugly feature; can anyone think of an easy fix worth applying to a deprecated container?
I found a way of doing that, so we now write all events to the DL1 file, whether or not they survived cleaning. Tests are passing now; for that I had to slightly modify lstchain_mc_dl1ab so that the default values for non-parametrized images are the same.

codecov · 2020-07-31T19:11:42Z

Codecov Report

Merging #497 into master will decrease coverage by 0.02%.
The diff coverage is 32.25%.

@@            Coverage Diff             @@
##           master     #497      +/-   ##
==========================================
- Coverage   41.58%   41.56%   -0.03%     
==========================================
  Files          77       77              
  Lines        6428     6448      +20     
==========================================
+ Hits         2673     2680       +7     
- Misses       3755     3768      +13

| Impacted Files | Coverage Δ |
| --- | --- |
| lstchain/scripts/lstchain_mc_dl1ab.py | `0.00% <0.00%> (ø)` |
| lstchain/reco/r0_to_dl1.py | `63.66% <43.47%> (-1.31%)` ⬇️ |

Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Last update 9815214...5fedbd4.

moralejo · 2020-08-01T09:24:19Z

Note: as of now, the correction of the UCTS info is done quite late in the loop, after calibration and parametrization, just because the needed dragon time is calculated there. Obviously this is inconvenient if one wants to use the info from interleaved pedestals, e.g. in the cleaning. Eventually we have to move both the dragon time calculation and the UCTS info shifting to a function called right after an event is read from the event source, or even put it directly into the event source, if that is possible. For now it is still very useful as it is, because we will have all calibrated images properly tagged in the DL1 output, so one can redo the cleaning starting from DL1 files to optimize cleaning strategies.

maxnoe · 2020-08-01T11:16:31Z

lstchain/reco/r0_to_dl1.py

+        # integer parameters.
+        #
+        for key in dl1_container.keys():
+            dl1_container[key] = u.Quantity(0, dl1_container.fields[key].unit)


Putting in values that might be in the valid range for parameters as missing value indicators is never good and should be avoided.

The right thing to do is to use the correct defaults in the containers.

For floats, that is nan; for quantities it is u.Quantity(np.nan, unit); for positive integers it could be -1; for general integers there is no general solution, and care must be taken in choosing a missing value indicator.

We did all this for the ctapipe containers in version 0.8; we only missed one container for muon results, which is fixed in the current master.

The same approach should be adopted in lstchain.
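The point about missing-value defaults can be shown with a small standard-library sketch. `ToyImageParameters` is a hypothetical stand-in, not the lstchain or ctapipe container; plain floats in metres replace the `u.Quantity(np.nan, unit)` defaults mentioned above, so that the example needs no third-party packages.

```python
import math
from dataclasses import dataclass


@dataclass
class ToyImageParameters:
    """Hypothetical stand-in for a DL1 parameters container."""
    intensity: float = math.nan   # float: nan marks "not computed"
    width_m: float = math.nan     # in metres; nan marks "not computed"
    n_pixels: int = -1            # a count is never negative, so -1 is safe


def mean_width(events):
    """Average width over events that were actually parametrized."""
    widths = [e.width_m for e in events if not math.isnan(e.width_m)]
    return sum(widths) / len(widths) if widths else math.nan


events = [
    ToyImageParameters(intensity=250.0, width_m=0.04, n_pixels=12),
    ToyImageParameters(),  # cleaning or parametrization failed
    ToyImageParameters(intensity=900.0, width_m=0.08, n_pixels=30),
]
print(mean_width(events))
```

Had the failed event carried a width of 0, it would be indistinguishable from a measured value and would silently pull the mean down unless every consumer remembered to cut on intensity first.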

moralejo · 2020-08-30

I agree with you from a theoretical point of view. But here none of the parameters will be used for anything if intensity is 0. And we are anyway moving to the ctapipe DL1 containers as soon as we can, so why bother about something in the deprecated container, which even now has no impact? What we badly need now is proper event tagging, and hopefully this provides it for most data.

moralejo added 11 commits July 29, 2020 19:39

Firs try at correcting UCTS time stamps

47d1ce3

Make sure that possible jumps in event_id are properly considered when correcting the UCTS info

6dd3b2b

Revert "Make sure that possible jumps in event_id are properly considered when correcting the UCTS info"

01e9e58

This reverts commit 6dd3b2b.

Revert "Firs try at correcting UCTS time stamps"

02492d7

This reverts commit 47d1ce3.

Merge branch 'master' of https://github.com/cta-observatory/cta-lstchain

8362d79

Fix for event-shifted UCTS info

3796c65

Make that all events are written out to the DL1 output file, no matter whether the image parametrization was successful or not

b26964f

Added missing width and length units in default nans

016b3b1

Removed blank line

3fd7a61

Put default nans in width and length in meters, otherwise the program complains later when doing e.g. atan(width/foclength)

80d2f3a

Fill proper event id, also for events in which Hillas failed

88f9978

moralejo marked this pull request as ready for review July 31, 2020 13:08

moralejo marked this pull request as draft July 31, 2020 15:52

maxnoe reviewed Jul 31, 2020

moralejo added 2 commits July 31, 2020 20:20

Attempt at fixing the failing tests (make r0_to_dl1 and dl1ab consistent)

d71e0a5

moralejo marked this pull request as ready for review July 31, 2020 19:14

moralejo requested a review from rlopezcoto July 31, 2020 19:15

moralejo marked this pull request as draft August 1, 2020 07:15

Found a way to write out all events, including those not surviving cleaning, and still keep copnsistency between r0_to_dl1 and dl1ab

31e6499

moralejo marked this pull request as ready for review August 1, 2020 08:27

moralejo marked this pull request as draft August 1, 2020 10:46

Simplified code

ff69ec4

moralejo marked this pull request as ready for review August 1, 2020 13:07

Fixed bug - the previous version would assign the wrong trigger type in case of more than one UCTS jump. Also: set -1 as ucts_trigger_type of events with no valid UCTS info.

5f1aa04

rlopezcoto reviewed Aug 17, 2020

maxnoe reviewed Aug 19, 2020

rlopezcoto approved these changes Sep 1, 2020

rlopezcoto added 2 commits September 14, 2020 10:34

Merge branch 'master' into ucts_fix

676795b

Merge branch 'master' into ucts_fix

5fedbd4

rlopezcoto changed the title ~~Ucts fix~~ Ucts fix to account for wrongly tagged pedestal events due to skipping events Sep 14, 2020

rlopezcoto merged commit 27ca0db into cta-observatory:master Sep 14, 2020
