frequencies: Limit the range of frequency pivots to the observed data when no start and end date are provided #1132

huddlej · 2023-01-19T22:40:37Z

Context

This issue arose as part of a conversation about pivot intervals for frequencies and what reasonable start/end dates should be.

Description

The time points when we evaluate frequencies should not exceed the date range of the observed data unless the user has requested a specific start and/or end date. As @rneher notes in the conversation linked above:

Pivots for KDE frequencies should never extend more than the narrow width beyond the last time window were there is more or less representative data. They would be too heavily influenced by fluctuations in that case. This is less of an issue for the diffusion frequencies.

Instead of setting the lower and upper date bounds based on the pivot frequency like so:

pivot_start = start_date if start_date else np.floor(np.min(observations) / pivot_frequency) * pivot_frequency
pivot_end = end_date if end_date else np.ceil(np.max(observations) / pivot_frequency) * pivot_frequency

we should consider at least using a max date of the latest observation like so:

pivot_start = start_date if start_date else np.floor(np.min(observations) / pivot_frequency) * pivot_frequency
pivot_end = end_date if end_date else np.max(observations)

The latest approach to calculating pivots would guarantee that the last pivot matches the latest data point. However, there is no guarantee that a lower date bound based on the earliest observation would match that observation. We could change the logic of the pivot calculations to ensure that the earliest observation (or start date) is always included, though. This isn't a large change conceptually, but it would require us to update the expected behavior of the frequencies API as represented in our unit tests.

huddlej added the enhancement New feature or request label Jan 19, 2023

huddlej self-assigned this Jan 19, 2023

huddlej mentioned this issue Jan 19, 2023

frequencies: Fix pivot logic to always include the end date #1121

Merged

4 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

frequencies: Limit the range of frequency pivots to the observed data when no start and end date are provided #1132

frequencies: Limit the range of frequency pivots to the observed data when no start and end date are provided #1132

huddlej commented Jan 19, 2023

frequencies: Limit the range of frequency pivots to the observed data when no start and end date are provided #1132

frequencies: Limit the range of frequency pivots to the observed data when no start and end date are provided #1132

Comments

huddlej commented Jan 19, 2023

Context

Description