Description
When preparing prospective production forecasts, we may want to mirror the `epix_slide` or `epix_as_of` operations that we use to prepare pseudoprospective analyses. However, this may involve getting a snapshot (`epix_as_of`) at a version with no update data (e.g., due to a nondaily update cadence, a holiday or some other occurrence causing the data source to be stale, or running the forecasts before the data source has been updated and not being able to distinguish this from the data source responding that there has been no change). Currently, unless we have some "redundant" `DT` rows duplicating previous values with the latest no-change version, this is going to raise an error (`max_version > self_max`). (This also means that compactification (#101) could change the error behavior here.) We should consider one of the following:
- take another arg to the archive constructor that acts as an override for `self_max`, and would allow this operation to go through with a warning rather than an error, or
- provide some helper functions, parameters, and messages to deal with this situation: if we are dealing with a stale archive (`max(DT$version) < forecast_ref_time`), do we interpret this as if the data source reported no change (unless it looks really stale) and allow getting the snapshot, or do we interpret it as an issue fetching the data and, up to some point, allow ourselves to fall back to `epix_as_of` at the max `DT` version, providing this as a parameter to the slide function to adjust things? These are actually different options when you think about fetching backfill-aware training sets and forecast aheads relative to the snapshot version.
- some sort of combination
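
To make the difference between the two interpretations concrete, here is a rough, language-neutral sketch in Python rather than the package's R code. Everything in it is hypothetical and for illustration only: the function `resolve_snapshot_version`, the policy names `"error"`, `"assume_no_change"`, and `"fall_back"`, and the `max_staleness` cutoff are not part of any existing API. It only decides which version a snapshot would be taken as of when the requested version is newer than the latest version with update data.

```python
import warnings
from datetime import date, timedelta


def resolve_snapshot_version(max_data_version, requested_version,
                             policy="error",
                             max_staleness=timedelta(days=7)):
    """Pick the version to snapshot at when the archive may be stale.

    Hypothetical helper: names, policies, and the default cutoff are
    assumptions for illustration, not an existing interface.
    """
    # Not stale: the archive already has update data at or after the request.
    if requested_version <= max_data_version:
        return requested_version

    lag = requested_version - max_data_version
    # Current behavior, and also the outcome when the archive looks too stale
    # for either lenient interpretation to be trusted.
    if policy == "error" or lag > max_staleness:
        raise ValueError(
            f"requested version {requested_version} is beyond the latest "
            f"version with update data ({max_data_version})"
        )
    if policy == "assume_no_change":
        # Interpret the gap as "the data source reported no change":
        # the snapshot is labeled with the requested version.
        warnings.warn(f"archive is stale by {lag}; assuming no change")
        return requested_version
    if policy == "fall_back":
        # Interpret the gap as a fetch problem: back up to the latest
        # version that actually has update data.
        warnings.warn(f"archive is stale by {lag}; falling back")
        return max_data_version
    raise ValueError(f"unknown policy: {policy!r}")


latest = date(2021, 12, 23)        # stands in for max(DT$version)
forecast_ref = date(2021, 12, 27)  # stands in for forecast_ref_time
print(resolve_snapshot_version(latest, forecast_ref, "assume_no_change"))
print(resolve_snapshot_version(latest, forecast_ref, "fall_back"))
```

Under `"assume_no_change"` the returned version is the forecast reference time, so training-set windows and forecast aheads would be computed relative to that date; under `"fall_back"` they would be computed relative to the older max `DT` version unless the caller adjusts for the gap.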