Consider validating sensible key values in epi_archive
, valid ops for & performance improvements from nonunique keys
#89
Labels
op-semantics
Operational semantics; many potentially breaking changes here
P2
low priority
performance
REPL
Improved print, errors, etc.
Currently, we do not check for distinct key values in
epi_archive
.$as_of
might give the right result if the duplicate keys have duplicate values and we don't want duplicate rows as output, but gives the wrong thing if instead user is trying to take advantage of a particular update-reporting structure which might enable some performance improvements, which we could take further advantage of by being more flexible with the key, described next.version
--- we are working off of full snapshots in DT --- then we only need to look up the version once, and can key byversion
alone. But we can't use theunique
lookup foras_of
here; maybe a rolling join would work & generalize to the next case.version
for time_valuesversion - 1:windowlength
, then we could key just by (time_value, version).Might interact with #87.
The text was updated successfully, but these errors were encountered: