-
Notifications
You must be signed in to change notification settings - Fork 8
Check/test all_rows
, improve docs
#147
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Comments
all_rows
, consider changing to just return a filtered archiveall_rows
, improve docs
If We could implement our own error, checking if In any case, I don't think there's a concern about accidentally generating a ton of duplicate or unincluded rows. With current behavior, toggling on I also noticed that when |
I'm not seeing this. Do you have an example? |
TL;DR: there are some little things here and there making it hard to test/understand minimal examples here that I'd like to work out. We should maybe try to address those first and then revisit this to make sure there's no action needed. I'm still missing what the use cases we're trying to fulfill here are, which is part of why I am so confused. Originally, I think I was misled by
which seems to be inaccurate: the computation must return 1 row or a number of rows matching the number of distinct non-time_value-non-version-key values encountered. So we shouldn't expect duplicate key values out of this (and don't seem to get them even if we happen to trick the check by having a coincidental match in the number of time values and non-time-non-version key values and group by time_value, at least in my testing so far). But not sure if we need to be concerned about group_by vars that are selected from outside the key vars. Maybe the major surprise here (that also makes this harder to debug) is that Additionally, constructing a minimal example is complicated by the silent filtering here:
which we will likely want to change to raise an error if any of the ref_time_values is |
A couple random notes from trying to write out other issues:
[ |
Even more notes:
|
In #290 the plan is to just remove |
Closing associated PR #184. |
all_rows
[inepix_slide
] looks like it will result in a huge expansion of rows ifgroup_by
isn't all non-version non-time key cols. Check whether this is an issue.mutate
andsummarize
, except maybe still requiring 1-row outputs on the "mutate" version? [No, it's not. Or there is a bug when specifying a single ref time value and/or other situations causing a lot of extra rows to be added even when not in the above situation.]all_rows
The text was updated successfully, but these errors were encountered: