Dynamic partitions have a few edge cases that are hard to test #63

Open
C-Loftus opened this issue Dec 17, 2024 · 1 comment

Comments

@C-Loftus
Member

Currently the renci / nq backup asset runs every time, but the actual work of sending it or doing any computation is skipped unless all the partitions in the DAG are materialized.

That being said, I am unclear what happens after the entire graph has been crawled and all partitions are materialized. Namely, if one more source is then recrawled, the other partitions are still materialized, so it's possible the export would run again after just that one recrawl following a full export.
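A toy model of the gate makes the edge case concrete. This is only a sketch, not the project's code: `should_export` and the source names are hypothetical, and it assumes the check is a plain "every partition has a materialization" test.

```python
from __future__ import annotations


def should_export(all_partitions: set[str], materialized: set[str]) -> bool:
    """Hypothetical gate: export only when every partition is materialized."""
    return bool(all_partitions) and all_partitions <= materialized


partitions = {"source_a", "source_b", "source_c"}  # made-up source names

# Partial crawl: the gate skips the export.
partial = should_export(partitions, {"source_a", "source_b"})

# Full crawl: the gate opens and the export runs.
full = should_export(partitions, partitions)

# Later, only source_a is recrawled. Materialization status is sticky, so
# the materialized set is unchanged and the gate opens again.
after_single_recrawl = should_export(partitions, partitions)
```

Under that assumption the gate cannot tell a full crawl from a single recrawl, because the set of materialized partitions is the same in both cases.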

In general, dynamic partitions are poorly documented and harder to work with.

@C-Loftus
Member Author

It's possible that the proper way to do this would be to track the start time of each partition's materialization and write a simple algorithm to check which ones are grouped together.
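One way the grouping idea could look, as a rough sketch in plain Python. It assumes the start times have already been collected into a dict keyed by partition. The function names and the one hour `max_gap` threshold are hypothetical, not taken from the project.

```python
from __future__ import annotations

from datetime import datetime, timedelta


def group_by_start_time(
    start_times: dict[str, datetime],
    max_gap: timedelta = timedelta(hours=1),  # assumed threshold
) -> list[list[str]]:
    """Cluster partition keys whose materialization start times are close.

    A new group begins whenever the gap to the previous start time
    exceeds max_gap.
    """
    groups: list[list[str]] = []
    previous = None
    for key, started in sorted(start_times.items(), key=lambda kv: kv[1]):
        if previous is None or started - previous > max_gap:
            groups.append([])
        groups[-1].append(key)
        previous = started
    return groups


def latest_group_is_full_crawl(
    start_times: dict[str, datetime],
    all_partitions: set[str],
    max_gap: timedelta = timedelta(hours=1),
) -> bool:
    """True only if the most recent group covers every partition."""
    groups = group_by_start_time(start_times, max_gap)
    return bool(groups) and set(groups[-1]) == set(all_partitions)


t0 = datetime(2024, 12, 17, 9, 0)
starts = {
    "source_a": t0,
    "source_b": t0 + timedelta(minutes=5),
    "source_c": t0 + timedelta(minutes=10),
}
everything = set(starts)
full_crawl = latest_group_is_full_crawl(starts, everything)

# Recrawl source_a alone a day later: it lands in a group of its own.
starts["source_a"] = t0 + timedelta(days=1)
single_recrawl = latest_group_is_full_crawl(starts, everything)
```

With a check like this the export would run after the full crawl but not after the lone recrawl. Picking a sensible `max_gap` depends on how long a real crawl takes.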

It also might just be easier to have an asset at the end of the pipeline that resets all dynamic partitions. That would be equivalent to grouping the assets together for the next materialization.
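A minimal sketch of the reset step, assuming the pipeline runs on Dagster (the issue does not say so explicitly) and that the instance exposes `get_dynamic_partitions` and `delete_dynamic_partition`. The helper name `reset_dynamic_partitions`, the `"sources"` partitions name, and the `FakeInstance` stand-in are all hypothetical. The stand-in exists only so the snippet runs without Dagster installed.

```python
from __future__ import annotations


def reset_dynamic_partitions(instance, partitions_def_name: str) -> list[str]:
    """Delete every key of one dynamic partitions definition.

    `instance` is assumed to behave like a Dagster instance here.
    Returns the keys that were removed.
    """
    removed = list(instance.get_dynamic_partitions(partitions_def_name))
    for key in removed:
        instance.delete_dynamic_partition(partitions_def_name, key)
    return removed


class FakeInstance:
    """In-memory stand-in that mimics the two assumed instance methods."""

    def __init__(self, keys_by_name: dict[str, list[str]]) -> None:
        self._keys = {name: list(keys) for name, keys in keys_by_name.items()}

    def get_dynamic_partitions(self, partitions_def_name: str) -> list[str]:
        return list(self._keys.get(partitions_def_name, []))

    def delete_dynamic_partition(
        self, partitions_def_name: str, partition_key: str
    ) -> None:
        self._keys[partitions_def_name].remove(partition_key)


instance = FakeInstance({"sources": ["source_a", "source_b"]})
removed = reset_dynamic_partitions(instance, "sources")
remaining = instance.get_dynamic_partitions("sources")
```

In a real asset the helper would presumably be called with `context.instance` as the last step of the run. Whether old materialization records reappear when the same key is added again is worth verifying before relying on this.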
