Proposed Recipes for ClimateBench target variables #134
Comments
Duncan, this would be an ideal recipe, and we would love to support it. Do you have any idea how big the total dataset is?
@rabernat Fantastic! I'm going to get working on this now that the paper is close to acceptance. The final dataset is only a few GB, maybe tens of GB if we extend to multiple CMIP models. Doing this in the cloud will be great, though, since running it locally requires storing all the intermediate daily data, which is pretty large. Is there an example recipe that pulls from the ESGF S3 bucket if available but falls back to the ESGF nodes if unavailable? This PR looks close but has been superseded by this, which hasn't been merged.
Great! Not big: the original dataset is around 1 GB, but I want to do a monthly version, which would be ~12x bigger.
On 4 May 2022, at 19:13, Ryan Abernathey wrote:
Duncan, this would be an ideal recipe, and we would love to support it. Do you have any idea how big the total dataset is?
Thanks for setting up this great resource. I'm curious whether the dataset proposed below would make a good Pangeo Forge recipe. It could be a fairly simple extension of a CMIP6 recipe, but I'm not sure if dependencies / recipe chaining is supported yet or in your plans?
Source Dataset
This fairly simple dataset consists of a few key (2D) CMIP6 variables from a single model for benchmarking climate model emulation approaches: tas, diurnal_temperature_range (tasmax - tasmin), pr and pr90.
Transformation / Alignment / Merging
The transformations are fairly light, just combining across members and time where necessary and then calculating the monthly and annual quantities from daily data.
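As a rough illustration of the daily-to-annual step, here is a minimal standard-library sketch. It assumes that pr90 means the 90th percentile of daily precipitation within each year (the issue does not define it), and the function name `annual_stats` and the synthetic inputs are hypothetical. A real recipe would more likely do this with xarray on gridded data; this only shows the logic for a single grid point.

```python
from collections import defaultdict
from datetime import date, timedelta
from statistics import fmean, quantiles


def annual_stats(days, tas, tasmax, tasmin, pr):
    """Reduce daily series to annual values, keyed by year (hypothetical helper)."""
    by_year = defaultdict(lambda: {"tas": [], "dtr": [], "pr": []})
    for d, t, tx, tn, p in zip(days, tas, tasmax, tasmin, pr):
        bucket = by_year[d.year]
        bucket["tas"].append(t)
        bucket["dtr"].append(tx - tn)  # diurnal temperature range = tasmax - tasmin
        bucket["pr"].append(p)

    out = {}
    for year, b in sorted(by_year.items()):
        out[year] = {
            "tas": fmean(b["tas"]),
            "diurnal_temperature_range": fmean(b["dtr"]),
            "pr": fmean(b["pr"]),
            # ASSUMPTION: pr90 is the 90th percentile of daily pr in the year.
            # quantiles(n=10) returns the 9 decile cut points; the last is the 90th.
            "pr90": quantiles(b["pr"], n=10, method="inclusive")[-1],
        }
    return out


# Synthetic two-year daily series, for illustration only.
days = [date(2015, 1, 1) + timedelta(days=i) for i in range(730)]
tas = [280.0 + (i % 10) for i in range(730)]
tasmax = [t + 4.0 for t in tas]
tasmin = [t - 4.0 for t in tas]
pr = [float(i % 11) for i in range(730)]

stats = annual_stats(days, tas, tasmax, tasmin, pr)
```

Monthly quantities follow the same pattern with `(d.year, d.month)` as the grouping key.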
Output Dataset
zarr output would be preferable, either one file per scenario, or one big file with a scenario dimension (though the time dimension varies with scenario making that tricky I think).
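To make the trade-off concrete, here is a small standard-library sketch of what a single store with a scenario dimension implies: every scenario has to be aligned on the union of all time steps, with missing values wherever a scenario has no data. The helper `stack_scenarios` and the toy values are hypothetical; the year ranges follow the usual CMIP6 convention (historical 1850-2014, SSP scenarios 2015-2100).

```python
import math


def stack_scenarios(series_by_scenario):
    """Align per-scenario {year: value} series on the union of years.

    Returns (scenarios, years, rows), where rows[i][j] is the value for
    scenarios[i] in years[j], or NaN where that scenario has no data.
    """
    scenarios = sorted(series_by_scenario)
    years = sorted({y for s in series_by_scenario.values() for y in s})
    rows = [
        [series_by_scenario[name].get(y, math.nan) for y in years]
        for name in scenarios
    ]
    return scenarios, years, rows


# Toy data for illustration only; values are placeholders.
data = {
    "historical": {y: 14.0 for y in range(1850, 2015)},
    "ssp245": {y: 15.0 for y in range(2015, 2101)},
}

scenarios, years, rows = stack_scenarios(data)
```

With non-overlapping periods like these, more than half of the stacked array is padding, which is one argument for writing one output per scenario instead.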