Proposed Recipes for ClimateBench target variables #134

duncanwp · 2022-05-04T11:55:04Z

Thanks for setting up this great resource. I'm curious if dataset proposed below would make a good PangeoForge recipe. It could be a fairly simple extension of a CMIP6 recipe but I'm not sure if dependencies / recipe chaining is supported yet or in your plans?

Source Dataset

This fairly simple dataset consists of a few key (2D) CMIP6 variables from a single model for benchmarking Climate model emulation approaches: tas, diurnal_temperature_range (tasmax-tasmin), pr and pr90.

The file format is CMORized NetCDF
The files are arranged slightly differently for different scenarios but are broadly one file per ensemble member per scenario
Accessed via open ESGF THREDDS server

Transformation / Alignment / Merging

The transformations are fairly light, just combining across members and time where necessary and then calculating the monthly and annual quantities from daily data.

Output Dataset

zarr output would be preferable, either one file per scenario, or one big file with a scenario dimension (though the time dimension varies with scenario making that tricky I think).

The text was updated successfully, but these errors were encountered:

rabernat · 2022-05-04T18:13:35Z

Duncan, this would be an ideal recipe, and we would love to support it. Do you have any idea how big the total dataset is?

duncanwp · 2022-08-15T15:09:39Z

@rabernat Fantastic! I'm going to get working on this now the paper is close to acceptance.

The final dataset is only a few Gb, maybe 10's Gbs if we extend to multiple CMIP models. Doing this in the cloud will be great though since running it locally requires storing all the intermediary daily data which is pretty large.

Is there an example recipe that pulls from the ESGF S3 bucket if available but falls back to the ESGF nodes if unavailable? This PR looks close but has been superceded by this, which hasn't been merged.

duncanwp · 2022-10-11T07:45:37Z

Great! Not big, the original dataset is around 1Gb but I want to do a monthly version which would be ~12x bigger. On 4 May 2022, at 19:13, Ryan Abernathey ***@***.***> wrote: Duncan, this would be an ideal recipe, and we would love to support it. Do you have any idea how big the total dataset is? — Reply to this email directly, view it on GitHub<#134 (comment)>, or unsubscribe<https://github.com/notifications/unsubscribe-auth/AAYF2VAEPQY2727DKXRGNFTVIK45VANCNFSM5VBXL7JQ>. You are receiving this because you authored the thread.Message ID: ***@***.***>

duncanwp added the proposed recipe label May 4, 2022

duncanwp mentioned this issue May 4, 2022

Proposed Recipes for input4MIPS (specifically ClimateBench input variables) #135

Open

duncanwp mentioned this issue May 13, 2022

Pangeo Forge Recipe duncanwp/ClimateBench#1

Open

duncanwp mentioned this issue Aug 15, 2022

Derived CMIP6 data recipe builder (WIP) pangeo-forge/pangeo-forge-recipes#252

Closed

duncanwp mentioned this issue Jan 6, 2023

Proposed Recipes for ClimateBench #243

Closed

duncanwp mentioned this issue Aug 3, 2023

ClimateBench dataset leap-stc/data-management#43

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Proposed Recipes for ClimateBench target variables #134

Proposed Recipes for ClimateBench target variables #134

duncanwp commented May 4, 2022

rabernat commented May 4, 2022

duncanwp commented Aug 15, 2022

duncanwp commented Oct 11, 2022 via email

Proposed Recipes for ClimateBench target variables #134

Proposed Recipes for ClimateBench target variables #134

Comments

duncanwp commented May 4, 2022

Source Dataset

Transformation / Alignment / Merging

Output Dataset

rabernat commented May 4, 2022

duncanwp commented Aug 15, 2022

duncanwp commented Oct 11, 2022 via email