Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Augmenting CVs beyond the basic information captured in CMIP6 #69

Open
vnaik60 opened this issue Mar 12, 2024 · 7 comments
Open

Augmenting CVs beyond the basic information captured in CMIP6 #69

vnaik60 opened this issue Mar 12, 2024 · 7 comments

Comments

@vnaik60
Copy link

vnaik60 commented Mar 12, 2024

Can this level of information be captured via CVs?

@durack1
Copy link
Member

durack1 commented Mar 12, 2024

Thanks @vnaik60 - this should likely be created across in the CMIP6Plus_CVs so we catch needed data during the first pass, which we're still working on.

@wolfiex @matthew-mizielinski @davidhassell @glevava @taylor13

@durack1 durack1 changed the title detailed model information via CVs? Augmenting CVs beyond the basic information captured in CMIP6 Mar 12, 2024
@durack1
Copy link
Member

durack1 commented Mar 12, 2024

FYI, the AR6 Table AII.5 needed the information that you have linked to above, in addition to the information in the CMIP6_source_id.json. I note that the AR6 table was built from the AR5 Table 9.A template, and the AR4 Table 8.1 before it, and TAR Table 8.1 before it.

@taylor13
Copy link
Collaborator

Perhaps this should also be discussed with the CMIP documentation task team. (David and Guillaume). Don't see why that level of information can't be included in CVs if it is needed.

@glevava
Copy link
Collaborator

glevava commented Mar 14, 2024

I'm not sure all these information are strictly what we call "Controlled Vocabulary". For instance answering the question "Is there an active land carbon cycle?" by "Yes/No" is not related to CV. But such a level of information is clearly useful to be included in the *_source_id.json. I agree with @taylor13 this could be easily catched in the MVD to do so. We should think about CV to only define and centralise the typography and syntax of CMIP terms, so that it can be used consistently by other services when necessary, for example for certain elements of model documentation here.

@taylor13
Copy link
Collaborator

taylor13 commented Mar 14, 2024

I agree we should probably make a distinction between "controlled vocabulary" and the more extensive information recorded in some of our CMIP .json files. Most of the CV's are structured as pretty simple json files. In the simplest case we name the vocabulary (e.g., "table_id") and then list all the acceptable values:

    "table_id":[
        "3hr",
        "6hrLev",
        "6hrPlev",
        "6hrPlevPt",
        "AERday",
        "AERhr",
        .
        .
        .

In a slightly more complex case, which I think we can still refer to as a controlled vocabulary, we name the vocabulary (e.g., "acitivity_id") we include short "definitions" of what is meant by each word in the vocabulary:

    "activity_id":{
        "AerChemMIP":"Aerosols and Chemistry Model Intercomparison Project",
        "C4MIP":"Coupled Climate Carbon Cycle Model Intercomparison Project",
        "CDRMIP":"Carbon Dioxide Removal Model Intercomparison Project",
        "CFMIP":"Cloud Feedback Model Intercomparison Project",
         .
         .
         .

There are other json files we have called "controlled vocabularies", but which I think we might better refer to as "registries". Two clear examples are the "experiment_id" file and the "source_id" file.
Would distinguishing between these two types of information (a CV vs. a registry) make sense?
The documentation task team would be concerned first with expanding the "source" registry, and hopefully eventually also help with the experiment registry and help decide how to record how models have "conformed" their simulations to the experiment design.

@durack1
Copy link
Member

durack1 commented Mar 14, 2024

While I think we should get the nomenclature right, we shouldn't get too buried in the weeds here. If we wanted to be explicit, CMIP should probably be renamed ESMIP (Earth System Model Intercomparison Project), and that's not going to happen.

Following that example, we can refer to the project information in a project_CVs repo, and this information will be a mix of true controlled vocabularies and registered information (registries). We had hoped that reusable (across many projects) "controlled vocabularies" could be migrated upstream to the mip-cmor-tables repo, which means we have e.g. institution registrations in a single place, and these can then be reused across projects (e.g. CCCma is contributing to CMIP3, CMIP5, CMIP6, CMIP6Plus, input4MIPs, ...) so makes sense that "CCCma" the institution_id is registered in a single place, and that same identifier reused, consistently across projects

@vnaik60
Copy link
Author

vnaik60 commented Mar 15, 2024

Thanks Paul for tracing the source of the table. This googledoc was sent out to the modeling centers three weeks before the FGD deadline. I brought this up here so that this information is captured in a more formal way in the CMIP model documentation effort.

@durack1 durack1 transferred this issue from WCRP-CMIP/CMIP6_CVs May 2, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

4 participants