├── .gitignore
├── tools
├── environment.yml
├── check_name_rules.py
├── check_xml_unique.py
├── lib
│ └── xml_tools.py
├── ccpp_meta_stdname_check.py
└── write_standard_name_table.py
├── LICENSE
├── CODEOWNERS
├── README.md
├── .github
├── PULL_REQUEST_TEMPLATE
└── workflows
│ └── pull_request_ci.yml
├── standard_names_v1_0.xsd
└── StandardNamesRules.rst
/.gitignore:
--------------------------------------------------------------------------------
1 | *.pyc
2 |
--------------------------------------------------------------------------------
/tools/environment.yml:
--------------------------------------------------------------------------------
1 | name: test
2 | channels:
3 | - conda-forge
4 | dependencies:
5 | - pyyaml
6 |
--------------------------------------------------------------------------------
/LICENSE:
--------------------------------------------------------------------------------
1 | Copyright 2020, NOAA, UCAR/NCAR CU/CIRES
2 |
3 | Licensed under the Apache License, Version 2.0 (the "License");
4 | you may not use this file except in compliance with the License.
5 | You may obtain a copy of the License at
6 |
7 | http://www.apache.org/licenses/LICENSE-2.0
8 |
9 | Unless required by applicable law or agreed to in writing, software
10 | distributed under the License is distributed on an "AS IS" BASIS,
11 | WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
12 | See the License for the specific language governing permissions and
13 | limitations under the License.
14 |
--------------------------------------------------------------------------------
/CODEOWNERS:
--------------------------------------------------------------------------------
1 | # Lines starting with '#' are comments.
2 | # Each line is a file pattern followed by one or more owners.
3 |
4 | # These owners will be the default owners for everything in the repo.
5 |
6 | * @cacraigucar @climbfuji @dustinswales @gold2718 @grantfirl @mattldawson @mkavulich @mwaxmonsky @nusbaume @peverwhee @MarekWlasak @svahl991 @ss421
7 |
8 | # Order is important. The last matching pattern has the most precedence.
9 | # So if a pull request only touches javascript files, only these owners
10 | # will be requested to review.
11 | #*.js @octocat @github/js
12 |
13 | # You can also use email addresses if you prefer.
14 | #docs/* docs@example.com
15 |
--------------------------------------------------------------------------------
/README.md:
--------------------------------------------------------------------------------
1 | # ESMStandardNames
2 |
3 | The Earth System Modeling Standard Names Repository contains community-accepted Standard Names, publishing tools, and search tools.
4 |
5 | Rules governing the designation and format of standard names can be found in [StandardNamesRules.rst](https://github.com/ESCOMP/ESMStandardNames/blob/main/StandardNamesRules.rst).
6 |
7 | A [Markdown file describing the standard names is included](https://github.com/ESCOMP/ESMStandardNames/blob/main/Metadata-standard-names.md), as well as a [YAML version of the XML file](https://github.com/ESCOMP/ESMStandardNames/blob/main/Metadata-standard-names.yaml).
8 |
9 | Edits to standard names must be made in the XML file `standard_names.xml` only. When a pull request is opened into the main branch, the YAML and Markdown files should be updated using the `tools/write_standard_name_table.py` script. This can be done manually by the pull request author, or by activating the GitHub action available on an open pull request.
10 |
--------------------------------------------------------------------------------
/.github/PULL_REQUEST_TEMPLATE:
--------------------------------------------------------------------------------
1 |
2 |
3 |
19 |
20 |
43 |
44 | ## Description
45 |
46 |
47 | ## Issues
48 |
49 |
50 |
51 |
--------------------------------------------------------------------------------
/standard_names_v1_0.xsd:
--------------------------------------------------------------------------------
1 |
2 |
3 |
5 |
6 |
7 |
8 |
9 |
10 |
11 |
12 |
13 |
14 |
15 |
16 |
17 |
18 |
19 |
20 |
21 |
22 |
23 |
24 |
25 |
26 |
27 |
28 |
29 |
30 |
31 |
32 |
33 |
34 |
35 |
36 |
37 |
38 |
39 |
40 |
41 |
42 |
43 |
44 |
45 |
46 |
47 |
48 |
49 |
50 |
51 |
52 |
53 |
54 |
55 |
56 |
57 |
58 |
59 |
61 |
62 |
63 |
64 |
65 |
66 |
67 |
68 |
69 |
70 |
71 |
72 |
73 |
74 |
75 |
76 |
77 |
78 |
79 |
80 |
81 |
82 |
--------------------------------------------------------------------------------
/tools/check_name_rules.py:
--------------------------------------------------------------------------------
1 | #!/usr/bin/env python3
2 |
3 | """
4 | Check standard names database file for violations of standard name character rules
5 | """
6 |
7 | import argparse
8 | import sys
9 | import os.path
10 | import re
11 | import xml.etree.ElementTree as ET
12 |
13 | ################################################
14 | # Add lib modules to python path
15 | ################################################
16 |
17 | _CURR_DIR = os.path.dirname(os.path.abspath(__file__))
18 | sys.path.append(os.path.join(_CURR_DIR, "lib"))
19 |
20 | #######################################
21 | #Import needed framework python modules
22 | #######################################
23 |
24 | from xml_tools import find_schema_file, find_schema_version, validate_xml_file, read_xml_file
25 |
def main():
    """Parse the standard names database file and output a dictionary
    where the keys are any standard names in violation of character rules,
    and the values are lists of the specific rules violated.

    Raises:
        ValueError: if the schema file cannot be found or validation fails.
        Exception: if any standard name violates the character rules, or
            any element in the file contains non-ASCII characters.
    """
    # Parse arguments
    parser = argparse.ArgumentParser(description=__doc__)
    parser.add_argument("-s", "--standard_name_file",
                        metavar='', required=True,
                        type=str, help="XML file with standard name library")
    args = parser.parse_args()

    stdname_file = os.path.abspath(args.standard_name_file)
    _, root = read_xml_file(stdname_file)

    # Validate the XML file against its schema; the schema file is expected
    # to live next to the XML file and share its basename.
    version = find_schema_version(root)
    schema_name = os.path.basename(stdname_file)[0:-4]
    schema_root = os.path.dirname(stdname_file)
    schema_path = os.path.join(schema_root, schema_name)
    schema_file = find_schema_file(schema_path, version)
    if not schema_file:
        raise ValueError(f'Cannot find schema file, {schema_name}, for {version=}')
    try:
        validate_xml_file(stdname_file, schema_name, version, None,
                          schema_path=schema_root, error_on_noxmllint=True)
    except ValueError as verr:
        raise ValueError(f"Invalid standard names file, {stdname_file}") from verr

    # Parse list of standard names and see if any names violate one or more rules
    violators = {}
    first_char_re = re.compile(r'[a-z]')
    valid_name_chars = re.compile(r'[a-z0-9_]')
    for name in root.findall('./section/standard_name'):
        sname = name.attrib['name']
        violations = []
        # Guard against empty names before indexing the first character
        if not sname or not first_char_re.fullmatch(sname[0]):
            violations.append('First character is not a lowercase letter')
        # sub() strips all legal characters; anything left over is illegal
        testchars = valid_name_chars.sub('', sname)
        if testchars:
            violations.append(f'Invalid characters are present: "{testchars}"')

        # If any violations were detected, add an entry to "violators" dictionary
        if violations:
            violators[sname] = violations

    if violators:
        raise Exception(f"Violating standard names found:\n{violators}")

    # Check the serialized document for non-ascii characters (ord > 127)
    for elem in ET.tostringlist(root, encoding='unicode'):
        badchars = ''.join(ch for ch in elem if ord(ch) > 127)
        if badchars:
            violators[elem] = f'Non-ascii characters found: {badchars}'

    if violators:
        raise Exception(f"Violating entries found:\n{violators}")

    print(f'Success! All entries in {args.standard_name_file} follow the rules.')

if __name__ == "__main__":
    main()
94 |
--------------------------------------------------------------------------------
/.github/workflows/pull_request_ci.yml:
--------------------------------------------------------------------------------
1 | name: Pull request checks
2 |
3 | on:
4 | workflow_dispatch:
5 | pull_request:
6 | branches:
7 | - main
8 | - release/*
9 |
10 | jobs:
11 | check-unique-standard-names:
12 | name: Check for duplicates in standard names
13 | runs-on: ubuntu-latest
14 | steps:
15 | - name: Checkout repository
16 | uses: actions/checkout@v4
17 |
18 | - name: Setup Python
19 | uses: actions/setup-python@v4
20 | with:
21 | python-version: "3.x"
22 |
23 | - name: Install dependencies
24 | run: |
25 | sudo apt-get update
26 | sudo apt-get -y install libxml2-utils
27 |
28 | - name: Check for duplicate standard names, descriptions
29 | run: |
30 | tools/check_xml_unique.py standard_names.xml
31 | tools/check_xml_unique.py standard_names.xml --field="description"
32 |
33 | check-name-rules:
34 | name: Check standard names against rules
35 | runs-on: ubuntu-latest
36 |
37 | steps:
38 | - name: Checkout repository
39 | uses: actions/checkout@v4
40 |
41 | - name: Setup Python
42 | uses: actions/setup-python@v4
43 | with:
44 | python-version: "3.x"
45 |
46 | - name: Install dependencies
47 | run: |
48 | sudo apt-get update
49 | sudo apt-get -y install libxml2-utils
50 |
51 | - name: Check standard names against character rules
52 | run: |
53 | python3 tools/check_name_rules.py -s standard_names.xml
54 |
55 | test-rendering:
56 | name: Test rendering xml file to markdown and yaml
57 | runs-on: ubuntu-latest
58 | steps:
59 | - name: Checkout repository
60 | uses: actions/checkout@v4
61 |
62 | - name: Setup Python
63 | uses: actions/setup-python@v4
64 | with:
65 | python-version: "3.x"
66 |
67 | - name: Install dependencies
68 | run: |
69 | sudo apt-get update
70 | sudo apt-get -y install libxml2-utils
71 | python -m pip install --upgrade pip
72 | python -m pip install PyYaml
73 |
74 | - name: Test rendering xml file to markdown
75 | run: |
76 | # Checks if the saved markdown matches freshly rendered markdown.
77 | # If this fails, prompt user to update
78 | tools/write_standard_name_table.py --output-format md standard_names.xml
79 | if ! git diff --exit-code --quiet; then
80 | echo "❌ Detected that Metadata-standard-names.md is not consistent with standard_names.xml"
81 | echo "✅ To fix: Run the following command locally and commit the result:"
82 | echo " tools/write_standard_name_table.py --output-format md standard_names.xml"
83 | echo "📘 This script requires the pyyaml Python package; to install with pip use command:"
84 | echo " python -m pip install PyYaml"
85 | echo "📘 For conda users, environment file tools/environment.yml is provided."
86 | echo
87 | exit 1
88 | fi
89 |
90 | - name: Test rendering xml file to yaml
91 | run: |
92 | tools/write_standard_name_table.py --output-format yaml standard_names.xml
93 | if ! git diff --exit-code --quiet; then
94 | echo "❌ Detected that Metadata-standard-names.yaml is not consistent with standard_names.xml"
95 | echo "✅ To fix: Run the following command locally and commit the result:"
96 | echo " tools/write_standard_name_table.py --output-format yaml standard_names.xml"
97 | echo "📘 This script requires the pyyaml Python package; to install with pip use command:"
98 | echo " python -m pip install PyYaml"
99 | echo "📘 For conda users, environment file tools/environment.yml is provided."
100 | echo
101 | exit 1
102 | fi
103 |
104 |
--------------------------------------------------------------------------------
/tools/check_xml_unique.py:
--------------------------------------------------------------------------------
1 | #!/usr/bin/env python3
2 |
3 | """
3 | Check for (and, with --overwrite, remove) duplicates in a metadata standard-name XML library file.
5 | """
6 |
7 | import argparse
8 | import sys
9 | import os.path
10 | import xml.etree.ElementTree as ET
11 | import copy
12 |
13 | ################################################
14 | # Add lib modules to python path
15 | ################################################
16 |
17 | _CURR_DIR = os.path.dirname(os.path.abspath(__file__))
18 | sys.path.append(os.path.join(_CURR_DIR, "lib"))
19 |
20 | #######################################
21 | #Import needed framework python modules
22 | #######################################
23 |
24 | from xml_tools import find_schema_file, find_schema_version, validate_xml_file, read_xml_file
25 |
26 | ###############################################################################
def parse_command_line(args, description):
    """Build the argument parser and parse the token list *args*.

    Returns the parsed argparse namespace.
    """
    parser = argparse.ArgumentParser(description=description,
                                     formatter_class=argparse.RawTextHelpFormatter)
    parser.add_argument("standard_name_file",
                        metavar='',
                        type=str, help="XML file with standard name library")
    parser.add_argument("--overwrite", action='store_true',
                        help="flag to remove duplicates and overwrite the file")
    parser.add_argument("--field", type=str, default="name",
                        help="Field to check for uniqueness; default is 'name'")
    parser.add_argument("--debug", action='store_true',
                        help="flag for additional debug print statements")
    return parser.parse_args(args)
44 |
45 | ###############################################################################
def main_func():
    """Parse the standard names database file and notify of duplicates.

    Duplicates are detected on the attribute named by --field (default
    "name").  With --overwrite, all but the first occurrence of each
    duplicated entry are removed and the file is rewritten in place;
    otherwise the script exits with status 1 when duplicates exist.
    """
    # Parse command line arguments
    args = parse_command_line(sys.argv[1:], __doc__)
    stdname_file = os.path.abspath(args.standard_name_file)
    tree, root = read_xml_file(stdname_file)

    # Validate the XML file; schema lives next to the XML file and shares
    # its basename.
    version = find_schema_version(root)
    schema_name = os.path.basename(stdname_file)[0:-4]
    schema_root = os.path.dirname(stdname_file)
    schema_path = os.path.join(schema_root, schema_name)
    schema_file = find_schema_file(schema_path, version)
    if not schema_file:
        raise ValueError(f'Cannot find schema file, {schema_name}, for {version=}')
    try:
        validate_xml_file(stdname_file, schema_name, version, None,
                          schema_path=schema_root, error_on_noxmllint=True)
    except ValueError as verr:
        raise ValueError(f"Invalid standard names file, {stdname_file}") from verr

    # Collect the requested field from every standard_name entry
    all_std_names = []
    for name in root.findall('./section/standard_name'):
        try:
            all_std_names.append(name.attrib[args.field])
        except KeyError:
            if args.debug:
                print(f"WARNING: no field '{args.field}' for standard name '{name.attrib['name']}' ")

    # Find duplicated values, preserving source order
    seen = set()
    dup_std_names = []
    for val in all_std_names:
        if val in seen:
            dup_std_names.append(val)
        else:
            seen.add(val)

    if not dup_std_names:
        print(f'No duplicate {args.field}s were found.')
        return

    print(f'The following duplicate {args.field} entries were found:')
    for dup in dup_std_names:
        rm_elements = root.findall(f'./section/standard_name[@{args.field}="{dup}"]')[1:]
        print(f"{dup}, ({len(rm_elements)} duplicate(s))")
    if args.overwrite:
        print(f'Removing duplicates and overwriting {stdname_file}')
        for dup in dup_std_names:
            first_use = True  # Indicates the first use of the duplicated value
            # BUG FIX: match on the field being checked (--field), not
            # unconditionally on "name", so removal agrees with detection.
            rm_parents = root.findall(f'./section/standard_name[@{args.field}="{dup}"]/..')
            for par in rm_parents:
                for ele in par.findall(f'./standard_name[@{args.field}="{dup}"]'):
                    if first_use:
                        # Keep the first occurrence; remove all later ones
                        first_use = False
                    else:
                        par.remove(ele)
        # Overwrite the xml file with the new, duplicate-free element tree:
        tree.write(stdname_file, "utf-8")
    else:
        # If not overwriting, exit with status 1 to indicate failure
        sys.exit(1)


###############################################################################
if __name__ == "__main__":
    main_func()
119 |
--------------------------------------------------------------------------------
/tools/lib/xml_tools.py:
--------------------------------------------------------------------------------
1 | #!/usr/bin/env python
2 |
3 | """
4 | Parse and / or validate an XML file and return the captured variables.
5 | """
6 |
7 | # Python library imports
8 | from __future__ import print_function
9 | import os
10 | import os.path
11 | import subprocess
12 | import sys
13 | import logging
14 | from shutil import which
15 | import xml.etree.ElementTree as ET
16 | try:
17 | _XMLLINT = which('xmllint')
18 | except ImportError:
19 | _XMLLINT = None
20 | # end try
21 |
22 | # Find python version
23 | PY3 = sys.version_info[0] > 2
24 | PYSUBVER = sys.version_info[1]
25 | _LOGGER = None
26 |
27 | ###############################################################################
def call_command(commands, logger, silent=False):
    """
    Try a command line and return True on success (False on failure).

    <commands> is a list of command-line tokens (run with shell=False).
    If <logger> is None, the command is always run silently.
    On failure, a RuntimeError describing the failed command is raised
    unless <silent> is True, in which case False is returned.

    >>> call_command(['ls', 'really__improbable_fffilename.foo'], _LOGGER) #doctest: +IGNORE_EXCEPTION_DETAIL
    Traceback (most recent call last):
    RuntimeError: Execution of 'ls really__improbable_fffilename.foo' failed:
    [Errno 2] No such file or directory
    >>> call_command(['ls', 'really__improbable_fffilename.foo'], _LOGGER, silent=True)
    False
    >>> call_command(['ls'], _LOGGER)
    True
    """
    result = False
    if logger is None:
        silent = True
    # end if
    try:
        # stdout/stderr=PIPE works on every supported Python 3 version
        # (capture_output requires 3.7+); Python 2 support is dropped --
        # all tool scripts in this repo use python3 shebangs and f-strings.
        cproc = subprocess.run(commands, check=True,
                               stdout=subprocess.PIPE,
                               stderr=subprocess.PIPE)
        if not silent:
            logger.debug(cproc.stdout)
        # end if
        result = cproc.returncode == 0
    except (OSError, RuntimeError, subprocess.CalledProcessError) as err:
        if silent:
            result = False
        else:
            cmd = ' '.join(commands)
            # OSError (e.g. command not found) has no returncode/output
            # attributes, so use getattr to avoid a masking AttributeError.
            emsg = "Execution of '{}' failed with code {}:\n"
            outstr = emsg.format(cmd, getattr(err, 'returncode', None))
            outstr += "{}".format(getattr(err, 'output', err))
            raise RuntimeError(outstr) from err
        # end if
    # end of try
    return result
88 |
89 | ###############################################################################
def find_schema_version(root):
    """
    Find and return the [major, minor] schema version of the XML tree
    rooted at <root>, read from its "version" attribute.

    >>> find_schema_version(ET.fromstring('<entries version="1.0"/>'))
    [1, 0]
    >>> find_schema_version(ET.fromstring('<entries version="1.a"/>')) #doctest: +IGNORE_EXCEPTION_DETAIL
    Traceback (most recent call last):
    ValueError: Illegal version string, '1.a'
    >>> find_schema_version(ET.fromstring('<entries version="0.0"/>')) #doctest: +IGNORE_EXCEPTION_DETAIL
    Traceback (most recent call last):
    ValueError: Illegal version string, '0.0'
    """
    if 'version' not in root.attrib:
        raise ValueError("version attribute required")
    # end if
    version = root.attrib['version']
    verbits = None
    try:
        versplit = version.split('.')
        if len(versplit) != 2:
            raise ValueError('oops')
        # end if (no else needed)
        # int() raises ValueError itself; no need to catch and re-raise
        verbits = [int(x) for x in versplit]
        if verbits[0] < 1:
            raise ValueError('Major version must be at least 1')
        # end if
        if verbits[1] < 0:
            raise ValueError('Minor version must be non-negative')
        # end if
    except ValueError as verr:
        # NOTE(review): the second line of this message was garbled in
        # extraction; reconstructed as "<integer>.<integer>" -- confirm
        # against upstream.
        errstr = """Illegal version string, '{}'
        Format must be <integer>.<integer>."""
        ve_str = str(verr)
        if ve_str:
            errstr = ve_str + '\n' + errstr
        # end if
        raise ValueError(errstr.format(version)) from verr
    # end try
    return verbits
140 |
141 | ###############################################################################
def find_schema_file(schema_root, version, schema_path=None):
    """Return the path of the schema file derived from <schema_root> and
    <version> ([major, minor]), or None when no such file exists.
    When <schema_path> is given, look for the file in that directory;
    otherwise look in the current directory."""

    major_minor = '_'.join(str(part) for part in version)
    candidate = "{}_v{}.xsd".format(schema_root, major_minor)
    if schema_path:
        candidate = os.path.join(schema_path, candidate)
    # end if
    return candidate if os.path.exists(candidate) else None
160 |
161 | ###############################################################################
def validate_xml_file(filename, schema_root, version, logger,
                      schema_path=None, error_on_noxmllint=False):
    """
    Locate the schema matching <schema_root> and <version> and validate
    the XML file <filename> against it using xmllint.
    """
    # Guard clauses: the XML file must exist and be readable
    if not os.path.isfile(filename):
        raise ValueError("validate_xml_file: Filename, '{}', does not exist".format(filename))
    # end if
    if not os.access(filename, os.R_OK):
        raise ValueError("validate_xml_file: Cannot open '{}'".format(filename))
    # end if
    if not schema_path:
        # Default schema location: the 'schema' directory two levels above
        # this library file
        this_path = os.path.abspath(__file__)
        repo_dir = os.path.dirname(os.path.dirname(os.path.dirname(this_path)))
        schema_path = os.path.join(repo_dir, 'schema')
    # end if
    schema_file = find_schema_file(schema_root, version, schema_path)
    if not (schema_file and os.path.isfile(schema_file)):
        verstring = '.'.join([str(x) for x in version])
        emsg = """validate_xml_file: Cannot find schema for version {},
        {} does not exist"""
        raise ValueError(emsg.format(verstring, schema_file))
    # end if
    if not os.access(schema_file, os.R_OK):
        emsg = "validate_xml_file: Cannot open schema, '{}'"
        raise ValueError(emsg.format(schema_file))
    # end if
    if _XMLLINT is None:
        # No xmllint on this system: either fail hard or warn and proceed
        lmsg = "xmllint not found, could not validate file {}"
        if error_on_noxmllint:
            raise ValueError("validate_xml_file: " + lmsg.format(filename))
        # end if
        if logger is not None:
            logger.warning(lmsg.format(filename))
        # end if
        return True # We could not check but still need to proceed
    # end if
    if logger is not None:
        lmsg = "Checking file {} against schema {}"
        logger.debug(lmsg.format(filename, schema_file))
    # end if
    return call_command([_XMLLINT, '--noout', '--schema', schema_file, filename],
                        logger)
210 |
211 | ###############################################################################
def read_xml_file(filename, logger=None):
    """Read the XML file <filename> and return its (tree, root) pair.

    Raises ValueError when the file is missing, unreadable, or not
    well-formed XML.
    """
    # BUG FIX: check existence first; previously a nonexistent file fell
    # into the "not os.access" branch and was reported as "Cannot open".
    if not os.path.isfile(filename):
        emsg = "read_xml_file: Filename, '{}', does not exist"
        raise ValueError(emsg.format(filename))
    # end if
    if not os.access(filename, os.R_OK):
        raise ValueError("read_xml_file: Cannot open '{}'".format(filename))
    # end if
    # Python 2 fallback removed; all callers are python3 scripts.
    with open(filename, 'r', encoding='utf-8') as file_:
        try:
            tree = ET.parse(file_)
            root = tree.getroot()
        except ET.ParseError as perr:
            emsg = "read_xml_file: Cannot read {}, {}"
            raise ValueError(emsg.format(filename, perr)) from perr
        # end try
    # end with
    if logger:
        logger.debug("Read XML file, '{}'".format(filename))
    # end if
    return tree, root
238 |
239 | ###############################################################################
240 |
if __name__ == "__main__":
    # Running this module directly executes its doctests, with logging
    # routed to a NullHandler so nothing leaks to the console.
    _LOGGER = logging.getLogger('xml_tools')
    for _handler in list(_LOGGER.handlers):
        _LOGGER.removeHandler(_handler)
    # end for
    _LOGGER.addHandler(logging.NullHandler())
    try:
        # First, run doctest
        import doctest
        doctest.testmod()
    except ValueError as cerr:
        print("{}".format(cerr))
    # no else:
--------------------------------------------------------------------------------
/tools/ccpp_meta_stdname_check.py:
--------------------------------------------------------------------------------
1 | #!/usr/bin/env python3
2 |
3 | """
4 |
5 | This tool checks if all of the
6 | standard names present in a
7 | CCPP metadata file also exist
8 | in the standard names dictionary.
9 |
10 | The tool currently has two options:
11 |
12 | 1. A path to a single metadata file
13 | is passed, in which case only that
14 | file's standard names are checked, e.g.:
15 |
16 | ./meta_stdname_check --metafile-loc /path/to/file.meta --stdname-dict /path/to/dict.xml
17 |
18 | 2. A path to a directory is passed, in
19 | which case the directory is searched,
20 | along with any subdirectories, for
21 | metadata files, and all found files'
22 | standard names are checked, e.g.:
23 |
24 | ./meta_stdname_check --metafile-loc /meta/path/ --stdname-dict /path/to/dict.xml
25 |
26 | """
27 |
28 | ######################################
29 | #Import needed standard python modules
30 | ######################################
31 |
32 | import argparse
33 | import sys
34 | import os
35 | import os.path
36 | import datetime
37 | from collections import OrderedDict
38 |
39 | ################################################
40 | # Add lib modules to python path
41 | ################################################
42 |
43 | _CURR_DIR = os.path.dirname(os.path.abspath(__file__))
44 | sys.path.append(os.path.join(_CURR_DIR, "lib"))
45 |
46 | #######################################
47 | #Import needed framework python modules
48 | #######################################
49 |
50 | from xml_tools import read_xml_file
51 |
52 | #################
53 | #Helper functions
54 | #################
55 |
56 | #++++++++++++++++++++++++++++++
57 | #Input Argument parser function
58 | #++++++++++++++++++++++++++++++
59 |
def parse_arguments():

    """
    Parse command-line input arguments with argparse and return the
    (metafile location, standard-name dictionary location) pair.
    """

    # Description shown in --help output (same text as before):
    desc = ("Check if the metafile contains variable standard names\n"
            "that are not in the provided standard names dictionary.")

    parser = argparse.ArgumentParser(description=desc)

    parser.add_argument('-m', '--metafile-loc',
                        metavar='',
                        action='store', type=str,
                        help="Location of metadata file(s)")

    parser.add_argument('-s', '--stdname-dict',
                        metavar='',
                        action='store', type=str,
                        help="Location of standard name dictionary (XML file)")

    args = parser.parse_args()
    return args.metafile_loc, args.stdname_dict
90 |
91 | #+++++++++++++++++++++++++++++++++++++++++++++++++++++++++
92 | #Function to extract standard names from element tree root
93 | #+++++++++++++++++++++++++++++++++++++++++++++++++++++++++
94 |
def get_dict_stdnames(xml_tree_root):

    """
    Collect the "name" attribute of every standard_name element found
    under a section element and return the names as a set.
    """

    return {entry.attrib['name']
            for entry in xml_tree_root.findall('./section/standard_name')}
113 |
114 | #+++++++++++++++++++++++++++++++++++++++++++++++++++++++++
115 | #Function to parse a list of strings from a metadata file
116 | #in order to find all standard names
117 | #+++++++++++++++++++++++++++++++++++++++++++++++++++++++++
118 |
def find_metafile_stdnames(metafile_obj):

    """
    Find all metadata lines of the form "standard_name = <value>" and
    return the set of values found.  Text after a comment delimiter (#)
    is ignored.

    NOTE:

    The CCPP-framework has much more advanced parsers
    that can extract this same info, but bringing them
    into this repo would require many additional
    supporting source files to be brought in as well.

    However, if it is found that this simplified parser
    is hitting too many edge cases then it might be wise
    to use the actual CCPP-framework parser instead of
    expanding on this function or script.
    """

    # Create empty set to store found standard names:
    meta_stdname_set = set()

    # Loop over lines in metadata file object:
    for line in metafile_obj:

        # Split on the first "=" (if any) and require the left-hand side
        # to be exactly "standard_name".  BUG FIX: the previous prefix
        # test (startswith) also matched keys like "standard_name_xyz".
        key, sep, value = line.partition("=")
        if sep and key.strip() == "standard_name":

            # Strip any trailing comment before recording the value:
            value = value.partition("#")[0]

            # Add stripped/trimmed text to the standard name set:
            meta_stdname_set.add(value.strip())

        # End if
    # End for

    return meta_stdname_set
177 |
178 | #+++++++++++++++++++++++++++++++++++++++++++++++++++++++++
179 | #Function to extract standard names in CCPP metadata file
180 | #that are not in a provided set of accepted standard names
181 | #+++++++++++++++++++++++++++++++++++++++++++++++++++++++++
182 |
def missing_metafile_names(metafile, stdname_set):

    """
    Extract all standard names listed in the CCPP metadata file
    <metafile> and return a sorted list of those names that are not
    present in the provided standard-name set.
    """

    # Collect every standard name used in the metadata file:
    with open(metafile, 'r', encoding='utf-8') as mfile:
        used_names = find_metafile_stdnames(mfile)
    # End with

    # Names used in the file but absent from the dictionary, sorted:
    return sorted(used_names - stdname_set)
204 |
205 | #+++++++++++++++++++++++++++++++++++++++++++++++++++++++++
206 | #Function to find the paths to all metadata files within
207 | #a given directory path
208 | #+++++++++++++++++++++++++++++++++++++++++++++++++++++++++
209 |
def find_metadata_files(dir_path):

    """
    Walk through the provided directory and
    return a list of the full paths of all
    CCPP metadata (.meta) files found, skipping
    anything located under a '.git' path.
    """

    #Create new, empty list to store metadata file paths:
    metadata_files = []

    #Walk through provided directory:
    for root, _, files in os.walk(dir_path):
        #Ignore git directories.
        #NOTE: this is a substring check, so any path containing
        #'.git' (e.g. '.github') is skipped as well:
        if '.git' in root:
            continue
        #End if

        #Add all found metadata files to metadata list,
        #including their full path:
        metadata_files.extend(os.path.join(root, mfil)
                              for mfil in files
                              if mfil.endswith('.meta'))
    #End for

    #Return list of metadata files:
    return metadata_files
240 |
241 | #+++++++++++++++++++++++++++++++++++++++++++++++++++++++++
242 | #Function to print a "human-readable" list of all of the
243 | #standard names in the provided CCPP metadata files that
244 | #were not found in the provided standard name dictionary
245 | #+++++++++++++++++++++++++++++++++++++++++++++++++++++++++
246 |
def print_missing_names(missing_names_dict):

    """
    Print each metadata file that contains standard
    names not found in the dictionary, followed by
    an indented list of that file's "missing"
    standard names.
    """

    #Header block, including the date/time the script was run:
    print("\n#######################")
    print("Date/time of when script was run:")
    print(datetime.datetime.now())
    print("#######################")
    print("\nNon-dictionary standard names found in the following"
          " metadata files:")

    #Write one section per metadata file (dict keys
    #are the metadata file paths):
    for meta_path, stdnames in missing_names_dict.items():

        print("\n--------------------------\n")
        print(f"{meta_path}\n")

        #List every missing standard name for this file:
        for stdname in stdnames:
            print(f"  - {stdname}")
        #End for

    #End for

    print("\n#######################")
284 |
285 | #+++++++++++++++++++++++++++++++++++++++++++++++++++++++++
286 |
287 | ############
288 | #Main script
289 | ############
290 |
#Parse command-line arguments:
metafile_loc, stdname_xml = parse_arguments()

#Read the standard name dictionary and pull out
#every standard name it defines:
_, stdname_dict_root = read_xml_file(stdname_xml)
std_names = get_dict_stdnames(stdname_dict_root)

#Determine which metadata files to check: a single
#file if one was passed in, every metadata file in
#or under the given location if it is a directory:
if os.path.isfile(metafile_loc):
    meta_files = [metafile_loc]
elif os.path.isdir(metafile_loc):
    meta_files = find_metadata_files(metafile_loc)
else:
    #This is a non-supported input, so raise
    #an error:
    emsg = f"The metafile-loc arg input, '{metafile_loc}'\n"
    emsg += "is neither a file nor a directory,"
    emsg += " so script will end here."
    raise FileNotFoundError(emsg)
#End if

#Map each metadata file to the standard names it
#uses that are absent from the dictionary:
meta_miss_names_dict = OrderedDict()
for meta_file in meta_files:
    missing_stdnames = missing_metafile_names(meta_file, std_names)
    if missing_stdnames:
        meta_miss_names_dict[meta_file] = missing_stdnames
    #End if
#End for

#Report the results:
if meta_miss_names_dict:
    #Print organized, human-readable
    #list of "missing" standard names
    #to the screen, along with the
    #metadata file they are associated
    #with
    print_missing_names(meta_miss_names_dict)
else:
    #Notify user that all standard names
    #exist in the dictionary:
    print("All standard names are in the dictionary!")
#End if


##############
#End of script
--------------------------------------------------------------------------------
/tools/write_standard_name_table.py:
--------------------------------------------------------------------------------
1 | #!/usr/bin/env python3
2 |
3 | """
4 | Convert a metadata standard-name XML library file to another format.
5 | """
6 |
7 | # Python library imports
8 | from collections import OrderedDict
9 | import xml.etree.ElementTree as ET
10 | import os.path
11 | import argparse
12 | import sys
13 | import re
14 | import yaml
15 |
16 | ################################################
17 | # Add lib modules to python path
18 | ################################################
19 |
20 | _CURR_DIR = os.path.dirname(os.path.abspath(__file__))
21 | sys.path.append(os.path.join(_CURR_DIR, "lib"))
22 |
23 | #######################################
24 | # Import needed framework python modules
25 | #######################################
26 |
27 | from xml_tools import validate_xml_file, read_xml_file
28 | from xml_tools import find_schema_file, find_schema_version
29 |
30 | #######################################
31 | # Regular expressions
32 | #######################################
33 |
# Matches a 'p' sandwiched between two digits (e.g. '0p55'); used to
# restore a decimal point when building human-readable descriptions.
_REAL_SUBST_RE = re.compile(r"(.*\d)p(\d.*)")

# Matches every character Markdown drops when converting a header into
# an anchor link (anything other than lowercase letters, '_', or '-').
_DROPPED_LINK_CHARS_RE = re.compile(r"[^a-z_-]")
37 |
38 | #######################################
39 | # Custom representer for OrderedDict
40 | #######################################
41 |
def ordered_dict_representer(dumper, data):
    """Represent an OrderedDict as a plain YAML mapping, preserving key order."""
    mapping_tag = yaml.resolver.BaseResolver.DEFAULT_MAPPING_TAG
    return dumper.represent_mapping(mapping_tag, data.items())
# Register the representer so yaml.dump handles OrderedDict transparently:
yaml.add_representer(OrderedDict, ordered_dict_representer)
45 |
46 | ########################################################################
def convert_text_to_link(text_str):
########################################################################
    """
    Produce the internal document link that Markdown
    would generate for the given header string, by
    applying Markdown's header-to-anchor conversion
    rules to the text.
    """

    # Trim whitespace, lowercase the text, and swap
    # spaces for dashes:
    dashed_str = text_str.strip().lower().replace(" ", "-")

    # Finally, drop every character that is not a
    # lowercase letter, underscore, or dash:
    return _DROPPED_LINK_CHARS_RE.sub("", dashed_str)
71 |
72 | ########################################################################
def standard_name_to_description(prop_dict, context=None):
########################################################################
    """Translate a standard_name to its default description
    Note: This code is copied from the CCPP Framework.
    >>> standard_name_to_description({'standard_name':'cloud_optical_depth_layers_from_0p55mu_to_0p99mu'})
    'Cloud optical depth layers from 0.55mu to 0.99mu'
    >>> standard_name_to_description({'local_name':'foo'}) #doctest: +IGNORE_EXCEPTION_DETAIL
    Traceback (most recent call last):
    CCPPError: No standard name to convert foo to description
    >>> standard_name_to_description({}) #doctest: +IGNORE_EXCEPTION_DETAIL
    Traceback (most recent call last):
    CCPPError: No standard name to convert to description
    >>> standard_name_to_description({'local_name':'foo'}, context=ParseContext(linenum=3, filename='foo.F90')) #doctest: +IGNORE_EXCEPTION_DETAIL
    Traceback (most recent call last):
    CCPPError: No standard name to convert foo to description at foo.F90:3
    >>> standard_name_to_description({}, context=ParseContext(linenum=3, filename='foo.F90')) #doctest: +IGNORE_EXCEPTION_DETAIL
    Traceback (most recent call last):
    CCPPError: No standard name to convert to description at foo.F90:3
    """
    # NOTE(review): the error path below uses CCPPError and
    # context_string, neither of which appears in this file's visible
    # imports -- confirm they are in scope at runtime (this function was
    # copied from the CCPP Framework, which defines both).
    # We assume that standard_name has been checked for validity
    # Make the first char uppercase and replace each underscore with a space
    if 'standard_name' in prop_dict:
        standard_name = prop_dict['standard_name']
        if standard_name:
            description = standard_name[0].upper() + re.sub("_", " ",
                                                            standard_name[1:])
        else:
            # Empty standard_name string: fall through with an empty
            # description rather than raising.
            description = ''
        # end if
        # Next, substitute a decimal point for the p in [:digit]p[:digit]
        match = _REAL_SUBST_RE.match(description)
        while match is not None:
            description = match.group(1) + '.' + match.group(2)
            match = _REAL_SUBST_RE.match(description)
        # end while
    else:
        description = ''
        if 'local_name' in prop_dict:
            lname = ' {}'.format(prop_dict['local_name'])
        else:
            lname = ''
        # end if
        ctxt = context_string(context)
        emsg = 'No standard name to convert{} to description{}'
        raise CCPPError(emsg.format(lname, ctxt))
    # end if
    return description
120 |
121 | ###############################################################################
122 | def parse_command_line(args, program_description):
123 | ###############################################################################
124 | parser = argparse.ArgumentParser(description=program_description,
125 | formatter_class=argparse.RawTextHelpFormatter)
126 |
127 | parser.add_argument("standard_name_file",
128 | metavar='',
129 | type=str, help="XML file with standard name library")
130 | parser.add_argument("--output-filename", metavar='