YAML Parser: Difference between revisions
| No edit summary   (change visibility)  | |||
| (9 intermediate revisions by 2 users not shown) | |||
| Line 1: | Line 1: | ||
| <div class="title">YAML Parser</div> | <div class="title">YAML Parser</div> | ||
| Starting with '''svn''' revision '''-r 1902''' released on March 1, 2022, the ROMS metadata is managed with a [[varinfo.yaml|YAML file]], and the regular text file '''varinfo.dat''' is deprecated. The YAML files are simple, easy to follow, elegant, portable, and expandable. ROMS can now process YAML files with its parser module, '''yaml_parser.F'''. Therefore, there is no need to use third-party YAML parsers. | Starting with '''svn''' revision '''-r 1902''' released on March 1, 2022, the ROMS metadata is managed with a [[varinfo.yaml|YAML file]], and the regular text file '''varinfo.dat''' is deprecated. The '''YAML''' files are simple, easy to follow, elegant, portable, and expandable. ROMS can now process YAML files with its parser module, '''[https://www.myroms.org/doxygen/yaml__parser_8F_source.html yaml_parser.F]'''. Therefore, there is no need to use third-party '''YAML''' parsers. | ||
| The ROMS YAML parser source code can be found in <span class="forestGreen">ROMS/Utility</span>. It is written in Fortran 2003 and includes a '''CLASS''' of type '''yaml_tree''' for parsing input YAML files. | The ROMS '''YAML''' parser source code can be found in <span class="forestGreen">ROMS/Utility</span>. It is written in Fortran 2003 and includes a '''CLASS''' of type '''yaml_tree''' for parsing input '''YAML''' files. You may download our YAML parser [https://www.myroms.org/links/yaml_parser.F here|download=yaml_parser.F].   | ||
| ==Introduction== | ==Introduction== | ||
| Although several '''YAML''' parsers for Fortran exist, a more straightforward and uncomplicated parser with substantial capabilities was coded in '''ROMS''',  | Although several '''YAML''' parsers for Fortran exist, a more straightforward and uncomplicated parser with substantial capabilities was coded in '''ROMS''', '''[https://www.myroms.org/links/yaml_parser.F yaml_parser.F]'''. It is a hybrid between standard and Object-Oriented Programming (OOP) principles but without the need for recurrency, polymorphism, and containers (another library). | ||
| The only constraint in the '''ROMS''' parser is that the '''YAML''' file is read '''twice''' for simplicity and to avoid containers. The container is a Fortran vector! The first read determines the indentation policy and the length of the collection vector, <span class="red">list(:)</span> pairs object (derived-type structure <span class="forestGreen">yaml_pair</span>). The first reading is quick. | The only constraint in the '''ROMS''' parser is that the '''YAML''' file is read '''twice''' for simplicity and to avoid containers. The container is a Fortran vector! The first read determines the indentation policy and the length of the collection vector, <span class="red">list(:)</span> pairs object (derived-type structure <span class="forestGreen">yaml_pair</span>). The first reading is quick. | ||
| <div class=" | <div class="code">     <span class="darkTurquoise">TYPE</span>, <span class="darkTurquoise">PUBLIC</span> :: <span class="forestGreen">yaml_pair</span><br /><br />       <span class="darkTurquoise">logical</span> :: <span class="red">has_alias</span>                      <span class="twilightBlue">! alias <span class="violet">*</span> token</span><br />       <span class="darkTurquoise">logical</span> :: <span class="red">has_anchor</span>                     <span class="twilightBlue">! anchor <span class="violet">&</span> token</span><br />       <span class="darkTurquoise">logical</span> :: <span class="red">is_block</span>                       <span class="twilightBlue">! block <span class="violet">-</span> list</span><br />       <span class="darkTurquoise">logical</span> :: <span class="red">is_sequence</span>                    <span class="twilightBlue">! sequence <span class="violet">[]</span> tokens</span><br />       <span class="darkTurquoise">logical</span> :: <span class="red">is_logical</span>                     <span class="twilightBlue">! logical value</span><br />       <span class="darkTurquoise">logical</span> :: <span class="red">is_integer</span>                     <span class="twilightBlue">! integer value</span><br />       <span class="darkTurquoise">logical</span> :: <span class="red">is_real</span>                        <span class="twilightBlue">! floating-point value</span><br />       <span class="darkTurquoise">logical</span> :: <span class="red">is_string</span>                      <span class="twilightBlue">! string value</span><br /><br />       <span class="darkTurquoise">integer</span> :: <span class="red">id</span>                             <span class="twilightBlue">! key/value ID</span><br />       <span class="darkTurquoise">integer</span> :: <span class="red">parent_id</span>                      <span class="twilightBlue">! parent ID</span><br />       <span class="darkTurquoise">integer</span> :: <span class="red">left_padding</span>                   <span class="twilightBlue">! indent level: 0,1,..</span><br /><br />       <span class="darkTurquoise">character</span> (<span class="darkTurquoise">len</span>=:), <span class="darkTurquoise">allocatable</span> :: <span class="red">line</span>    <span class="twilightBlue">! YAML line</span><br />       <span class="darkTurquoise">character</span> (<span class="darkTurquoise">len</span>=:), <span class="darkTurquoise">allocatable</span> :: <span class="red">key</span>     <span class="twilightBlue">! YAML keyword</span><span class="violet">:</span><br />       <span class="darkTurquoise">character</span> (<span class="darkTurquoise">len</span>=:), <span class="darkTurquoise">allocatable</span> :: <span class="red">value</span>   <span class="twilightBlue">! YAML value(s)</span><br />       <span class="darkTurquoise">character</span> (<span class="darkTurquoise">len</span>=:), <span class="darkTurquoise">allocatable</span> :: <span class="red">anchor</span>  <span class="twilightBlue">! anchor keyword</span><br /><br />     <span class="darkTurquoise">END TYPE</span> <span class="forestGreen">yaml_pair</span></div> | ||
| The '''YAML''' file dictionary <span class="darkTurquoise">CLASS</span> <span class="forestGreen">yaml_tree</span> is defined as: | The '''YAML''' file dictionary <span class="darkTurquoise">CLASS</span> <span class="forestGreen">yaml_tree</span> is defined as: | ||
| <div class=" | <div class="code">      <span class="darkTurquoise">TYPE</span>, <span class="darkTurquoise">PUBLIC</span> :: <span class="forestGreen">yaml_tree</span><br /><br />        <span class="darkTurquoise">logical</span> :: <span class="red">IsCreated</span> '''= .FALSE.''' <span class="twilightBlue">! YAML object creation switch</span><br /><br />        <span class="darkTurquoise">integer</span> :: <span class="red">Nbranches</span>           <span class="twilightBlue">! total number of branches</span><br />        <span class="darkTurquoise">integer</span> :: <span class="red">Npairs</span>              <span class="twilightBlue">! total number of pairs</span><br />        <span class="darkTurquoise">integer</span> :: <span class="red">indent</span>              <span class="twilightBlue">! blank indentation policy</span><br /><br />        <span class="darkTurquoise">character</span> (<span class="darkTurquoise">len</span>=:),  <span class="darkTurquoise">allocatable</span> :: <span class="red">filename</span>  <span class="twilightBlue">! YAML file name</span><br /><br />        <span class="darkTurquoise">TYPE</span> (<span class="forestGreen">yaml_pair</span>), <span class="darkTurquoise">pointer</span> :: <span class="red">list(:)</span>         <span class="twilightBlue">! collection pairs</span><br /><br />        <span class="darkTurquoise">CONTAINS</span>                                     <span class="twilightBlue">! CLASS objects</span><br /><br />        <span class="darkTurquoise">PROCEDURE</span> :: <span class="red">create</span>       => <span class="blue">yaml_tree_create</span><br />        <span class="darkTurquoise">PROCEDURE</span> :: <span class="red">destroy</span>      => <span class="blue">yaml_tree_destroy</span><br />        <span class="darkTurquoise">PROCEDURE</span> :: <span class="red">dump</span>         => <span class="blue">yaml_tree_dump</span><br />        <span class="darkTurquoise">PROCEDURE</span> :: <span class="red">extract</span>      => <span class="blue">yaml_tree_extract</span><br />        <span class="darkTurquoise">PROCEDURE</span> :: <span class="red">fill</span>         => <span class="blue">yaml_tree_fill</span><br />        <span class="darkTurquoise">PROCEDURE</span> :: <span class="red">fill_aliases</span> => <span class="blue">yaml_tree_fill_aliases</span><br />        <span class="darkTurquoise">PROCEDURE</span> :: <span class="red">has</span>          => <span class="blue">yaml_tree_has</span><br />        <span class="darkTurquoise">PROCEDURE</span> :: <span class="red">read_line</span>    => <span class="blue">yaml_tree_read_line</span><br /><br />      <span class="darkTurquoise">END TYPE</span> <span class="forestGreen">yaml_tree</span></div> | ||
| The <span class="forestGreen">yaml_tree</span> object stores all the data contained in a specific '''YAML''' file. For Example, in '''ROMS''' the input '''YAML''' metadata dictionary is created and initialized as follows: | The <span class="forestGreen">yaml_tree</span> object stores all the data contained in a specific '''YAML''' file. For Example, in '''ROMS''' the input '''YAML''' metadata dictionary is created and initialized as follows: | ||
| <div class=" | <div class="code">      <span class="darkTurquoise">USE</span> <span class="blue">yaml_parser_mod</span>, <span class="blue">ONLY</span> : <span class="blue">yaml_initialize</span><br /><br />      <span class="darkTurquoise">logical</span> :: <span class="red">Lreport</span>                           <span class="twilightBlue">! verbose report switch</span><br />      <span class="darkTurquoise">integer</span> :: <span class="red">ErrorFlag</span>                         <span class="twilightBlue">! processing error flag</span><br /><br />      <span class="darkTurquoise">TYPE</span> (<span class="forestGreen">yaml_tree</span>) :: <span class="red">self</span>                     <span class="twilightBlue">! declare a dummy YAML object</span><br /><br />      <span class="darkTurquoise">IF</span> (.<span class="darkTurquoise">not</span>.<span class="red">self%IsCreated</span>) <span class="darkTurquoise">THEN</span>                <span class="twilightBlue">! process input YAML file</span><br />        <span class="red">Lreport</span> = <span class="red">.TRUE.</span><br />        <span class="red">ErrorFlag</span> = <span class="blue">yaml_initialize</span> (<span class="red">self</span>, <span class="red">'varinfo.yaml'</span>, <span class="red">Lreport</span>)<br />      <span class="darkTurquoise">END IF</span></div> | ||
| The error management is omitted for clarity. Then, the needed data is extracted from the <span class="red">self</span> object and loaded into the internal '''ROMS''' variables using the overloaded <span class="mediumOrchid">yaml_get</span> API: | The error management is omitted for clarity. Then, the needed data is extracted from the <span class="red">self</span> object and loaded into the internal '''ROMS''' variables using the overloaded <span class="mediumOrchid">yaml_get</span> API: | ||
| <div class=" | <div class="code">      <span class="darkTurquoise">INTERFACE</span> <span class="mediumOrchid">yaml_get</span><br /><br />        <span class="darkTurquoise">MODULE PROCEDURE</span> <span class="blue">yaml_Get_i_struc</span>       <span class="twilightBlue">! Gets integer structure</span><br />        <span class="darkTurquoise">MODULE PROCEDURE</span> <span class="blue">yaml_Get_l_struc</span>       <span class="twilightBlue">! Gets logical structure</span><br />        <span class="darkTurquoise">MODULE PROCEDURE</span> <span class="blue">yaml_Get_r_struc</span>       <span class="twilightBlue">! Gets real    structure</span><br />        <span class="darkTurquoise">MODULE PROCEDURE</span> <span class="blue">yaml_Get_s_struc</span>       <span class="twilightBlue">! Gets string  structure</span><br /><br />        <span class="darkTurquoise">MODULE PROCEDURE</span> <span class="blue">yaml_Get_ivar_0d</span>       <span class="twilightBlue">! Gets integer value</span><br />        <span class="darkTurquoise">MODULE PROCEDURE</span> <span class="blue">yaml_Get_ivar_1d</span>       <span class="twilightBlue">! Gest integer values</span><br />        <span class="darkTurquoise">MODULE PROCEDURE</span> <span class="blue">yaml_Get_lvar_0d</span>       <span class="twilightBlue">! Gets logical value</span><br />        <span class="darkTurquoise">MODULE PROCEDURE</span> <span class="blue">yaml_Get_lvar_1d</span>       <span class="twilightBlue">! Gets logical values</span><br />        <span class="darkTurquoise">MODULE PROCEDURE</span> <span class="blue">yaml_Get_rvar_0d</span>       <span class="twilightBlue">! Gets real    value</span><br />        <span class="darkTurquoise">MODULE PROCEDURE</span> <span class="blue">yaml_Get_rvar_1d</span>       <span class="twilightBlue">! Gets real    values</span><br />        <span class="darkTurquoise">MODULE PROCEDURE</span> <span class="blue">yaml_Get_svar_0d</span>       <span class="twilightBlue">! Gets string  value</span><br />        <span class="darkTurquoise">MODULE PROCEDURE</span> <span class="blue">yaml_Get_svar_1d</span>       <span class="twilightBlue">! Gets string  values</span><br /><br />      <span class="darkTurquoise">END INTERFACE</span> <span class="mediumOrchid">yaml_get</span></div> | ||
| Additionally, the '''YAML''' parser module has the following public and private routines/functions used during processing: | Additionally, the '''YAML''' parser module has the following public and private routines/functions used during processing: | ||
| <div class=" | <div class="code">      <span class="darkTurquoise">PUBLIC</span>  :: <span class="blue">yaml_AssignString</span><br />      <span class="darkTurquoise">PUBLIC</span>  :: <span class="blue">yaml_Error</span><br />      <span class="darkTurquoise">PUBLIC</span>  :: <span class="blue">yaml_get</span><br />      <span class="darkTurquoise">PUBLIC</span>  :: <span class="blue">yaml_initialize</span><br /><br />      <span class="darkTurquoise">PRIVATE</span> :: <span class="blue">yaml_CountKeys</span><br />      <span class="darkTurquoise">PRIVATE</span> :: <span class="blue">yaml_LowerCase</span><br />      <span class="darkTurquoise">PRIVATE</span> :: <span class="blue">yaml_UpperCase</span><br />      <span class="darkTurquoise">PRIVATE</span> :: <span class="blue">yaml_ValueType</span></div> | ||
| The parser module is self-contained with minimal dependencies on other '''ROMS''' modules: | The parser module is self-contained with minimal dependencies on other '''ROMS''' modules: | ||
| <div class=" | <div class="code">      <span class="darkTurquoise">USE</span> <span class="blue">mod_kinds</span>,    <span class="darkTurquoise">ONLY</span> : <span class="red">kind_real</span>    => <span class="red">dp</span>         <span class="twilightBlue">! double-precision</span><br />      <span class="darkTurquoise">USE</span> <span class="blue">mod_parallel</span>, <span class="darkTurquoise">ONLY</span> : <span class="red">yaml_Master</span>  => <span class="red">Master</span>     <span class="twilightBlue">! master PET</span><br />      <span class="darkTurquoise">USE</span> <span class="blue">mod_scalars</span>,  <span class="darkTurquoise">ONLY</span> : <span class="red">yaml_ErrFlag</span> => <span class="red">exit_flag</span>  <span class="twilightBlue">! error flag</span><br />      <span class="darkTurquoise">USE</span> <span class="blue">mod_iounits</span>,  <span class="darkTurquoise">ONLY</span> : <span class="red">yaml_stdout</span>  => <span class="red">stdout</span>     <span class="twilightBlue">! standard ouput</span></div> | ||
| Overall, the parser is very fast and works in parallel. All PETs are involved in their dictionary copy to avoid overhead from collective MPI calls. | Overall, the parser is very fast and works in parallel. All PETs are involved in their dictionary copy to avoid overhead from collective MPI calls. | ||
| Line 56: | Line 56: | ||
| ==Extraction== | ==Extraction== | ||
| Several derived-type structures are declared to facilitate the extraction of similar data blocks compactly from the YAML dictionary ('''CLASS  | Several derived-type structures are declared to facilitate the extraction of similar data blocks compactly from the YAML dictionary ('''CLASS''' <span class="forestGreen">yaml_tree</span>) in a compact way. | ||
| <div class=" | <div class="code">      <span class="darkTurquoise">TYPE</span>, <span class="darkTurquoise">PUBLIC</span> :: <span class="forestGreen">yaml_Ivec</span>                      <span class="twilightBlue">! integer structure</span><br />        <span class="darkTurquoise">integer</span>, <span class="darkTurquoise">allocatable</span> :: <span class="red">vector(:)</span>            <span class="twilightBlue">! vector values</span><br />      <span class="darkTurquoise">END TYPE</span> <span class="forestGreen">yaml_Ivec</span><br /><br />      <span class="darkTurquoise">TYPE</span>, <span class="darkTurquoise">PUBLIC</span> :: <span class="forestGreen">yaml_Lvec</span>                      <span class="twilightBlue">! logical structure</span><br />        <span class="darkTurquoise">logical</span>, <span class="darkTurquoise">allocatable</span> :: <span class="red">vector(:)</span>            <span class="twilightBlue">! vector values</span><br />      <span class="darkTurquoise">END TYPE</span> <span class="forestGreen">yaml_Lvec</span><br /><br />      <span class="darkTurquoise">TYPE</span>, <span class="darkTurquoise">PUBLIC</span> :: <span class="forestGreen">yaml_Rvec</span>                      <span class="twilightBlue">! real structure</span><br />        <span class="darkTurquoise">real</span> (<span class="red">kind_real</span>), <span class="darkTurquoise">allocatable</span> :: <span class="red">vector(:)</span>   <span class="twilightBlue">! vector values</span><br />      <span class="darkTurquoise">END TYPE</span> <span class="forestGreen">yaml_Rvec</span><br /><br />      <span class="darkTurquoise">TYPE</span>, <span class="darkTurquoise">PUBLIC</span> :: <span class="forestGreen">yaml_Svec</span>                      <span class="twilightBlue">! string structure</span><br />        <span class="darkTurquoise">character</span> (<span class="darkTurquoise">len</span>=:), <span class="darkTurquoise">allocatable</span> :: <span class="darkTurquoise">value</span>      <span class="twilightBlue">! scalar value</span><br />        <span class="darkTurquoise">TYPE</span> (<span class="forestGreen">yaml_Svec</span>),  <span class="darkTurquoise">pointer</span> :: <span class="red">vector(:)</span>      <span class="twilightBlue">! recursive vector</span><br />      <span class="darkTurquoise">END TYPE</span> <span class="forestGreen">yaml_Svec</span>                             <span class="twilightBlue">! values</span></div> | ||
| The derived-type structure below, extended/inherited from parent "yaml_Svec", extracts hierarchies of keys and associated values from the '''YAML''' dictionary object. The calling program specifies a key string that may be generated by aggregating nested keys with a period. Also, it can extract flow sequence string element values that are separated by commas. | The derived-type structure below, extended/inherited from parent "yaml_Svec", extracts hierarchies of keys and associated values from the '''YAML''' dictionary object. The calling program specifies a key string that may be generated by aggregating nested keys with a period. Also, it can extract flow sequence string element values that are separated by commas. | ||
| <div class=" | <div class="code">      <span class="darkTurquoise">TYPE</span>, <span class="darkTurquoise">PRIVATE</span>, <span class="darkTurquoise">EXTENDS</span>(<span class="forestGreen">yaml_Svec</span>) :: <span class="forestGreen">yaml_extract</span><br />        <span class="darkTurquoise">logical</span> :: <span class="red">has_vector</span>                        <span class="twilightBlue">! true if loaded vector values</span><br />      <span class="darkTurquoise">END TYPE</span> <span class="forestGreen">yaml_extract</span></div> | ||
| These public structures can be used in applications to extract block list '''YAML''' constructs. The key may represent a '''sequence flow''' <span class="red">[</span>...<span class="red">]</span> with a vector of values. The values can be integers, logicals, reals, or strings.  For example, suppose that the '''YAML''' file has the following '''blocking list''' entries: | These public structures can be used in applications to extract block list '''YAML''' constructs. The key may represent a '''sequence flow''' <span class="red">[</span>...<span class="red">]</span> with a vector of values. The values can be integers, logicals, reals, or strings.  For example, suppose that the '''YAML''' file has the following '''blocking list''' entries: | ||
| Line 70: | Line 70: | ||
| Then, the code to extract the data from the '''YAML''' dictionary (<span class="red">self</span>) is straightforward (see above for how to create and initialize a '''YAML''' dictionary object): | Then, the code to extract the data from the '''YAML''' dictionary (<span class="red">self</span>) is straightforward (see above for how to create and initialize a '''YAML''' dictionary object): | ||
| <div class=" | <div class="code">      <span class="darkTurquoise">USE</span> <span class="blue">yaml_parser_mod</span>, <span class="darkTurquoise">ONLY</span> : <span class="blue">yaml_get</span><br /><br />      <span class="darkTurquoise">integer</span> :: <span class="red">ErrorFlag</span>                             <span class="twilightBlue">! processing error flag</span><br /><br />      <span class="darkTurquoise">logical</span>,  <span class="darkTurquoise">allocatable</span> :: <span class="red">L(:)</span>                    <span class="twilightBlue">! logical data</span><br />      <span class="darkTurquoise">real</span>(<span class="red">dp</span>), <span class="darkTurquoise">allocatable</span> :: <span class="red">F(:)</span>                    <span class="twilightBlue">! floating point data</span><br /><br />      <span class="darkTurquoise">TYPE</span> (<span class="forestGreen">yaml_Svec</span>), <span class="darkTurquoise">allocatable</span> :: <span class="red">S1(:)</span>, <span class="red">S2(:)</span>    <span class="twilightBlue">! string data</span><br /><br />      <span class="red">ErrorFlag</span> = <span class="blue">yaml_get</span> (<span class="red">self</span>, <span class="mediumOrchid">'import<span class="red">.</span>short_name'</span>,     <span class="red">S1</span>)<br />      <span class="red">ErrorFlag</span> = <span class="blue">yaml_get</span> (<span class="red">self</span>, <span class="mediumOrchid">'import<span class="red">.</span>data_variables'</span>, <span class="red">S2</span>)<br />      <span class="red">ErrorFlag</span> = <span class="blue">yaml_get</span> (<span class="red">self</span>, <span class="mediumOrchid">'import<span class="red">.</span>scale'</span>,          <span class="red">F</span>)<br />      <span class="red">ErrorFlag</span> = <span class="blue">yaml_get</span> (<span class="red">self</span>, <span class="mediumOrchid">'import<span class="red">.</span>debug write'</span>,    <span class="red">L</span>)</div> | ||
| The above extraction statement yields the following values from a single invocation of the overloaded function <span class="blue">yaml_get</span>: | Notice that the internal adopted syntax is to aggregate the hierarchy of <span class="violet">keys</span> separated by a <span class="red">dot</span> in the <span class="mediumOrchid">keystring</span> (''e.g.'', <span class="mediumOrchid">'import<span class="red">.</span>debug write'</span>) during the extraction call. The above extraction statement yields the following values from a single invocation of the overloaded function <span class="blue">yaml_get</span>: | ||
| <div class=" | <div class="code">     <span class="red">S1(1)%value</span> = <span class="mediumOrchid">'shflux'</span>              <span class="twilightBlue">! <span class="violet">'short_name'</span> single value</span><br />     <span class="red">S1(2)%value</span> = <span class="mediumOrchid">'sustr'</span><br />     <span class="red">S1(3)%value</span> = <span class="mediumOrchid">'svstr'</span><br /><br />     <span class="red">S2(1)%vector(1)%value</span> = <span class="mediumOrchid">'shf'</span>       <span class="twilightBlue">! <span class="violet">'data_variables'</span> vector of values</span><br />     <span class="red">S2(1)%vector(2)%value</span> = <span class="mediumOrchid">'shf_time'</span><br /><br />     <span class="red">S2(2)%vector(1)%value</span> = <span class="mediumOrchid">'taux'</span>      <span class="twilightBlue">! <span class="violet">'data_variables'</span> vector of values</span><br />     <span class="red">S2(2)%vector(2)%value</span> = <span class="mediumOrchid">'atm_time'</span><br /><br />     <span class="red">S2(3)%vector(1)%value</span> = <span class="mediumOrchid">'tauy'</span>      <span class="twilightBlue">! <span class="violet">'data_variables'</span> vector of values</span><br />     <span class="red">S2(3)%vector(2)%value</span> = <span class="mediumOrchid">'atm_time'</span><br /><br />     <span class="red">F(1)</span> = <span class="mediumOrchid">1.0d0</span>                        <span class="twilightBlue">! <span class="violet">'scale'</span> single value</span><br />     <span class="red">F(2)</span> = <span class="mediumOrchid">1.0d0</span><br />     <span class="red">F(3)</span> = <span class="mediumOrchid">1.0d0</span><br /><br />     <span class="red">L(1)</span> = <span class="mediumOrchid">.TRUE.</span>                       <span class="twilightBlue">! <span class="violet">'debug write'</span> single value</span><br />     <span class="red">L(2)</span> = <span class="mediumOrchid">.FALSE.</span><br />     <span class="red">L(3)</span> = <span class="mediumOrchid">.FALSE.</span></div> | ||
| Check <span class="blue">ROMS/Utility/get_metdata.F</span> for more details on how the metadata is processed from input '''YAML''' files. | Check <span class="blue">ROMS/Utility/get_metdata.F</span> for more details on how the metadata is processed from input '''YAML''' files. | ||
Latest revision as of 14:48, 16 March 2023
Starting with svn revision -r 1902 released on March 1, 2022, the ROMS metadata is managed with a YAML file, and the regular text file varinfo.dat is deprecated. The YAML files are simple, easy to follow, elegant, portable, and expandable. ROMS can now process YAML files with its parser module, yaml_parser.F. Therefore, there is no need to use third-party YAML parsers.
The ROMS YAML parser source code can be found in ROMS/Utility. It is written in Fortran 2003 and includes a CLASS of type yaml_tree for parsing input YAML files. You may download our YAML parser here|download=yaml_parser.F.
Introduction
Although several YAML parsers for Fortran exist, a more straightforward and uncomplicated parser with substantial capabilities was coded in ROMS, yaml_parser.F. It is a hybrid between standard and Object-Oriented Programming (OOP) principles but without the need for recurrency, polymorphism, and containers (another library).
The only constraint in the ROMS parser is that the YAML file is read twice for simplicity and to avoid containers. The container is a Fortran vector! The first read determines the indentation policy and the length of the collection vector, list(:) pairs object (derived-type structure yaml_pair). The first reading is quick.
logical :: has_alias ! alias * token
logical :: has_anchor ! anchor & token
logical :: is_block ! block - list
logical :: is_sequence ! sequence [] tokens
logical :: is_logical ! logical value
logical :: is_integer ! integer value
logical :: is_real ! floating-point value
logical :: is_string ! string value
integer :: id ! key/value ID
integer :: parent_id ! parent ID
integer :: left_padding ! indent level: 0,1,..
character (len=:), allocatable :: line ! YAML line
character (len=:), allocatable :: key ! YAML keyword:
character (len=:), allocatable :: value ! YAML value(s)
character (len=:), allocatable :: anchor ! anchor keyword
END TYPE yaml_pair
The YAML file dictionary CLASS yaml_tree is defined as:
logical :: IsCreated = .FALSE. ! YAML object creation switch
integer :: Nbranches ! total number of branches
integer :: Npairs ! total number of pairs
integer :: indent ! blank indentation policy
character (len=:), allocatable :: filename ! YAML file name
TYPE (yaml_pair), pointer :: list(:) ! collection pairs
CONTAINS ! CLASS objects
PROCEDURE :: create => yaml_tree_create
PROCEDURE :: destroy => yaml_tree_destroy
PROCEDURE :: dump => yaml_tree_dump
PROCEDURE :: extract => yaml_tree_extract
PROCEDURE :: fill => yaml_tree_fill
PROCEDURE :: fill_aliases => yaml_tree_fill_aliases
PROCEDURE :: has => yaml_tree_has
PROCEDURE :: read_line => yaml_tree_read_line
END TYPE yaml_tree
The yaml_tree object stores all the data contained in a specific YAML file. For Example, in ROMS the input YAML metadata dictionary is created and initialized as follows:
logical :: Lreport ! verbose report switch
integer :: ErrorFlag ! processing error flag
TYPE (yaml_tree) :: self ! declare a dummy YAML object
IF (.not.self%IsCreated) THEN ! process input YAML file
Lreport = .TRUE.
ErrorFlag = yaml_initialize (self, 'varinfo.yaml', Lreport)
END IF
The error management is omitted for clarity. Then, the needed data is extracted from the self object and loaded into the internal ROMS variables using the overloaded yaml_get API:
MODULE PROCEDURE yaml_Get_i_struc ! Gets integer structure
MODULE PROCEDURE yaml_Get_l_struc ! Gets logical structure
MODULE PROCEDURE yaml_Get_r_struc ! Gets real structure
MODULE PROCEDURE yaml_Get_s_struc ! Gets string structure
MODULE PROCEDURE yaml_Get_ivar_0d ! Gets integer value
MODULE PROCEDURE yaml_Get_ivar_1d ! Gest integer values
MODULE PROCEDURE yaml_Get_lvar_0d ! Gets logical value
MODULE PROCEDURE yaml_Get_lvar_1d ! Gets logical values
MODULE PROCEDURE yaml_Get_rvar_0d ! Gets real value
MODULE PROCEDURE yaml_Get_rvar_1d ! Gets real values
MODULE PROCEDURE yaml_Get_svar_0d ! Gets string value
MODULE PROCEDURE yaml_Get_svar_1d ! Gets string values
END INTERFACE yaml_get
Additionally, the YAML parser module has the following public and private routines/functions used during processing:
PUBLIC :: yaml_Error
PUBLIC :: yaml_get
PUBLIC :: yaml_initialize
PRIVATE :: yaml_CountKeys
PRIVATE :: yaml_LowerCase
PRIVATE :: yaml_UpperCase
PRIVATE :: yaml_ValueType
The parser module is self-contained with minimal dependencies on other ROMS modules:
USE mod_parallel, ONLY : yaml_Master => Master ! master PET
USE mod_scalars, ONLY : yaml_ErrFlag => exit_flag ! error flag
USE mod_iounits, ONLY : yaml_stdout => stdout ! standard ouput
Overall, the parser is very fast and works in parallel. All PETs are involved in their dictionary copy to avoid overhead from collective MPI calls.
Capabilities
Currently, the YAML parser supports the following features:
- Single or multiple line comments start with a hash #. Also, comment after a key/value pair is allowed. All comments are skipped during processing.
- It has an unlimited nested structure (lists, mappings, hierarchies). Indentation of whitespace is used to denote structure.
- It has an unrestricted schema indentation. However, some schema validators recommend or impose two whitespaces indentations.
- A colon follows a key to denote a mapping value like:ocean_model: ROMS
- It supports Anchors and Aliases.ATM_component: &ATM WRF
 metadata:
 - standard_name: surface_eastward_wind
 long_name: surface eastward wind
 short_name: Uwind
 data_variables: [uwind, time]
 source_units: m s-1
 destination_units: m s-1
 source_grid: cell_center
 destination_grid: cell_center
 add_offset: 0.0d0
 scale: 1.0d0
 debug_write: false
 connected_to: *ATM # u10
 regrid_method: bilinear
 extrapolate_method: none
- It supports blocking lists: members are denoted by a leading hyphen-and-space, which is considered part of the indentation.
- It supports a flow sequence: a vector list with values enclosed in square brackets and separated by a comma-and-space, like a keyword: [val1, ..., valN].
- The keyword value(s) is (are) processed and stored as strings but converted to a logical, integer, floating-point, or derived-type when appropriate during extraction. If particular derived-type values are needed, the caller can process such a structure outside the parser.
- It removes unwanted control characters like tabs and separators (ASCII character code 0-31).
- It is restricted to the English uppercase and lowercase alphabet but can be expanded to other characters (see yaml_ValueType routine).
- Multiple or continuation lines are supported. So, for example, we can have:state variables: [sea_surface_height_anomaly,
 barotropic_sea_water_x_velocity,
 barotropic_sea_water_y_velocity,
 sea_water_x_velocity,
 sea_water_y_velocity,
 sea_water_potential_temperature,
 sea_water_practical_salinity]
Extraction
Several derived-type structures are declared to facilitate the extraction of similar data blocks compactly from the YAML dictionary (CLASS yaml_tree) in a compact way.
integer, allocatable :: vector(:) ! vector values
END TYPE yaml_Ivec
TYPE, PUBLIC :: yaml_Lvec ! logical structure
logical, allocatable :: vector(:) ! vector values
END TYPE yaml_Lvec
TYPE, PUBLIC :: yaml_Rvec ! real structure
real (kind_real), allocatable :: vector(:) ! vector values
END TYPE yaml_Rvec
TYPE, PUBLIC :: yaml_Svec ! string structure
character (len=:), allocatable :: value ! scalar value
TYPE (yaml_Svec), pointer :: vector(:) ! recursive vector
END TYPE yaml_Svec ! values
The derived-type structure below, extended/inherited from parent "yaml_Svec", extracts hierarchies of keys and associated values from the YAML dictionary object. The calling program specifies a key string that may be generated by aggregating nested keys with a period. Also, it can extract flow sequence string element values that are separated by commas.
logical :: has_vector ! true if loaded vector values
END TYPE yaml_extract
These public structures can be used in applications to extract block list YAML constructs. The key may represent a sequence flow [...] with a vector of values. The values can be integers, logicals, reals, or strings. For example, suppose that the YAML file has the following blocking list entries:
- standard_name: surface_downward_heat_flux_in_sea_water
long_name: surface net heat flux
short_name: shflux
data_variables: [shf, shf_time]
scale: 1.0d0
debug write: true
- standard_name: surface_wind_x_stress
long_name: surface zonal wind stress component
short_name: sustr
data_variables: [taux, atm_time]
scale: 1.0d0
debug write: false
- standard_name: surface_wind_y_stress
long_name: surface meridional wind stress component
short_name: svstr
data_variables: [tauy, atm_time]
scale: 1.0d0
debug write: false
Then, the code to extract the data from the YAML dictionary (self) is straightforward (see above for how to create and initialize a YAML dictionary object):
integer :: ErrorFlag ! processing error flag
logical, allocatable :: L(:) ! logical data
real(dp), allocatable :: F(:) ! floating point data
TYPE (yaml_Svec), allocatable :: S1(:), S2(:) ! string data
ErrorFlag = yaml_get (self, 'import.short_name', S1)
ErrorFlag = yaml_get (self, 'import.data_variables', S2)
ErrorFlag = yaml_get (self, 'import.scale', F)
ErrorFlag = yaml_get (self, 'import.debug write', L)
Notice that the internal adopted syntax is to aggregate the hierarchy of keys separated by a dot in the keystring (e.g., 'import.debug write') during the extraction call. The above extraction statement yields the following values from a single invocation of the overloaded function yaml_get:
S1(2)%value = 'sustr'
S1(3)%value = 'svstr'
S2(1)%vector(1)%value = 'shf' ! 'data_variables' vector of values
S2(1)%vector(2)%value = 'shf_time'
S2(2)%vector(1)%value = 'taux' ! 'data_variables' vector of values
S2(2)%vector(2)%value = 'atm_time'
S2(3)%vector(1)%value = 'tauy' ! 'data_variables' vector of values
S2(3)%vector(2)%value = 'atm_time'
F(1) = 1.0d0 ! 'scale' single value
F(2) = 1.0d0
F(3) = 1.0d0
L(1) = .TRUE. ! 'debug write' single value
L(2) = .FALSE.
L(3) = .FALSE.
Check ROMS/Utility/get_metdata.F for more details on how the metadata is processed from input YAML files.
