diff --git a/Test/expected-results/test.epub b/Test/expected-results/test.epub index c50e88870..9e89ccdcd 100644 Binary files a/Test/expected-results/test.epub and b/Test/expected-results/test.epub differ diff --git a/Test/expected-results/test.isosch b/Test/expected-results/test.isosch index 476b0e2a9..2844175fb 100644 --- a/Test/expected-results/test.isosch +++ b/Test/expected-results/test.isosch @@ -11,7 +11,6 @@ - diff --git a/Test/expected-results/test.odd.html b/Test/expected-results/test.odd.html index 5f1f25d00..7ff8dcfa1 100644 --- a/Test/expected-results/test.odd.html +++ b/Test/expected-results/test.odd.html @@ -566,10 +566,7 @@

att.datcat : - provides attributes that are used to align XML elements or attributes with the appropriate Data Categories (DCs) defined by the ISO 12620:2009 standard and stored in the Web repository called ISOCat at - - http://www.isocat.org/ - .[ + provides attributes that are used to align XML elements or attributes with the appropriate Data Categories (DCs) defined by an external taxonomy, in this way establishing the identity of information containers and values, and providing means of interpreting them.[ 9.5.2. Lexical View 18.3. Other Atomic Feature Values] diff --git a/Test/expected-results/test.rng b/Test/expected-results/test.rng index eb03d0f02..c76c90043 100644 --- a/Test/expected-results/test.rng +++ b/Test/expected-results/test.rng @@ -561,11 +561,12 @@ Sample values include: 1] rubbing; 2] mildew; 3] smoke + - - contains a PID (persistent identifier) that aligns the given element with the appropriate Data Category (or categories) in ISOcat. + + provides a pointer to a definition of, and/or general information about, (a) an information container (element or attribute) or (b) a value of an information container (element content or attribute value), by referencing an external taxonomy or ontology. If @valueDatcat is present in the immediate context, this attribute takes on role (a), while @valueDatcat performs role (b). @@ -578,8 +579,22 @@ Sample values include: 1] rubbing; 2] mildew; 3] smoke - - contains a PID (persistent identifier) that aligns the content of the given element or the value of the given attribute with the appropriate simple Data Category (or categories) in ISOcat. + + provides a definition of, and/or general information about a value of an information container (element content or attribute value), by reference to an external taxonomy or ontology. Used especially where a contrast with @datcat is needed. 
+ + + + \S+ + + + + + + + + + + provides a definition of, and/or general information about, information structure of an object referenced or modeled by the containing element, by reference to an external taxonomy or ontology. This attribute has the characteristics of the @datcat attribute, except that it addresses not its containing element, but an object that is being referenced or modeled by its containing element. @@ -6568,6 +6583,7 @@ On , either the @marks attribute should be used, or a paragraph of descri (element usage) documents the usage of a specific element within a specified document. [2.3.4. The Tagging Declaration] + (generic identifier) specifies the name (generic identifier) of the element indicated by the tag, within the namespace indicated by the parent <namespace> element. @@ -10993,7 +11009,8 @@ Suggested values include: 1] simple; 2] lemma; 3] variant; 4] compound; 5] deriv - classifies the grammatical information given according to some convenient typology—in the case of terminological information, preferably the dictionary of data element types specified in ISO 12620. + classifies grammatical information provided by <gram> according to some convenient and shared typology, ideally one defined in an external reference taxonomy, such as the CLARIN Concept + Registry. Sample values include: 1] pos (part of speech); 2] gen (gender); 3] num (number); 4] animate; 5] proper [^\p{C}\p{Z}]+ diff --git a/Test/expected-results/test.xml.odt b/Test/expected-results/test.xml.odt index 3f5a9ef81..acf13f5b6 100644 Binary files a/Test/expected-results/test.xml.odt and b/Test/expected-results/test.xml.odt differ diff --git a/Test/expected-results/test15.odd.html b/Test/expected-results/test15.odd.html index b4fc7b336..bdeb9bf9a 100644 --- a/Test/expected-results/test15.odd.html +++ b/Test/expected-results/test15.odd.html @@ -1920,10 +1920,7 @@

att.datcat : - provides attributes that are used to align XML elements or attributes with the appropriate Data Categories (DCs) defined by the ISO 12620:2009 standard and stored in the Web repository called ISOCat at - - http://www.isocat.org/ - .[ + provides attributes that are used to align XML elements or attributes with the appropriate Data Categories (DCs) defined by an external taxonomy, in this way establishing the identity of information containers and values, and providing means of interpreting them.[ 9.5.2. Lexical View 18.3. Other Atomic Feature Values] diff --git a/Test/expected-results/test15.odd.rnc b/Test/expected-results/test15.odd.rnc index cb3d81086..a618f2544 100644 --- a/Test/expected-results/test15.odd.rnc +++ b/Test/expected-results/test15.odd.rnc @@ -1,6 +1,5 @@ namespace a = "http://relaxng.org/ns/compatibility/annotations/1.0" namespace ns1 = "http://www.w3.org/1999/xhtml" -namespace ns2 = "http://www.isocat.org/ns/dcr" namespace rng = "http://relaxng.org/ns/structure/1.0" namespace sch = "http://purl.oclc.org/dsdl/schematron" default namespace tei = "http://www.tei-c.org/ns/1.0" @@ -400,19 +399,46 @@ att.datable.attribute.period = } }? att.datcat.attributes = - att.datcat.attribute.datcat, att.datcat.attribute.valueDatcat + att.datcat.attribute.datcat, + att.datcat.attribute.valueDatcat, + att.datcat.attribute.targetDatcat att.datcat.attribute.datcat = - - ## contains a PID (persistent identifier) that aligns the given element with the appropriate Data Category (or categories) in ISOcat. - attribute ns2:datcat { + [ + a:documentation [ + "provides a pointer to a definition of, and/or general information about, (a) an information container (element or attribute) or (b) a value of an information container (element content or attribute value), by referencing an external taxonomy or ontology. 
If " + ns1:code [ "@valueDatcat" ] + " is present in the immediate context, this attribute takes on role (a), while " + ns1:code [ "@valueDatcat" ] + " performs role (b)." + ] + ] + attribute datcat { list { xsd:anyURI { pattern = "\S+" }+ } }? att.datcat.attribute.valueDatcat = - - ## contains a PID (persistent identifier) that aligns the content of the given element or the value of the given attribute with the appropriate simple Data Category (or categories) in ISOcat. - attribute ns2:valueDatcat { + [ + a:documentation [ + "provides a definition of, and/or general information about a value of an information container (element content or attribute value), by reference to an external taxonomy or ontology. Used especially where a contrast with " + ns1:code [ "@datcat" ] + " is needed." + ] + ] + attribute valueDatcat { + list { + xsd:anyURI { pattern = "\S+" }+ + } + }? +att.datcat.attribute.targetDatcat = + [ + a:documentation [ + "provides a definition of, and/or general information about, information structure of an object referenced or modeled by the containing element, by reference to an external taxonomy or ontology. This attribute has the characteristics of the " + ns1:code [ "@datcat" ] + " attribute, except that it addresses not its containing element, but an object that is being referenced or modeled by its containing element." 
+ ] + ] + attribute targetDatcat { list { xsd:anyURI { pattern = "\S+" }+ } @@ -4003,6 +4029,7 @@ tagUsage = element tagUsage { macro.limitedContent, att.global.attributes, + att.datcat.attributes, [ a:documentation [ "(generic identifier) specifies the name (generic identifier) of the element indicated by the tag, within the namespace indicated by the parent " diff --git a/Test/expected-results/test21.odd.rnc b/Test/expected-results/test21.odd.rnc index c8c0fe731..e7787ca09 100644 --- a/Test/expected-results/test21.odd.rnc +++ b/Test/expected-results/test21.odd.rnc @@ -1,6 +1,5 @@ namespace a = "http://relaxng.org/ns/compatibility/annotations/1.0" namespace ns1 = "http://www.w3.org/1999/xhtml" -namespace ns2 = "http://www.isocat.org/ns/dcr" namespace rng = "http://relaxng.org/ns/structure/1.0" namespace sch = "http://purl.oclc.org/dsdl/schematron" default namespace tei = "http://www.tei-c.org/ns/1.0" @@ -433,19 +432,46 @@ att.datable.attribute.period = } }? att.datcat.attributes = - att.datcat.attribute.datcat, att.datcat.attribute.valueDatcat + att.datcat.attribute.datcat, + att.datcat.attribute.valueDatcat, + att.datcat.attribute.targetDatcat att.datcat.attribute.datcat = - - ## contains a PID (persistent identifier) that aligns the given element with the appropriate Data Category (or categories) in ISOcat. - attribute ns2:datcat { + [ + a:documentation [ + "provides a pointer to a definition of, and/or general information about, (a) an information container (element or attribute) or (b) a value of an information container (element content or attribute value), by referencing an external taxonomy or ontology. If " + ns1:code [ "@valueDatcat" ] + " is present in the immediate context, this attribute takes on role (a), while " + ns1:code [ "@valueDatcat" ] + " performs role (b)." + ] + ] + attribute datcat { list { xsd:anyURI { pattern = "\S+" }+ } }? 
att.datcat.attribute.valueDatcat = - - ## contains a PID (persistent identifier) that aligns the content of the given element or the value of the given attribute with the appropriate simple Data Category (or categories) in ISOcat. - attribute ns2:valueDatcat { + [ + a:documentation [ + "provides a definition of, and/or general information about a value of an information container (element content or attribute value), by reference to an external taxonomy or ontology. Used especially where a contrast with " + ns1:code [ "@datcat" ] + " is needed." + ] + ] + attribute valueDatcat { + list { + xsd:anyURI { pattern = "\S+" }+ + } + }? +att.datcat.attribute.targetDatcat = + [ + a:documentation [ + "provides a definition of, and/or general information about, information structure of an object referenced or modeled by the containing element, by reference to an external taxonomy or ontology. This attribute has the characteristics of the " + ns1:code [ "@datcat" ] + " attribute, except that it addresses not its containing element, but an object that is being referenced or modeled by its containing element." + ] + ] + attribute targetDatcat { list { xsd:anyURI { pattern = "\S+" }+ } @@ -5852,6 +5878,7 @@ tagUsage = element tagUsage { macro.limitedContent, att.global.attributes, + att.datcat.attributes, ## (nom de l'élément) nom (identifiant générique) de l’élément indiqué par la balise. 
attribute gi { xsd:Name }, diff --git a/Test/expected-results/test30.dtd b/Test/expected-results/test30.dtd index 42a3f2cc7..471ed9143 100644 --- a/Test/expected-results/test30.dtd +++ b/Test/expected-results/test30.dtd @@ -342,9 +342,13 @@ datcat CDATA #IMPLIED'> + + + %Tatt.datcat.attribute.valueDatcat; + %Tatt.datcat.attribute.targetDatcat;'> diff --git a/Test/expected-results/test30.rnc b/Test/expected-results/test30.rnc index 7003181e3..8c0aaf772 100644 --- a/Test/expected-results/test30.rnc +++ b/Test/expected-results/test30.rnc @@ -1,7 +1,6 @@ namespace a = "http://relaxng.org/ns/compatibility/annotations/1.0" namespace ns1 = "http://www.w3.org/1999/xhtml" -namespace ns2 = "http://www.isocat.org/ns/dcr" -namespace ns3 = "http://www.oucs.ox.ac.uk/ns" +namespace ns2 = "http://www.oucs.ox.ac.uk/ns" namespace rng = "http://relaxng.org/ns/structure/1.0" namespace sch = "http://purl.oclc.org/dsdl/schematron" default namespace tei = "http://www.tei-c.org/ns/1.0" @@ -31,7 +30,7 @@ Tmacro.specialPara = | Tmodel.global)* Tmacro.xtext = (text | Tmodel.gLike)* anyElement-xenoData = - element * - (tei:* | ns3:* | teix:egXML) { + element * - (tei:* | ns2:* | teix:egXML) { attribute * { text }*, (text | anyElement-xenoData)* } @@ -451,19 +450,46 @@ Tatt.datable.attribute.period = } }? Tatt.datcat.attributes = - Tatt.datcat.attribute.datcat, Tatt.datcat.attribute.valueDatcat + Tatt.datcat.attribute.datcat, + Tatt.datcat.attribute.valueDatcat, + Tatt.datcat.attribute.targetDatcat Tatt.datcat.attribute.datcat = - - ## contains a PID (persistent identifier) that aligns the given element with the appropriate Data Category (or categories) in ISOcat. - attribute ns2:datcat { + [ + a:documentation [ + "provides a pointer to a definition of, and/or general information about, (a) an information container (element or attribute) or (b) a value of an information container (element content or attribute value), by referencing an external taxonomy or ontology. 
If " + ns1:code [ "@valueDatcat" ] + " is present in the immediate context, this attribute takes on role (a), while " + ns1:code [ "@valueDatcat" ] + " performs role (b)." + ] + ] + attribute datcat { list { xsd:anyURI { pattern = "\S+" }+ } }? Tatt.datcat.attribute.valueDatcat = - - ## contains a PID (persistent identifier) that aligns the content of the given element or the value of the given attribute with the appropriate simple Data Category (or categories) in ISOcat. - attribute ns2:valueDatcat { + [ + a:documentation [ + "provides a definition of, and/or general information about a value of an information container (element content or attribute value), by reference to an external taxonomy or ontology. Used especially where a contrast with " + ns1:code [ "@datcat" ] + " is needed." + ] + ] + attribute valueDatcat { + list { + xsd:anyURI { pattern = "\S+" }+ + } + }? +Tatt.datcat.attribute.targetDatcat = + [ + a:documentation [ + "provides a definition of, and/or general information about, information structure of an object referenced or modeled by the containing element, by reference to an external taxonomy or ontology. This attribute has the characteristics of the " + ns1:code [ "@datcat" ] + " attribute, except that it addresses not its containing element, but an object that is being referenced or modeled by its containing element." 
+ ] + ] + attribute targetDatcat { list { xsd:anyURI { pattern = "\S+" }+ } @@ -4466,6 +4492,7 @@ TtagUsage = element tagUsage { Tmacro.limitedContent, Tatt.global.attributes, + Tatt.datcat.attributes, [ a:documentation [ "(generic identifier) specifies the name (generic identifier) of the element indicated by the tag, within the namespace indicated by the parent " @@ -8291,7 +8318,7 @@ Tplatypus = TMenu = ## - element ns3:Menu { + element ns2:Menu { text, Tatt.global.attribute.xmlid, Tatt.global.rendition.attribute.rend, diff --git a/Test/expected-results/test33.rnc b/Test/expected-results/test33.rnc index 03e0dfcdb..eeda7dd0a 100644 --- a/Test/expected-results/test33.rnc +++ b/Test/expected-results/test33.rnc @@ -1,8 +1,7 @@ default namespace = "http://foo.foo/foo" namespace a = "http://relaxng.org/ns/compatibility/annotations/1.0" namespace ns1 = "http://www.w3.org/1999/xhtml" -namespace ns2 = "http://www.isocat.org/ns/dcr" -namespace ns3 = "http://blah.blah/blah" +namespace ns2 = "http://blah.blah/blah" namespace rng = "http://relaxng.org/ns/structure/1.0" namespace sch = "http://purl.oclc.org/dsdl/schematron" namespace tei = "http://www.tei-c.org/ns/1.0" @@ -406,19 +405,46 @@ tei_att.datable.attribute.period = } }? tei_att.datcat.attributes = - tei_att.datcat.attribute.datcat, tei_att.datcat.attribute.valueDatcat + tei_att.datcat.attribute.datcat, + tei_att.datcat.attribute.valueDatcat, + tei_att.datcat.attribute.targetDatcat tei_att.datcat.attribute.datcat = - - ## contains a PID (persistent identifier) that aligns the given element with the appropriate Data Category (or categories) in ISOcat. - attribute ns2:datcat { + [ + a:documentation [ + "provides a pointer to a definition of, and/or general information about, (a) an information container (element or attribute) or (b) a value of an information container (element content or attribute value), by referencing an external taxonomy or ontology. 
If " + ns1:code [ "@valueDatcat" ] + " is present in the immediate context, this attribute takes on role (a), while " + ns1:code [ "@valueDatcat" ] + " performs role (b)." + ] + ] + attribute datcat { list { xsd:anyURI { pattern = "\S+" }+ } }? tei_att.datcat.attribute.valueDatcat = - - ## contains a PID (persistent identifier) that aligns the content of the given element or the value of the given attribute with the appropriate simple Data Category (or categories) in ISOcat. - attribute ns2:valueDatcat { + [ + a:documentation [ + "provides a definition of, and/or general information about a value of an information container (element content or attribute value), by reference to an external taxonomy or ontology. Used especially where a contrast with " + ns1:code [ "@datcat" ] + " is needed." + ] + ] + attribute valueDatcat { + list { + xsd:anyURI { pattern = "\S+" }+ + } + }? +tei_att.datcat.attribute.targetDatcat = + [ + a:documentation [ + "provides a definition of, and/or general information about, information structure of an object referenced or modeled by the containing element, by reference to an external taxonomy or ontology. This attribute has the characteristics of the " + ns1:code [ "@datcat" ] + " attribute, except that it addresses not its containing element, but an object that is being referenced or modeled by its containing element." 
+ ] + ] + attribute targetDatcat { list { xsd:anyURI { pattern = "\S+" }+ } @@ -4223,6 +4249,7 @@ tei_tagUsage = element tei:tagUsage { tei_macro.limitedContent, tei_att.global.attributes, + tei_att.datcat.attributes, [ a:documentation [ "(generic identifier) specifies the name (generic identifier) of the element indicated by the tag, within the namespace indicated by the parent " @@ -5973,7 +6000,7 @@ foo_cit = attribute q { text }?, ## - attribute ns3:r { text }?, + attribute ns2:r { text }?, empty } start = tei_TEI diff --git a/Test/expected-results/test34.combined.json b/Test/expected-results/test34.combined.json index 1281b8e0d..27cd273cf 100644 --- a/Test/expected-results/test34.combined.json +++ b/Test/expected-results/test34.combined.json @@ -10269,7 +10269,8 @@ { "model" : [ ], "atts" : - [ "att.global" ], + [ "att.global", + "att.datcat" ], "unknown" : [ ] }, "attributes" : @@ -13870,8 +13871,8 @@ "type" : "classSpec", "module" : "tei", "desc" : - [ "provides attributes that are used to \n align XML elements or attributes with the appropriate Data Categories (DCs) defined by the ISO 12620:2009 \n standard and stored in the Web repository called ISOCat at .<\/desc>" ], - "shortDesc" : "provides attributes that are used to \n align XML elements or attributes with the appropriate Data Categories (DCs) defined by the ISO 12620:2009 \n standard and stored in the Web repository called ISOCat at .", + [ "provides attributes that are used to align XML\n elements or attributes with the appropriate Data Categories (DCs) defined by an external \n taxonomy, in this way establishing the identity of information containers and values, \n and providing means of interpreting them.<\/desc>" ], + "shortDesc" : "provides attributes that are used to align XML\n elements or attributes with the appropriate Data Categories (DCs) defined by an external \n taxonomy, in this way establishing the identity of information containers and values, \n and providing means of interpreting 
them.", "gloss" : [ ], "altIdent" : @@ -13881,11 +13882,11 @@ { "onElement" : false, "ident" : "datcat", "mode" : "add", - "ns" : "http:\/\/www.isocat.org\/ns\/dcr", + "ns" : "", "usage" : "opt", "desc" : - [ "contains a PID (persistent identifier) that aligns the given element with the appropriate \n Data Category (or categories) in ISOcat.<\/desc>" ], - "shortDesc" : "contains a PID (persistent identifier) that aligns the given element with the appropriate \n Data Category (or categories) in ISOcat.", + [ "provides a pointer to a definition of, and\/or general\n information about, (a) an information container (element or attribute) or (b) a value of an\n information container (element content or attribute value), by referencing an external\n taxonomy or ontology. If valueDatcat<\/att> is present in the immediate context, this attribute\n takes on role (a), while valueDatcat<\/att> performs role (b).<\/desc>" ], + "shortDesc" : "provides a pointer to a definition of, and\/or general\n information about, (a) an information container (element or attribute) or (b) a value of an\n information container (element content or attribute value), by referencing an external\n taxonomy or ontology. If valueDatcat is present in the immediate context, this attribute\n takes on role (a), while valueDatcat performs role (b).", "gloss" : [ ], "altIdent" : @@ -13901,11 +13902,31 @@ { "onElement" : false, "ident" : "valueDatcat", "mode" : "add", - "ns" : "http:\/\/www.isocat.org\/ns\/dcr", + "ns" : "", + "usage" : "opt", + "desc" : + [ "provides a definition of, and\/or general\n information about a value of an information container (element content or attribute value),\n by reference to an external taxonomy or ontology. 
Used especially where a contrast with\n datcat<\/att> is needed.<\/desc>" ], + "shortDesc" : "provides a definition of, and\/or general\n information about a value of an information container (element content or attribute value),\n by reference to an external taxonomy or ontology. Used especially where a contrast with\n datcat is needed.", + "gloss" : + [ ], + "altIdent" : + [ ], + "valDesc" : + [ ], + "datatype" : + { "min" : "1", + "max" : "unbounded", + "dataRef" : + { "key" : "teidata.pointer" } } }, + + { "onElement" : false, + "ident" : "targetDatcat", + "mode" : "add", + "ns" : "", "usage" : "opt", "desc" : - [ "contains a PID (persistent identifier) that aligns the content of the given\n element or the value of the given attribute with the appropriate simple Data\n Category (or categories) in ISOcat.<\/desc>" ], - "shortDesc" : "contains a PID (persistent identifier) that aligns the content of the given\n element or the value of the given attribute with the appropriate simple Data\n Category (or categories) in ISOcat.", + [ "provides a definition of, and\/or general\n information about, information structure of an object referenced or modeled by the\n containing element, by reference to an external taxonomy or ontology. This attribute has the\n characteristics of the datcat<\/att> attribute, except that it addresses not its containing\n element, but an object that is being referenced or modeled by its containing element.<\/desc>" ], + "shortDesc" : "provides a definition of, and\/or general\n information about, information structure of an object referenced or modeled by the\n containing element, by reference to an external taxonomy or ontology. 
This attribute has the\n characteristics of the datcat attribute, except that it addresses not its containing\n element, but an object that is being referenced or modeled by its containing element.", "gloss" : [ ], "altIdent" : diff --git a/Test/expected-results/test34.odd.html b/Test/expected-results/test34.odd.html index 4dd8f93ab..cd51f41d1 100644 --- a/Test/expected-results/test34.odd.html +++ b/Test/expected-results/test34.odd.html @@ -1189,10 +1189,7 @@

att.datcat : - provides attributes that are used to align XML elements or attributes with the appropriate Data Categories (DCs) defined by the ISO 12620:2009 standard and stored in the Web repository called ISOCat at - - http://www.isocat.org/ - .[ + provides attributes that are used to align XML elements or attributes with the appropriate Data Categories (DCs) defined by an external taxonomy, in this way establishing the identity of information containers and values, and providing means of interpreting them.[ 9.5.2. Lexical View 18.3. Other Atomic Feature Values] diff --git a/Test/expected-results/test34.rnc b/Test/expected-results/test34.rnc index c58e95853..e963d0f5d 100644 --- a/Test/expected-results/test34.rnc +++ b/Test/expected-results/test34.rnc @@ -3,7 +3,6 @@ namespace a = "http://relaxng.org/ns/compatibility/annotations/1.0" namespace bar = "http://blah.blah/bar" namespace blah = "http://blah.blah/blah" namespace ns1 = "http://www.w3.org/1999/xhtml" -namespace ns2 = "http://www.isocat.org/ns/dcr" namespace rng = "http://relaxng.org/ns/structure/1.0" namespace sch = "http://purl.oclc.org/dsdl/schematron" namespace tei = "http://www.tei-c.org/ns/1.0" @@ -407,19 +406,46 @@ tei_att.datable.attribute.period = } }? tei_att.datcat.attributes = - tei_att.datcat.attribute.datcat, tei_att.datcat.attribute.valueDatcat + tei_att.datcat.attribute.datcat, + tei_att.datcat.attribute.valueDatcat, + tei_att.datcat.attribute.targetDatcat tei_att.datcat.attribute.datcat = - - ## contains a PID (persistent identifier) that aligns the given element with the appropriate Data Category (or categories) in ISOcat. - attribute ns2:datcat { + [ + a:documentation [ + "provides a pointer to a definition of, and/or general information about, (a) an information container (element or attribute) or (b) a value of an information container (element content or attribute value), by referencing an external taxonomy or ontology. 
If " + ns1:code [ "@valueDatcat" ] + " is present in the immediate context, this attribute takes on role (a), while " + ns1:code [ "@valueDatcat" ] + " performs role (b)." + ] + ] + attribute datcat { list { xsd:anyURI { pattern = "\S+" }+ } }? tei_att.datcat.attribute.valueDatcat = - - ## contains a PID (persistent identifier) that aligns the content of the given element or the value of the given attribute with the appropriate simple Data Category (or categories) in ISOcat. - attribute ns2:valueDatcat { + [ + a:documentation [ + "provides a definition of, and/or general information about a value of an information container (element content or attribute value), by reference to an external taxonomy or ontology. Used especially where a contrast with " + ns1:code [ "@datcat" ] + " is needed." + ] + ] + attribute valueDatcat { + list { + xsd:anyURI { pattern = "\S+" }+ + } + }? +tei_att.datcat.attribute.targetDatcat = + [ + a:documentation [ + "provides a definition of, and/or general information about, information structure of an object referenced or modeled by the containing element, by reference to an external taxonomy or ontology. This attribute has the characteristics of the " + ns1:code [ "@datcat" ] + " attribute, except that it addresses not its containing element, but an object that is being referenced or modeled by its containing element." 
+ ] + ] + attribute targetDatcat { list { xsd:anyURI { pattern = "\S+" }+ } @@ -4228,6 +4254,7 @@ tei_tagUsage = element tagUsage { tei_macro.limitedContent, tei_att.global.attributes, + tei_att.datcat.attributes, [ a:documentation [ "(generic identifier) specifies the name (generic identifier) of the element indicated by the tag, within the namespace indicated by the parent " diff --git a/Test/expected-results/test35.rnc b/Test/expected-results/test35.rnc index 6120b9105..ca8737a3b 100644 --- a/Test/expected-results/test35.rnc +++ b/Test/expected-results/test35.rnc @@ -1,7 +1,6 @@ default namespace = "http://foo.foo/foo" namespace a = "http://relaxng.org/ns/compatibility/annotations/1.0" namespace ns1 = "http://www.w3.org/1999/xhtml" -namespace ns2 = "http://www.isocat.org/ns/dcr" namespace rng = "http://relaxng.org/ns/structure/1.0" namespace sch = "http://purl.oclc.org/dsdl/schematron" namespace tei = "http://www.tei-c.org/ns/1.0" @@ -405,19 +404,46 @@ tei_att.datable.attribute.period = } }? tei_att.datcat.attributes = - tei_att.datcat.attribute.datcat, tei_att.datcat.attribute.valueDatcat + tei_att.datcat.attribute.datcat, + tei_att.datcat.attribute.valueDatcat, + tei_att.datcat.attribute.targetDatcat tei_att.datcat.attribute.datcat = - - ## contains a PID (persistent identifier) that aligns the given element with the appropriate Data Category (or categories) in ISOcat. - attribute ns2:datcat { + [ + a:documentation [ + "provides a pointer to a definition of, and/or general information about, (a) an information container (element or attribute) or (b) a value of an information container (element content or attribute value), by referencing an external taxonomy or ontology. If " + ns1:code [ "@valueDatcat" ] + " is present in the immediate context, this attribute takes on role (a), while " + ns1:code [ "@valueDatcat" ] + " performs role (b)." + ] + ] + attribute datcat { list { xsd:anyURI { pattern = "\S+" }+ } }? 
tei_att.datcat.attribute.valueDatcat = - - ## contains a PID (persistent identifier) that aligns the content of the given element or the value of the given attribute with the appropriate simple Data Category (or categories) in ISOcat. - attribute ns2:valueDatcat { + [ + a:documentation [ + "provides a definition of, and/or general information about a value of an information container (element content or attribute value), by reference to an external taxonomy or ontology. Used especially where a contrast with " + ns1:code [ "@datcat" ] + " is needed." + ] + ] + attribute valueDatcat { + list { + xsd:anyURI { pattern = "\S+" }+ + } + }? +tei_att.datcat.attribute.targetDatcat = + [ + a:documentation [ + "provides a definition of, and/or general information about, information structure of an object referenced or modeled by the containing element, by reference to an external taxonomy or ontology. This attribute has the characteristics of the " + ns1:code [ "@datcat" ] + " attribute, except that it addresses not its containing element, but an object that is being referenced or modeled by its containing element." + ] + ] + attribute targetDatcat { list { xsd:anyURI { pattern = "\S+" }+ } @@ -4202,6 +4228,7 @@ tei_tagUsage = element tei:tagUsage { tei_macro.limitedContent, tei_att.global.attributes, + tei_att.datcat.attributes, [ a:documentation [ "(generic identifier) specifies the name (generic identifier) of the element indicated by the tag, within the namespace indicated by the parent " diff --git a/Test/expected-results/testdrama.compiled.xml b/Test/expected-results/testdrama.compiled.xml index ebeece4c2..4dfbc43c1 100644 --- a/Test/expected-results/testdrama.compiled.xml +++ b/Test/expected-results/testdrama.compiled.xml @@ -990,51 +990,244 @@ Elements 3.6.4. Dates and Times 13.4. 
Dates - - provides attributes that are used to - align XML elements or attributes with the appropriate Data Categories (DCs) defined by the ISO 12620:2009 - standard and stored in the Web repository called ISOCat at . - dcr:datacat属性とdcr:ValueDatacat属性を提供する。これらがXML要素や属性を連携させる日付分類は、国際標準ISO 12620:2009で定義されるものであり、にあるISOCatと呼ばれるWebリポジトリに格納されている。 + + provides attributes that are used to align XML + elements or attributes with the appropriate Data Categories (DCs) defined by an external + taxonomy, in this way establishing the identity of information containers and values, + and providing means of interpreting them. + datacat属性とValueDatacat属性を提供する.これらがXML要素や属性を連携させる日付分類は、国際標準ISO + 12620:2009で定義されるものであり,にあるISOCatと呼ばれるWebリポジトリに格納されている. - - contains a PID (persistent identifier) that aligns the given element with the appropriate - Data Category (or categories) in ISOcat. - 指定された要素をISOcatの適切なデータカテゴリ(または複数のデータカテゴリ)と対応付けるPID(永続的識別子)を含む。 - + + provides a pointer to a definition of, and/or general + information about, (a) an information container (element or attribute) or (b) a value of an + information container (element content or attribute value), by referencing an external + taxonomy or ontology. If valueDatcat is present in the immediate context, this attribute + takes on role (a), while valueDatcat performs role (b). + 指定された要素をISOcatの適切なデータカテゴリ(または複数のデータカテゴリ)と対応付けるPID(永続的識別子)を含む. + + + - - contains a PID (persistent identifier) that aligns the content of the given - element or the value of the given attribute with the appropriate simple Data - Category (or categories) in ISOcat. - 指定された要素の内容または指定された属性の値を、ISOcatの適切かつ単純なデータカテゴリ(または複数のデータカテゴリ)と対応付けるPID(永続的識別子)を含む。 - + + provides a definition of, and/or general + information about a value of an information container (element content or attribute value), + by reference to an external taxonomy or ontology. Used especially where a contrast with + datcat is needed. 
+ 指定された要素の内容または指定された属性の値を,ISOcatの適切かつ単純なデータカテゴリ(または複数のデータカテゴリ)と対応付けるPID(永続的識別子)を含む. + + + + + + provides a definition of, and/or general + information about, information structure of an object referenced or modeled by the + containing element, by reference to an external taxonomy or ontology. This attribute has the + characteristics of the datcat attribute, except that it addresses not its containing + element, but an object that is being referenced or modeled by its containing element. + + + -

In this example dcr:datcat relates the feature name to the data category - "partOfSpeech" and dcr:valueDatcat the feature value to the data category - "commonNoun". Both these data categories reside in the ISOcat DCR at www.isocat.org, which is the DCR used by ISO - TC37 and hosted by its registration authority, the MPI for Psycholinguistics in - Nijmegen.

- +

The example below presents the TEI encoding of the name-value pair + <part of speech, common + noun>, where the name (key) part + of speech is abbreviated as POS, and the value, common noun is symbolized by NN. The + entire name-value pair is encoded by means of the element f. In TEI XML, that element + acts as the container, labeled with the name attribute. Its contents may be complex + or simple. In the case at hand, the content is the symbol NN.

+

The datcat attribute relates the feature name (i.e., the key) to the data + category part of speech, while the attribute valueDatcat relates the feature value to the data category common noun. Both these data categories + should be defined in an external and preferably open reference taxonomy or ontology.

+ - + + + + +

NN is the symbol for common noun used e.g. in the CLAWS-7 tagset defined by the + University Centre for Computer Corpus Research on Language at the University of Lancaster. The + very same data category used for tagging an early version of the British National Corpus, and + coming from the BNC Basic + (C5) tagset, uses the symbol NN0 (rather than NN). Making these values semantically + interoperable would be extremely difficult without a human expert if they + were not anchored in a single point of an established reference taxonomy of morphosyntactic + data categories. In the case at hand, the string http://hdl.handle.net/11459/CCR_C-1256_7ec6083c-23d4-224d-6f94-eecbe6861545 is both + a persistent identifier of the data category in question and a pointer to a shared definition + of common noun.

+

While the symbols NN, NN0, and many others (often coming from languages other than + English) are implicitly members of the container category part of speech, it is sometimes + useful not to rely on such an implicit relationship but rather use an explicit identifier + for that data category, to distinguish it from other morphosyntactic data categories, such + as gender, tense, etc. For that purpose, the above example uses the datcat + attribute to reference a definition of + part of speech. The reference taxonomy in this example is the + CLARIN Concept + Registry.
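The markup described in the preceding paragraphs can be sketched roughly as follows. This is an illustrative reconstruction, not necessarily the exact example from the Guidelines: the valueDatcat URI is the one quoted in the text, whereas the datcat URI is a hypothetical placeholder standing in for the CLARIN Concept Registry entry for part of speech.

```xml
<fs>
  <!-- datcat: hypothetical placeholder URI (assumption) for the container category "part of speech" -->
  <!-- valueDatcat: the PID quoted in the text for the value category "common noun" -->
  <f name="POS"
     datcat="https://example.org/taxonomy/partOfSpeech"
     valueDatcat="http://hdl.handle.net/11459/CCR_C-1256_7ec6083c-23d4-224d-6f94-eecbe6861545">
    <symbol value="NN"/>
  </f>
</fs>
```

With both attributes on the same f element, datcat identifies the container (the feature name POS) and valueDatcat identifies its value (the symbol NN).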

+ +

If the feature structure markup exemplified above is to be repeated many times in a single + document, it is much more efficient to gather the persistent identifiers in a single place and to + only reference them, implicitly or directly, from feature structure markup. The following + example is much more concise than the one above and relies on the concepts of feature structure declaration and + feature value library, discussed in chapter . + + + + + +

+ +

The assumption here is that the relevant feature values are collected in a place that the + annotation document in question has access to — preferably, a single document per linguistic + resource, for example an fsdDecl that is XIncluded as a sibling of text or a + child of encodingDesc; a taxonomy available resource-wide (e.g., in a shared + header) is also an option.

+

The example below presents an fvLib element that collects the relevant feature values + (most of them omitted). At the same time, this example shows one way of encoding a + tagset, i.e., an established inventory of values of (in the case at hand) + morphosyntactic categories. + + + + + + +

+

Note that these Guidelines do not prescribe a specific choice between datcat and + valueDatcat in such cases. The former is the generic way of referencing a data + category, whereas the latter is more specific, in that it references a data category that + represents a value. The choice between them comes into play where a single element (or a + tight element complex, such as the f/symbol complex illustrated above) + makes it necessary or useful to distinguish between the container data category and its + value.

+
+ +

In the context of dictionaries designed with semantic interoperability in mind, the following + example ensures that the pos element is interpreted as the same information container + as in the case of the example of f name="POS" above.

+ + + NN + + +

Efficiency of this type of interoperable markup demands that the references to the particular + data categories should best be provided in a single place within the dictionary (or a single + place within the project), rather than being repeated inside every entry. For the container + elements, this can be achieved at the level of tagUsage, although here, the valueDatcat attribute should be used, because it is not the + tagUsage element that is associated with the relevant data category, but rather the element pos (or case, etc.) that is described by + tagUsage: + + + + Contains the part of speech. + Contains information about the grammatical case that the described form is inflected for. + + + + +

+

Another possibility is to shorten the URIs by means of the prefixDef mechanism, as illustrated below: + + + + + + + + +

+ isotope +
+ + adj + + +

+

This mechanism can carry implications that are not always wanted. In the case at + hand, it may suggest that the identifiers pos and adj belong to a namespace + associated with the CLARIN Concept Registry (CCR), whereas it is solely a shorthand + mechanism whose scope is the current resource. Documenting this clearly in the header of the + dictionary is therefore advised.
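A minimal sketch of such a shorthand is given here, under stated assumptions: the prefix ccr, the match pattern, and the replacement URI are all hypothetical placeholders chosen for illustration. Real registry PIDs are typically opaque, so a real project would more likely need one prefixDef (or one pattern) per identifier.

```xml
<listPrefixDef>
  <!-- "ccr" and the replacement URI below are assumptions, not real registry addresses -->
  <prefixDef ident="ccr"
             matchPattern="([A-Za-z]+)"
             replacementPattern="https://example.org/taxonomy/$1">
    <p>Pointers of the form ccr:xyz are a resource-local shorthand; they do not
       constitute a namespace maintained by the reference taxonomy.</p>
  </prefixDef>
</listPrefixDef>

<!-- elsewhere in the dictionary -->
<gramGrp>
  <pos datcat="ccr:pos" valueDatcat="ccr:adj">adj</pos>
</gramGrp>
```

The prose inside prefixDef is one place to document the local scope of the shorthand, as the paragraph above advises.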

+

Yet another possibility is to associate the information about the relationship between a TEI + markup element and the data category that it is intended to model already at the level of + modeling the dictionary resource, that is, at the level of the ODD, in an equiv element + that is a child of elementSpec or attDef.

+
+ +

The targetDatcat attribute is designed to be used in, e.g., feature structure + declarations, and is analogous to the targetLang attribute of the + att.pointing class, in that it describes the object that is being referenced, + rather than the referencing object.

+ + + part of speech (morphosyntactic category) + + + + + + + + +

Above, the fDecl uses targetDatcat, because if it were to use + datcat, it would be asserting that it is an instance of the container data + category part of speech, whereas it is not — it models a container (f) + that encodes a part of speech. Note also that it is the f that is modeled above, not + its values, which are used as direct references to data categories; hence the use of + datcat in the symbol element.
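The distinction drawn here can be sketched as follows. This is an illustrative reconstruction rather than the exact example from the Guidelines; the URI for part of speech and the second symbol (NP) with its URI are hypothetical placeholders, while the URI for common noun is the one quoted earlier in this section.

```xml
<!-- targetDatcat describes the f elements that this declaration models, not the fDecl itself -->
<fDecl name="POS" targetDatcat="https://example.org/taxonomy/partOfSpeech">
  <fDescr>part of speech (morphosyntactic category)</fDescr>
  <vRange>
    <vAlt>
      <!-- each symbol refers directly to a data category, hence datcat -->
      <symbol value="NN"
              datcat="http://hdl.handle.net/11459/CCR_C-1256_7ec6083c-23d4-224d-6f94-eecbe6861545"/>
      <!-- hypothetical second value with a placeholder URI -->
      <symbol value="NP" datcat="https://example.org/taxonomy/properNoun"/>
    </vAlt>
  </vRange>
</fDecl>
```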

- -

ISO 12620:2009 is a standard describing the data model and procedures for a Data - Category Registry (DCR). Data categories are defined as elementary descriptors in a - linguistic structure. In the DCR data model each data category gets assigned a - unique Peristent IDentifier (PID), i.e., an URI. Linguistic resources or preferably - their schemas that make use of data categories from a DCR should refer to them using - this PID. For XML-based resources, like TEI documents, ISO 12620:2009 normative - Annex A gives a small Data Category Reference XML vocabulary (also available online - at ), - which provides two attributes, dcr:datcat and - dcr:valueDatcat.

+ + +

The TEI Abstract Model can be expressed as a hierarchy of attribute-value matrices (AVMs) + of various types and of various levels of complexity, nested or grouped in various ways. At + the most abstract level, an AVM consists of an information container and the value + (contents) of that container.

+

A simple example of an XML serialization of such structures is, on the one hand, the opening + and closing tags that delimit and name the container, and, on the other, the content enclosed + by the two tags that constitutes the value. An analogous example is an + attribute name and the value of that attribute.

+

In a TEI XML example of two equivalent serializations expressing the name-value pair + <part-of-speech,common-noun>, namely + <pos>commonNoun</pos> and + pos="common-noun", one would + classify the element pos and the attribute pos as containers (mapping onto + the first member of the relevant name-value pair), while the character data + content of pos or the value of pos would be seen as + mapping onto the second member of the pair.

+

The att.datcat class provides means of addressing the containers + and their values, while at the same time providing a way to interpret them in the context of + external taxonomies or ontologies. Aligning e.g. both the pos element and the + pos attribute with the same value of an external reference point (i.e., an entry + in an agreed taxonomy) affirms the identity of the concept serialized by both the element container + and the attribute container, and optionally provides a definition of that concept (in the case at hand, + the concept part of speech).

+

The value of the att.datcat attributes should be a PID + (persistent identifier) that points to a specific — and, ideally, shared — taxonomy or + ontology. Among the resources that can, to a lesser or greater extent, be used as inventories + of (more or less) standardized linguistic categories are the GOLD ontology, CLARIN CCR, OLiA, or TermWeb's DatCatInfo, and also the Universal Dependencies inventory, on the + assumption that its URIs are going to persist. It is imaginable that a project may choose to + address a local taxonomy store instead, but this risks losing the advantage of + interchangeability with other projects.

+

Historically, datcat and valueDatcat originate from the (now obsolete) ISO + 12620:2009 standard, describing the data model and procedures for a Data Category Registry + (DCR). The current version of that standard, ISO 12620-1, does not standardize the + serialization of pointers, merely mentioning the TEI att.datcat as + an example.

+

Note that no constraint prevents the occurrence of a combination of + att.datcat attributes: + the fDecl element, which is a natural bearer of the targetDatcat + attribute, is an instance of a specific modeling element, and, in principle, could be + semantically fixed by an appropriate reference taxonomy of modeling devices.

+
+ +

ISO + 12620:2009は、データ分類のレジストリ(DCR)に関するデータモデルと手続きについての国際標準である.データ分類は,一つの言語の構造における基本的な記述子として定義される。DCRのデータモデルでは、個々のデータ分類には、ユニークな永続的識別子(PID)、つまり、URIが割り当てられる.DCRからデータ分類を利用する言語資源,あるいは、できることならそのスキーマは、このPIDを用いて参照すべきである。TEI文書のようなXMLベースの資源(にて参照可能)に関しては、ISO 12620:2009 付録 + Aに、datcat属性とvalueDatcat属性を提供する小さなデータ分類参照のXML語彙がある。

-

ISO 12620:2009は、データ分類のレジストリ(DCR)に関するデータモデルと手続きについての国際標準である。データ分類は、一つの言語の構造における基本的な記述子として定義される。DCRのデータモデルでは、個々のデータ分類には、ユニークな永続的識別子(PID)、つまり、URIが割り当てられる。DCRからデータ分類を利用する言語資源、あるいは、できることならそのスキーマは、このPIDを用いて参照すべきである。TEI文書のようなXMLベースの資源(にて参照可能)に関しては、ISO 12620:2009 付録 Aに、dcr:datcat属性とdcr:valueDatcat属性を提供する小さなデータ分類参照のXML語彙がある。

9.5.2. Lexical View 18.3. Other Atomic Feature Values @@ -17747,7 +17940,7 @@ creation of an electronic text. -

The measure element may be used to supply normalised +

The measure element may be used to supply normalized or machine tractable versions of the size or sizes concerned.

@@ -19984,6 +20177,7 @@ On , either the @marks attribute should be used, or a paragraph of descri elemento all'interno di un testo. + diff --git a/Test2/expected-results/testPure1.rng b/Test2/expected-results/testPure1.rng index 2c58f0e3f..7e18b14f1 100644 --- a/Test2/expected-results/testPure1.rng +++ b/Test2/expected-results/testPure1.rng @@ -539,11 +539,12 @@ Sample values include: 1] all; 2] most; 3] range + - - contains a PID (persistent identifier) that aligns the given element with the appropriate Data Category (or categories) in ISOcat. + + provides a pointer to a definition of, and/or general information about, (a) an information container (element or attribute) or (b) a value of an information container (element content or attribute value), by referencing an external taxonomy or ontology. If @valueDatcat is present in the immediate context, this attribute takes on role (a), while @valueDatcat performs role (b). @@ -556,8 +557,22 @@ Sample values include: 1] all; 2] most; 3] range - - contains a PID (persistent identifier) that aligns the content of the given element or the value of the given attribute with the appropriate simple Data Category (or categories) in ISOcat. + + provides a definition of, and/or general information about a value of an information container (element content or attribute value), by reference to an external taxonomy or ontology. Used especially where a contrast with @datcat is needed. + + + + \S+ + + + + + + + + + + provides a definition of, and/or general information about, information structure of an object referenced or modeled by the containing element, by reference to an external taxonomy or ontology. This attribute has the characteristics of the @datcat attribute, except that it addresses not its containing element, but an object that is being referenced or modeled by its containing element. 
@@ -6309,6 +6324,7 @@ On , either the @marks attribute should be used, or a paragraph of descri (element usage) documents the usage of a specific element within a specified document. [2.3.4. The Tagging Declaration] + (generic identifier) specifies the name (generic identifier) of the element indicated by the tag, within the namespace indicated by the parent <namespace> element. diff --git a/source/p5subset.xml b/source/p5subset.xml index faecbf583..c68a40674 100644 --- a/source/p5subset.xml +++ b/source/p5subset.xml @@ -7,7 +7,7 @@ Version 4.5.0a. Last updated on - 22nd October 2022, revision 1427c0adb + 25th October 2022, revision 5406e8d10 TEI Consortium @@ -1057,7 +1057,7 @@ of any adjacent whitespace μηνὶ Ἰουνίου εἰς κ ἔτους ςωϞ -

In this example, a date is given in a Mediaeval text measured "from the creation of the world", which is normalised +

In this example, a date is given in a Mediaeval text measured from the creation of the world, which is normalized (in when) to the Gregorian date, but is also normalized (in when-custom) to a machine-actionable, numeric version of the date from the Creation.

@@ -1421,51 +1421,244 @@ of any adjacent whitespace
- - provides attributes that are used to - align XML elements or attributes with the appropriate Data Categories (DCs) defined by the ISO 12620:2009 - standard and stored in the Web repository called ISOCat at . - dcr:datacat属性とdcr:ValueDatacat属性を提供する。これらがXML要素や属性を連携させる日付分類は、国際標準ISO 12620:2009で定義されるものであり、にあるISOCatと呼ばれるWebリポジトリに格納されている。 + + provides attributes that are used to align XML + elements or attributes with the appropriate Data Categories (DCs) defined by an external + taxonomy, in this way establishing the identity of information containers and values, + and providing means of interpreting them. + datacat属性とValueDatacat属性を提供する.これらがXML要素や属性を連携させる日付分類は、国際標準ISO + 12620:2009で定義されるものであり,にあるISOCatと呼ばれるWebリポジトリに格納されている. - - contains a PID (persistent identifier) that aligns the given element with the appropriate - Data Category (or categories) in ISOcat. - 指定された要素をISOcatの適切なデータカテゴリ(または複数のデータカテゴリ)と対応付けるPID(永続的識別子)を含む。 - + + provides a pointer to a definition of, and/or general + information about, (a) an information container (element or attribute) or (b) a value of an + information container (element content or attribute value), by referencing an external + taxonomy or ontology. If valueDatcat is present in the immediate context, this attribute + takes on role (a), while valueDatcat performs role (b). + 指定された要素をISOcatの適切なデータカテゴリ(または複数のデータカテゴリ)と対応付けるPID(永続的識別子)を含む. + + + - - contains a PID (persistent identifier) that aligns the content of the given - element or the value of the given attribute with the appropriate simple Data - Category (or categories) in ISOcat. - 指定された要素の内容または指定された属性の値を、ISOcatの適切かつ単純なデータカテゴリ(または複数のデータカテゴリ)と対応付けるPID(永続的識別子)を含む。 - + + provides a definition of, and/or general + information about a value of an information container (element content or attribute value), + by reference to an external taxonomy or ontology. Used especially where a contrast with + datcat is needed. 
+ 指定された要素の内容または指定された属性の値を,ISOcatの適切かつ単純なデータカテゴリ(または複数のデータカテゴリ)と対応付けるPID(永続的識別子)を含む. + + + + + + provides a definition of, and/or general + information about, information structure of an object referenced or modeled by the + containing element, by reference to an external taxonomy or ontology. This attribute has the + characteristics of the datcat attribute, except that it addresses not its containing + element, but an object that is being referenced or modeled by its containing element. + + + -

In this example dcr:datcat relates the feature name to the data category - "partOfSpeech" and dcr:valueDatcat the feature value to the data category - "commonNoun". Both these data categories reside in the ISOcat DCR at www.isocat.org, which is the DCR used by ISO - TC37 and hosted by its registration authority, the MPI for Psycholinguistics in - Nijmegen.

- +

The example below presents the TEI encoding of the name-value pair + <part of speech, common + noun>, where the name (key) part + of speech is abbreviated as POS, and the value, common noun is symbolized by NN. The + entire name-value pair is encoded by means of the element f. In TEI XML, that element + acts as the container, labeled with the name attribute. Its contents may be complex + or simple. In the case at hand, the content is the symbol NN.

+

The datcat attribute relates the feature name (i.e., the key) to the data + category part of speech, while the attribute valueDatcat relates the feature value to the data category common noun. Both these data categories + should be defined in an external and preferably open reference taxonomy or ontology.

+ - + + + + +

NN is the symbol for common noun used e.g. in the CLAWS-7 tagset defined by the + University Centre for Computer Corpus Research on Language at the University of Lancaster. The + very same data category used for tagging an early version of the British National Corpus, and + coming from the BNC Basic + (C5) tagset, uses the symbol NN0 (rather than NN). Making these values semantically + interoperable would be extremely difficult without a human expert if they + were not anchored in a single point of an established reference taxonomy of morphosyntactic + data categories. In the case at hand, the string http://hdl.handle.net/11459/CCR_C-1256_7ec6083c-23d4-224d-6f94-eecbe6861545 is both + a persistent identifier of the data category in question and a pointer to a shared definition + of common noun.

+

While the symbols NN, NN0, and many others (often coming from languages other than + English) are implicitly members of the container category part of speech, it is sometimes + useful not to rely on such an implicit relationship but rather use an explicit identifier + for that data category, to distinguish it from other morphosyntactic data categories, such + as gender, tense, etc. For that purpose, the above example uses the datcat + attribute to reference a definition of + part of speech. The reference taxonomy in this example is the + CLARIN Concept + Registry.

+ +

If the feature structure markup exemplified above is to be repeated many times in a single + document, it is much more efficient to gather the persistent identifiers in a single place and to + only reference them, implicitly or directly, from feature structure markup. The following + example is much more concise than the one above and relies on the concepts of feature structure declaration and + feature value library, discussed in chapter . + + + + + +

+ +

The assumption here is that the relevant feature values are collected in a place that the + annotation document in question has access to — preferably, a single document per linguistic + resource, for example an fsdDecl that is XIncluded as a sibling of text or a + child of encodingDesc; a taxonomy available resource-wide (e.g., in a shared + header) is also an option.

+

The example below presents an fvLib element that collects the relevant feature values + (most of them omitted). At the same time, this example shows one way of encoding a + tagset, i.e., an established inventory of values of (in the case at hand) + morphosyntactic categories. + + + + + + +

+

Note that these Guidelines do not prescribe a specific choice between datcat and + valueDatcat in such cases. The former is the generic way of referencing a data + category, whereas the latter is more specific, in that it references a data category that + represents a value. The choice between them comes into play where a single element (or a + tight element complex, such as the f/symbol complex illustrated above) + makes it necessary or useful to distinguish between the container data category and its + value.

+
+ +

In the context of dictionaries designed with semantic interoperability in mind, the following + example ensures that the pos element is interpreted as the same information container + as in the case of the example of f name="POS" above.

+ + + NN + + +

Efficiency of this type of interoperable markup demands that the references to the particular + data categories should best be provided in a single place within the dictionary (or a single + place within the project), rather than being repeated inside every entry. For the container + elements, this can be achieved at the level of tagUsage, although here, the valueDatcat attribute should be used, because it is not the + tagUsage element that is associated with the relevant data category, but rather the element pos (or case, etc.) that is described by + tagUsage: + + + + Contains the part of speech. + Contains information about the grammatical case that the described form is inflected for. + + + + +

+

Another possibility is to shorten the URIs by means of the prefixDef mechanism, as illustrated below: + + + + + + + + +

+ isotope +
+ + adj + + +

+

This mechanism can carry implications that are not always wanted. In the case at + hand, it may suggest that the identifiers pos and adj belong to a namespace + associated with the CLARIN Concept Registry (CCR), whereas it is solely a shorthand + mechanism whose scope is the current resource. Documenting this clearly in the header of the + dictionary is therefore advised.

+

Yet another possibility is to associate the information about the relationship between a TEI + markup element and the data category that it is intended to model already at the level of + modeling the dictionary resource, that is, at the level of the ODD, in an equiv element + that is a child of elementSpec or attDef.

+
+ +

The targetDatcat attribute is designed to be used in, e.g., feature structure + declarations, and is analogous to the targetLang attribute of the + att.pointing class, in that it describes the object that is being referenced, + rather than the referencing object.

+ + + part of speech (morphosyntactic category) + + + + + + + + +

Above, the fDecl uses targetDatcat, because if it were to use + datcat, it would be asserting that it is an instance of the container data + category part of speech, whereas it is not — it models a container (f) + that encodes a part of speech. Note also that it is the f that is modeled above, not + its values, which are used as direct references to data categories; hence the use of + datcat in the symbol element.

- -

ISO 12620:2009 is a standard describing the data model and procedures for a Data - Category Registry (DCR). Data categories are defined as elementary descriptors in a - linguistic structure. In the DCR data model each data category gets assigned a - unique Peristent IDentifier (PID), i.e., an URI. Linguistic resources or preferably - their schemas that make use of data categories from a DCR should refer to them using - this PID. For XML-based resources, like TEI documents, ISO 12620:2009 normative - Annex A gives a small Data Category Reference XML vocabulary (also available online - at ), - which provides two attributes, dcr:datcat and - dcr:valueDatcat.

+ + +

The TEI Abstract Model can be expressed as a hierarchy of attribute-value matrices (AVMs) + of various types and of various levels of complexity, nested or grouped in various ways. At + the most abstract level, an AVM consists of an information container and the value + (contents) of that container.

+

A simple example of an XML serialization of such structures is, on the one hand, the opening + and closing tags that delimit and name the container, and, on the other, the content enclosed + by the two tags that constitutes the value. An analogous example is an + attribute name and the value of that attribute.

+

In a TEI XML example of two equivalent serializations expressing the name-value pair + <part-of-speech,common-noun>, namely + <pos>commonNoun</pos> and + pos="common-noun", one would + classify the element pos and the attribute pos as containers (mapping onto + the first member of the relevant name-value pair), while the character data + content of pos or the value of pos would be seen as + mapping onto the second member of the pair.

+

The att.datcat class provides means of addressing the containers + and their values, while at the same time providing a way to interpret them in the context of + external taxonomies or ontologies. Aligning e.g. both the pos element and the + pos attribute with the same value of an external reference point (i.e., an entry + in an agreed taxonomy) affirms the identity of the concept serialized by both the element container + and the attribute container, and optionally provides a definition of that concept (in the case at hand, + the concept part of speech).

+

The value of the att.datcat attributes should be a PID + (persistent identifier) that points to a specific — and, ideally, shared — taxonomy or + ontology. Among the resources that can, to a lesser or greater extent, be used as inventories + of (more or less) standardized linguistic categories are the GOLD ontology, CLARIN CCR, OLiA, or TermWeb's DatCatInfo, and also the Universal Dependencies inventory, on the + assumption that its URIs are going to persist. It is imaginable that a project may choose to + address a local taxonomy store instead, but this risks losing the advantage of + interchangeability with other projects.

+

Historically, datcat and valueDatcat originate from the (now obsolete) ISO + 12620:2009 standard, describing the data model and procedures for a Data Category Registry + (DCR). The current version of that standard, ISO 12620-1, does not standardize the + serialization of pointers, merely mentioning the TEI att.datcat as + an example.

+

Note that no constraint prevents the occurrence of a combination of + att.datcat attributes: + the fDecl element, which is a natural bearer of the targetDatcat + attribute, is an instance of a specific modeling element, and, in principle, could be + semantically fixed by an appropriate reference taxonomy of modeling devices.

+
+ +

ISO + 12620:2009は、データ分類のレジストリ(DCR)に関するデータモデルと手続きについての国際標準である.データ分類は,一つの言語の構造における基本的な記述子として定義される。DCRのデータモデルでは、個々のデータ分類には、ユニークな永続的識別子(PID)、つまり、URIが割り当てられる.DCRからデータ分類を利用する言語資源,あるいは、できることならそのスキーマは、このPIDを用いて参照すべきである。TEI文書のようなXMLベースの資源(にて参照可能)に関しては、ISO 12620:2009 付録 + Aに、datcat属性とvalueDatcat属性を提供する小さなデータ分類参照のXML語彙がある。

-

ISO 12620:2009は、データ分類のレジストリ(DCR)に関するデータモデルと手続きについての国際標準である。データ分類は、一つの言語の構造における基本的な記述子として定義される。DCRのデータモデルでは、個々のデータ分類には、ユニークな永続的識別子(PID)、つまり、URIが割り当てられる。DCRからデータ分類を利用する言語資源、あるいは、できることならそのスキーマは、このPIDを用いて参照すべきである。TEI文書のようなXMLベースの資源(にて参照可能)に関しては、ISO 12620:2009 付録 Aに、dcr:datcat属性とdcr:valueDatcat属性を提供する小さなデータ分類参照のXML語彙がある。

@@ -9282,7 +9475,7 @@ creation of an electronic text. -

The measure element may be used to supply normalised +

The measure element may be used to supply normalized or machine tractable versions of the size or sizes concerned.

@@ -11597,6 +11790,7 @@ On , either the @marks attribute should be used, or a paragraph of descri elemento all'interno di un testo. + @@ -36701,78 +36895,86 @@ list of values. Typical values might be IPA, - classifies the grammatical information given according to some convenient typology—in - the case of terminological information, preferably the dictionary of data element types - specified in ISO 12620. - 다양한 유형에 따라 문법 정보를 분류한다. - 전문용어 정보의 경우 가급적 ISO WD 12 - 620에서 명시한 데이터 요소 유형의 사전을 따른다. - 用合適的分類方法將不同的文法資訊加以分類-若是專門術語,可參照 ISO 12620 - 所指定資料元素類別。 + classifies grammatical information provided by gram + according to some convenient and shared typology, ideally one defined in an external + reference taxonomy, such as the CLARIN Concept + Registry. + 다양한 유형에 따라 문법 정보를 분류한다. - 전문용어 정보의 경우 가급적 ISO WD + 12 620에서 명시한 데이터 요소 유형의 사전을 따른다. + 用合適的分類方法將不同的文法資訊加以分類-若是專門術語,可參照 ISO 12620 所指定資料元素類別。 当該文法情報の分類を示す。用語を示す場合、ISO 12620に従うこ とが望ましい。 - classe l'information grammaticale fournie selon une - typologie particulière : dans le cas d'informations terminologiques, de préférence au moyen - du dictionnaire des types d'éléments de données spécifiés dans la norme ISO 12620. - clasifica la información gramatical dada de acuerdo a - una tipología funcional — en el caso de información terminológica, preferiblemente el - diccionario de tipos de elemento de datos se especifica en ISO 12620. + classe l'information grammaticale fournie selon + une typologie particulière : dans le cas d'informations terminologiques, de préférence au + moyen du dictionnaire des types d'éléments de données spécifiés dans la norme ISO 12620. + clasifica la información gramatical dada de + acuerdo a una tipología funcional — en el caso de información terminológica, preferiblemente + el diccionario de tipos de elemento de datos se especifica en ISO 12620. 
classifica le informazioni grammaticali secondo un'appropriata tipologia - nel caso di informazioni terminologiche, preferibilmente il - dizionario di degli elementi specificati in ISO 12620. - + dizionario di degli elementi specificati in ISO + 12620. + + + part of speech 품사 - partie du discours (chacune des classes de mot - auxquelles un mot peut appartenir dans une langue donnée, en fonction de la forme, du - sens ou d'une combinaison de caractéristiques, par exemple nom, verbe, adjectif, etc.) + partie du discours (chacune des classes de + mot auxquelles un mot peut appartenir dans une langue donnée, en fonction de la forme, + du sens ou d'une combinaison de caractéristiques, par exemple nom, verbe, adjectif, + etc.) parte del discorso - parte del discurso (cualquiera de las categorías - de palabras que se puede encontrar en una lengua dada, basándose en la forma, + parte del discurso (cualquiera de las + categorías de palabras que se puede encontrar en una lengua dada, basándose en la forma, significado, o combinación de rasgos, p.ej. nombre, verbo, adjetivo, etc.) - any of the word classes to which a word may be assigned in a given language, based - on form, meaning, or a combination of features, e.g. noun, verb, adjective, etc. - 형식, 의미, 또는 자질 결합에 근거하여 단어가 제시된 언어에서 할당될 수 있는 단어 - 부류, 예, 명사, 동사, 형용사 등 + any of the word classes to which a word may + be assigned in a given language, based on form, meaning, or a combination of features, + e.g. noun, verb, adjective, etc. + 형식, 의미, 또는 자질 결합에 근거하여 단어가 제시된 언어에서 할당될 수 있는 + 단어 부류, 예, 명사, 동사, 형용사 등 詞性 (在已知語言中的詞性分類,根據字詞的形式、意義或特性組合而有所不同。例如名詞、動詞、形容詞等。) - cualquier clase de palabras a las que se puede - asignar una palabra en una lengua dada, basándose en la forma, significado, o una + cualquier clase de palabras a las que se + puede asignar una palabra en una lengua dada, basándose en la forma, significado, o una combinación de las características, p.ej. sustantivo, verbo, adjetivo, etc. 
- 当該語に関する分類。形、意味、素性の組み合わせなどによる。 例えば、名詞、動詞、形容詞など。 - toute catégorie grammaticale à laquelle un mot - peut être assigné dans une langue donnée, qui repose sur la morphologie, la sémantique, - ou une combinaison de mots du discours, par exemple le nom, le verbe, l'adjectif, etc. - parte del discorso (una qualsiasi delle classi a - cui una parola può essere assegnata in una data lingua, basata sulla forma, il + 当該語に関する分類。形、意味、素性の組み合わせなどによる。 + 例えば、名詞、動詞、形容詞など。 + toute catégorie grammaticale à laquelle un + mot peut être assigné dans une langue donnée, qui repose sur la morphologie, la + sémantique, ou une combinaison de mots du discours, par exemple le nom, le verbe, + l'adjectif, etc. + parte del discorso (una qualsiasi delle + classi a cui una parola può essere assegnata in una data lingua, basata sulla forma, il significato o una combinazione di tratti, ad esempio nome, verbo, aggettivo, - ecc.) + ecc.) gender - genre (classification formelle selon laquelle - les noms et pronoms et souvent les qualificatifs associés sont groupés et fléchis, ou - prennent une autre forme afin d'exprimer certaines relations syntaxiques) + genre (classification formelle selon + laquelle les noms et pronoms et souvent les qualificatifs associés sont groupés et + fléchis, ou prennent une autre forme afin d'exprimer certaines relations + syntaxiques) genere género (clasificación formal para nombres y pronombres, y a menudo para modificadores que los acompañan, aparecen agrupados y flexionan o cambian de forma del mismo modo, y permiten controlar las relaciones sintácticas) - formal classification by which nouns and pronouns, and often accompanying modifiers, - are grouped and inflected, or changed in form, so as to control certain syntactic - relationships - 특정 통사적 관련성을 통제하기 위해, 명사와 대명사, 그리고 종종 동반 수식어의 그룹을 - 나누는, 굴절 또는 형태 변화에 따른 형식적 분류 + formal classification by which nouns and + pronouns, and often accompanying modifiers, are grouped and inflected, or changed in + form, so as to control 
certain syntactic relationships + 특정 통사적 관련성을 통제하기 위해, 명사와 대명사, 그리고 종종 동반 수식어의 + 그룹을 나누는, 굴절 또는 형태 변화에 따른 형식적 분류 性別 (名詞、代名詞、或伴隨的修飾詞透過這樣的規則系統在表現形式上被加以分組、或產生屈折變化,以展現特定的語法關係。) - clasificación formal para la cual los sustantivos - y los pronombres, y a menudo sus modificantes, son agrupados y jerarquiados, o variados - en la forma, para controlar enlaces sintácticos + clasificación formal para la cual los + sustantivos y los pronombres, y a menudo sus modificantes, son agrupados y jerarquiados, + o variados en la forma, para controlar enlaces sintácticos 文法上の性。時に修飾詞を伴う名詞や代名詞がまとまり、屈折、す なわち語形変形が行われ、ある統語的関係が示される。 - classification formelle par lequel des noms et - des pronoms souvent avec des modificateurs, est groupée et conjuguée, ou changée + classification formelle par lequel des noms + et des pronoms souvent avec des modificateurs, est groupée et conjuguée, ou changée morphologiquement, afin de contrôler certaines relations syntaxiques genere (classificazione formale secondo cui i nomi e i pronomi, e spesso i modificatori che li accompagnano, sono raggruppati e @@ -36781,14 +36983,17 @@ list of values. Typical values might be IPA, number - nombre (par exemple singulier, pluriel, duel, - etc.) + nombre (par exemple singulier, pluriel, + duel, etc.) numero - número (p.ej. individual, plural, dual, ...) - grammatical number, e.g. singular, plural, dual, … + número (p.ej. individual, plural, dual, + ...) + grammatical number, e.g. singular, plural, + dual, … 문법적 수. 예, 단수, 복수, 이중, … 單複數 (例如單數、複數、雙數詞等) - número gramatical, p.ej. singular, plural, dual,… + número gramatical, p.ej. singular, plural, + dual,… 文法上の数。単数、複数、双数など。 nombre grammatical, par exemple singulier, pluriel, duel, … @@ -36814,24 +37019,18 @@ list of values. Typical values might be IPA, nome proprio o nome comune - -

A much fuller list of values for the type - attribute may be generated from the data category registry - accessible from and - maintained by ISO technical committee 37.

+ +

The list of values for the type attribute should best come from a standardized set of container data categories.

Une liste de valeurs beaucoup plus complète pour l'attribut type peut être produite à partir du dictionnaire des types d'éléments de données en préparation sous la - référence ISO TC 37/SC 3/WD 12 620, Computational Aids in Terminology. Voir l'ISO 12620 - pour plus de détails.

+ référence ISO TC 37/SC 3/WD 12 620, Computational Aids in Terminology. Voir l'ISO 12620 pour plus de détails.

属性typeに付与される値の全リストは、 ISO TC 37/SC 3が策定したWD 12620 Computational Aids in - Terminologyにあるデータ要素型から得られるだろう。 詳細は、ISO 12620 Computer applications in terminology — Data - categories — を参照のこと。

+ Terminologyにあるデータ要素型から得られるだろう。 詳細は、ISO 12620 + Computer applications in terminology — Data categories — を参照のこと。

@@ -53005,7 +53204,7 @@ specification.

composed of two alloys of gold with a lighter 18.4 karat shade being used for the face and neck while a heavier 22.5 karat gold was used for the rest of the mask.

In the mask Tutankhamun wears a nemes headcloth which has the royal insignia of a cobra (Wadjet) and vulture - (Nekhbet) on it. These are thought respectively to symbolise Tutankhamun's rule of both Lower Egypt and Upper + (Nekhbet) on it. These are thought respectively to symbolize Tutankhamun's rule of both Lower Egypt and Upper Egypt. His ears are pierced for earrings. The mask has rich inlays of coloured glass and gemstones, including lapis lazuli surrounding the eye and eyebrows, quartz for the eyes, obsidian for the pupils. The broad collar is made up of carnelian, feldspar, turquoise, amazonite, faience and other stones.

@@ -59329,6 +59528,7 @@ Feature Value + @@ -59594,9 +59794,7 @@ range of allowed values, and optionally its default value. declara un único rasgo, indicando su nombre, organización, gama de valores posibles y, opcionalmente, su valor predeterminado. dichiara un singolo tratto, specificandone il nome, l'organizzazione, la gamma di valori permessi, e opzionale il suo valore predefinito. -