This document is meant to serve as a reference for the encoding of PressMint corpora of historical newspapers. In order for the PressMint corpora to be interoperable (i.e. so that the same scripts can be used to process them), their structure is fairly rigid, primarily in terms of file names and folder structure, and, partially, their TEI XML encoding. This is not to say that all the corpora have to contain exactly the same information because we distinguish obligatory information, which all the corpora should contain, from that which is optional, and present only in the corpora for which it has been possible to gather it from the corpus sources.
This document is a modification of the ParlaMint encoding guidelines, which are a customisation the TEI Guidelines. But while ParlaMint specifies many reguirements on the structures of the docuemnts and obligatory data and metadata, PressMint makes only minimal requirements for the purposes of interoperability although leaves considerable space for optional extensions.
The rest of these recommendations are structured as follows:
The <teiHeader> of a corpus component (further detailed in the Section on Corpus metadata) contains the metadata specific for this component (along with some redundant metadata about its provenance), and which should be unique in the corpus, i.e. the corpus component metadata should distinguish it from all the other components of the corpus.
The fact that a corpus is one XML document does not mean that it is also stored in one file. In fact, PressMint requires that each corpus component is stored in a separate file, with the corpus root, i.e. the top-level <teiCorpus>, also stored as one file.
PressMint has strict rules on how to name the various files that constitute a corpus, and how to collect them in directories.
The file names have the the following structure:
PressMint-
, followed by the ISO 3166 country code (cf. Section on Standard values) of the country whose team is contributing the corpus, e.g. PressMint-SI.xml
.PressMint-SI_1899-04-16.xml
. In case the exact date of the publications is unknown, only the month or year can be given, e.g. PressMint-SI_1899-04.xml
or PressMint-SI_1899.xml
.PressMint-SI_1899_KRN-NUK.xml
. This extra suffix can encode an abbreviation of the newspaper's name, and serve to distinguish two different newspapers that were published on the same day, or distunguish sources from where a newspaper was obtained.PressMint-SI-listBil.xml
for the list of bibliographic items (newspapers) that represent the sources of the Slovenian corpus. In case there are more files for instances with the same element name, as is the case for taxonomies, the filename should end with another hypen, followed by the a distinguishing suffix, e.g. PressMint-SI-taxonomy-topic.xml
.PressMint-SI_1899_KRN-NUK.txt
; this is further explained in the Section on Conversions..ana
on the corpus root and components, e.g. PressMint-SI.ana.xml
or PressMint-SI_1899_KRN-NUK.ana.xml
.For distribution the complete XML corpus should be stored in a directory that has the same name prefix as the corpus root file and extended with the format (e.g. TEI
). The directory then contains the corpus root file (and its metadata files, if such exists), while the corpus components should be in subdirectories, one per year, for example:
PressMint-SI.TEI/PressMint-SI.xml
PressMint-SI.TEI/PressMint-SI-taxonomy-topic.xml
...
PressMint-SI.TEI/1899/PressMint-SI_1899-01-02.xml
PressMint-SI.TEI/1899/PressMint-SI_1899-01-03.xml
PressMint-SI.TEI/1899/PressMint-SI_1899-01-04.xml
...
PressMint-SI.TEI/1900/PressMint-SI_1900-01-02.xml
PressMint-SI.TEI/1900/PressMint-SI_1900-01-03.xml
PressMint-SI.TEI/1900/PressMint-SI_1900-01-04.xml
...
⚓
The lingistically annotated version of the corpus is stored separately, with the main directory and, as mentioned, the corpus root and component filenames having the additional suffix .ana
, e.g.
PressMint-SI.TEI.ana/PressMint-SI.ana.xml
PressMint-SI.TEI.ana/PressMint-SI-taxonomy-topic.xml
...
PressMint-SI.TEI.ana/1899/PressMint-SI_1899-01-02.ana.xml
PressMint-SI.TEI.ana/1899/PressMint-SI_1899-01-03.ana.xml
PressMint-SI.TEI.ana/1899/PressMint-SI_1899-01-04.ana.xml
...
PressMint-SI.TEI.ana/1900/PressMint-SI_1900-01-02.ana.xml
PressMint-SI.TEI.ana/1900/PressMint-SI_1900-01-03.ana.xml
PressMint-SI.TEI.ana/1900/PressMint-SI_1900-01-04.ana.xml
...
⚓
This section gives some general requirements a PressMint corpus has to meet, in particular those relating to the characters in a corpus, and the use of standards. It also details the structure of the file names of the PressMint root and component files, as well as the attributes expected on the <teiCorpus> and <TEI> tags.
The corpus should be encoded in Unicode, using the UTF-8 character encoding, at least for European languages. In cases where the original contains characters from the Unicode Private Use Area, these should, if possible, be given their closest Unicode equivalents or substituted by the Unicode replacement character U+FFFD. End-of-line hyphens, if present in the source files, should be removed, and the split words joined in order to enhance searching the corpus and to simplify linguistic processing.
The following characters, esp. prevalent when the source documents were in Word or HTML, deserve special mention:
Text-bearing elements should also not start or end with space characters, and sequences of whitespace characters should be changed into a single space.
Whenever possible, PressMint uses standards for information coding. In particular, the following information must be standardised:
The Chapter on Overall corpus structure introduced the top level elements of the corpus root file and of the component files (i.e. the <teiCorpus> and <TEI> elements), but did not elaborate on their attributes; these are presented in this section.
The PressMint encoding can use pointing attributes for various purposes, e.g. for references to the IDs of the facsimile elements.
While a few elements have dedicated pointing attributes, there are some generally used ones. They share the characteristics that they can all be used by a number of different elements and that their value is a series of pointers, i.e. a white-space delimited sequence of references to the values of some xml:id attribute in the corpus or, in general, to an URI. The attributes are:
PressMint makes use of temporal information, in particular to encode when a newspaper was published. As mentioned in the Section on Standard values, the ISO 8601 format should be used to specify the dates or times.
The following attributes are used to specify temporal information:
As mentioned, <teiCorpus> and <TEI> elements contain the obligatory <teiHeader> element, which stores the metadata to the corpus root or component. In this section we explain and give examples of the required and optional metadata that is contained in the <teiHeader>, proceeding through its various elements, and there distinguishing which parts and what content is appropriate for the corpus root, and which for a corpus component.
As a general remark, most metadata contains free text, and it is a requirement of PressMint that this data is given in the English language, to help researchers for other countries to understand it, and it is recommended to also give it in the local language in which the (main portion of) newspapers is written, for a local researcher to be able to use it in their native tongue.
The title statement, <titleStmt> gives the title of the corpus root or component, along with the specification of the particular session(s) of the parliament contained, the persons responsible for compiling the corpus, and the funder(s) of the project.
The main title has a formulaic structure ‘<Country_name> historical newspaper corpus PressMint-<Country_code> [PressMint]’, with an equivalent structure for the local language. Note that the corpus ‘stamp’ in square brackets can also be ‘[PressMint.ana]’ for the linguistically annotated version of the corpus (as explained in the Chapter on Linguistic annotation) or ‘[PressMint SAMPLE]’ for corpus data samples, as available on the PressMint GitHub repository.
After the titles come one or more responsibility statements, <respStmt>, each one containing one or more person names, <persName>, with an optional ref attribute, giving the (typically ORCID) URL, where more information about the person can be found, and the responsibility element <resp>, which specifies what responsibility the statement is about.
In a similar manner, the <funder> elements give information on the organisations which have financially contributed to the compilation of the corpus, with the names of the organisations given in the <orgName> elements.
It should be noted that while the number of texts corresponds to the number of corpus components, the number of words can be somewhat complex to compute. Both are, however, inserted into the TEI headers in the finalisation of a corpus (cf. the Section on Finalisation of corpora) by a common script, so it is not necessary to insert the extent in the process of developing a PressMint corpus.
In contrast, the encoding description of a corpus component contains only two elements, namely (and redundantly) the <projectDesc> and the <tagsDecl>.
Pointing attributes, such as url or ana, take as their value a reference or space-delimited series of references to a URL and/or the value of xml:id elements. If the reference is to an ID, then it is prefixed the hash character, #
, e.g. #parla.uni, and if they are to an ID in another XML document, then the hash follows the URL of the document, e.g. https://nl.ijs.si/ME/V6/msd/tables/msd-fslib2-sl.xml#Vmpr1p.
Because complete URLs tend to be long, especially inconvenient when such references are given to every token in a corpus, TEI introduces the so called Abbreviated Pointers, whereby the reference to an ID can be given in the form of a prefix, which is separated by a colon from the local part of the ID reference, and the value of this prefix is determined via the <prefixDef> element in the <encodingDesc> of the TEI header.
mte
prefix, so for any ID reference with this prefix, e.g. mte:Nps, the part after the prefix (Nps
) should be matched against (.+)
and the result being the matched part (here the entire relation Nps
) substituted by #$1
, i.e. by the hash character followed by the original value, so that mte:Nps gives https://nl.ijs.si/ME/V6/msd/tables/msd-fslib-sl.xml#Nps.Finally, each prefix definition also contains a possibly bi-lingual paragraph explaining the definition.
true
for the local language.@default="true"
.Facsimile (i.e. images) of the newspapers are highly useful, both for providing the original to the trancriptions in their analyis, as well as for allowing better OCR as the state-of-the-art improves. If the facsimile is available it also be also published together with the PressMint corpora, and should be referred to from the corpus, in particular from each corpus component.
How to encode references to the facsimile images in TEI is, in the general case, explained in the Chapter on Representation of Primary Sources of the TEI Guidelines. In this chapter we only provide the basic representation that is directly supported in PressMint.
The <facsimile> element should appear in a corpus component immediately after the <teiHeader>, c.f. the Section on Overall XML corpus structure. It contains pointers to the complete facsimile or its parts, i.e. URLs of the images of an issue or its individual pages, and can further structure or document these images.
Apart from modelling pages with <surface>, areas inside them can also be specified. For this, <zone> elements inside <surface> are used; these can specify a rectangle or, in general, a polygon inside it; the details are given in the TEI Section on Digital Facsimiles. Note, however, that if this approach is used, a mechanism needs to be implemented to show the correct zone on the image.
Note that these breaks can appear anywhere in the text, including in the middle of a (end-of-line hyphenated) word, which makes the linguistic annotation of such text more complicated, as texual data is mixed with markup, typically not otherwise the case. Furthermore, by convention, the breaks should appear as high up in the hierarchy as possible, i.e. if a break should appear at the begining of a paragraph, it should be encoded before its start, as in the example above.
The newspaper texts are encoded in the <text> element of corpus components. This element can contain <front> and <back> and must contain <body>. If they are used, <front> will typically contain the front-matter of a newspaper issue, i.e. its banner, while back would contain material that does not fit in well with the article-based structure of a newspaper, e.g. advertisements. However, this material can also be included directly in the <body> but at the risk of disrupting the flow of the article texts.
PressMint makes few assumptions on the structure of the texts in the TEI <body> and optional <front> and <back>. At minimum, they need to contain a series of paragraphs, i.e. <p> elements possibly with interspersed empty break elements, as discussed in the preceding Section on Connecting the text to the facsimile.
This section introduces the PressMint linguistic annotation. An important note is that a linguistically annotated PressMint corpus is stored separately from its base (or plain-text) TEI version, i.e. the version that has been discussed in the preceding sections. The encoding of the linguistically annotated version differs from the plain-text one in the following:
.ana.xml
. For example, if the plain-text TEI root has the file name PressMint-CZ.xml
, the linguistically annotated one should be PressMint-CZ.ana.xml
, and if the component plain-text files is PressMint-CZ_2016-04-13.xml
the linguistically annotated one is PressMint-CZ_2016-04-13.ana.xml
..ana
, e.g. <teiCorpus xml:id="PressMint-CZ.ana">
.[PressMint]
in the plain-text version, should be [PressMint.ana]
for the linguistically annotated version.Linguistic annotation is added only to the text content of <p> elements. For this text, PressMint requires the following additional markup to be present:
Below, we explain the encoding of each of these levels.
The base form or lemmas of a word is given as the value of the lemma attribute, while punctuation characters, <pc>, do not have this attribute.
The UD part-of-speech and morphological features are both packed in the msd attribute, with the part-of-speech having the UPosTag
linguistic attribute, and the features separated by the vertical bar.
PressMint also allows (but does not require) part-of-speech tags from some other tagset3 to be added to the linguistic annotation. Where this information is encoded, depends on the type of tagset.
mte:
is a prefix that is, via the TEI extended pointer syntax as defined in the TEI header (cf. the Section on Prefix definitions) expanded so that the value of such an ana attribute points to the expansions of the given tag to a feature structure. For example, the value mte:Vmpr1p would be expanded to https://nl.ijs.si/ME/V6/msd/tables/msd-fslib2-sl.xml#Vmpr1p, which then resolves to the feature-structure below: The language of older newspapers might differ significantly from the contemporary norm. This has an impact on the quality of linguistic annotations, in cases where the annotation tool has been trained on contemporary texts only, as well as hindering searching for particular words or lemmas in their contemporary spellings. To alleviate this, normalisation (i.e. modernisation) is often used on archaic texts, and the subsequent linguist annotation is performed on such modernised text.
Modern neural approaches typically take a complete chunk of text and normalise it, while more traditional approaches perform the normalisation on individual words. The former has the advantage of being capable not only of modernising the spelling individual words but also substituting archaic words with their contemporary equivalents, modernising multi-word units or even syntactic constructions. However, if such a method is used on a PressMint corpus this means that the linguistically annotated variant of the corpus will contain only the modernised text, and the alignment to the plain-text variant of the corpus will be at the paragraph level only. In other words, losing word-alignment with the original tokens means also losing the ability to search for or directly view the original tokens.
In contrast, traditional methods (such as cSMTiser) will typically normalise only the spelling of individual words, or, at most, sequences of words. This means that the text has to be first tokenised, normalisation applied to such (series of) tokens, and the resulting normalised word-tokens then linguistically annotated. Here both the original and normalised and annotated words are available in the linguistically annotated version of the corpus.
join="right"
should be added to the top level word as well as to the last nested word.PressMint also requires annotation of Named Entities (NE), which should be categorised into the following four types:
What kind of metadata a plain-text PressMint corpus should contain was explained in the Section on Corpus metadata and in this section we detail what additions must be made to the metadata for the linguistically annotated version. Note that the other changes for this version of a corpus have been already explained at the start of this Chapter. For PressMint, this information has been, so far, simplified in comparison with ParlaMint; we do not foresee linguistic taxonomies (in particular, for NER), so there are one obligatory and one optional metadata element dedicated to linguistic processing. Both are added to the <teiHeader> of the root of the linguistically analysed corpus, namely a description of the tool(s) used to linguistically annotate the corpus, and optional taxonomies of corpus-specific PoS tags.
The chapter explains how to validate and finalise a PressMint corpus, and introduces scripts for converting a PressMint corpus to other, derived formats.
The XML structure of PressMint corpora can be validated via RelaxNG schema produced as a customisation of the TEI Guidelines.
The TEI customisation is written as a TEI ODD document, which is, in fact, the XML version of this document, and is available in the TEI/ directory of the PressMint GitHub repository. The XML contains not only the prose guidelines, but also the formal specification of the TEI schema, which is given in the Appendix A. In the XML it contains the formal schema specification, while in the on-line version this is converted to a reference to all the elements, attributes and classes used in PressMint corpora --- quite a lot, as the PressMint schema has been left open enough to accommodate differing requirements in the encoding.
The ODD document is not immediately useful for XML validation but has to be converted with standard TEI XSLT stylesheets to a RelaxNG schema. The TEI ODD and its RelaxNG schema (PressMint.rng (and the HTML guidelelines) are always kept in sync. This schema should be used to check that PressMint component files validate against TEI, typically using Jing (cf. Contributing to PressMint.
While the vast majority of converting source encodings into the PressMint corpus format is left to the compilers of a corpus, there are a few metadata elements that can be produced by a common script on the basis of nearly finished corpora, which then results in the final version of the corpus for a particular release. This includes setting the date, edition and handle under which the corpus will be distributed, and also calculating the size of the corpus (cf. the Sections on Extents and on Tags declaration). The script for finalisation can be found in the Scripts/ directory of the PressMint GitHub repository and the README file briefly explains its function; more comments can be found in the script itself.
A TEI encoded document is, in general, not meant to be used directly by software programs, rather, it serves as an interchange and storage format. The PressMint project has produced various scripts to down-convert the XML encoded corpora to other formats and they can be found in the Scripts/ directory of the PressMint GitHub repository, with the README file listing them and explaining their function. In short, the scripts convert the PressMint XML to plain text, to CoNLL-U, and to vertical format. There is also a script that takes a PressMint corpus and makes from it a sample for inclusion to the PressMint GitHub repository.
The PressMint GitHub repository contains these guidelines, the PressMint XML schemas, the scripts used to validate, finalise and convert the PressMint TEI XML corpora to derived formats, and samples of the PressMint corpora. There are four main branches in the repository:
The validation procedure for corpora is explained in the Section on Validating PressMint corpora, while the technical aspects of contributing corpora is further explained in the CONTRIBUTING file of the repository.
The work on these recommendations was funded by the CLARIN Research Infrastructure for Language Resources and Tools.
<TEI> (TEI document) contains a single TEI-conformant document, combining a single TEI header with one or more members of the model.resource class. Multiple <TEI> elements may be combined within a <TEI> (or <teiCorpus>) element. [4. Default Text Structure 16.1. Varieties of Composite Text] | |||||||||
Module | textstructure — Formal specification | ||||||||
Attributes |
| ||||||||
Member of | |||||||||
Contained by | |||||||||
May contain | |||||||||
Note | As with all elements in the TEI scheme (except <egXML>) this element is in the TEI namespace (see 5.7.2. Namespaces). Thus, when it is used as the outermost element of a TEI document, it is necessary to specify the TEI namespace on it. This is customarily achieved by including http://www.tei-c.org/ns/1.0 as the value of the XML namespace declaration (xmlns), without indicating a prefix, and then not using a prefix on TEI elements in the rest of the document. For example: <TEI version="4.8.1" xml:lang="it" xmlns="http://www.tei-c.org/ns/1.0">. | ||||||||
Example | <TEI version="3.3.0" xmlns="http://www.tei-c.org/ns/1.0">
<teiHeader>
<fileDesc>
<titleStmt>
<title>The shortest TEI Document Imaginable</title>
</titleStmt>
<publicationStmt>
<p>First published as part of TEI P2, this is the P5
version using a namespace.</p>
</publicationStmt>
<sourceDesc>
<p>No source: this is an original work.</p>
</sourceDesc>
</fileDesc>
</teiHeader>
<text>
<body>
<p>This is about the shortest TEI document imaginable.</p>
</body>
</text>
</TEI> | ||||||||
Example | <TEI version="2.9.1" xmlns="http://www.tei-c.org/ns/1.0">
<teiHeader>
<fileDesc>
<titleStmt>
<title>A TEI Document containing four page images </title>
</titleStmt>
<publicationStmt>
<p>Unpublished demonstration file.</p>
</publicationStmt>
<sourceDesc>
<p>No source: this is an original work.</p>
</sourceDesc>
</fileDesc>
</teiHeader>
<facsimile>
<graphic url="page1.png"/>
<graphic url="page2.png"/>
<graphic url="page3.png"/>
<graphic url="page4.png"/>
</facsimile>
</TEI> | ||||||||
Content model | <content> <sequence> <elementRef key="teiHeader"/> <alternate> <sequence> <classRef key="model.resource" minOccurs="1" maxOccurs="unbounded"/> <elementRef key="TEI" minOccurs="0" maxOccurs="unbounded"/> </sequence> <elementRef key="TEI" minOccurs="1" maxOccurs="unbounded"/> </alternate> </sequence> </content> ⚓ | ||||||||
Schema Declaration | element TEI { tei_att.global.attributes, tei_att.typed.attributes, attribute version { text }?, ( tei_teiHeader, ( ( tei_model.resource+, tei_TEI* ) | tei_TEI+ ) ) }⚓ |
<addSpan> (added span of text) marks the beginning of a longer sequence of text added by an author, scribe, annotator or corrector (see also <add>). [12.3.1.4. Additions and Deletions] | |
Module | transcr — Formal specification |
Attributes |
|
Member of | |
Contained by | |
May contain | Empty element |
Note | Both the beginning and the end of the added material must be marked; the beginning by the <addSpan> element itself, the end by the spanTo attribute. |
Example | <handNote xml:id="HEOL"
scribe="HelgiÓlafsson"/>
<!-- ... -->
<body>
<div>
<!-- text here -->
</div>
<addSpan n="added_gathering" hand="#HEOL"
spanTo="#P025"/>
<div>
<!-- text of first added poem here -->
</div>
<div>
<!-- text of second added poem here -->
</div>
<div>
<!-- text of third added poem here -->
</div>
<div>
<!-- text of fourth added poem here -->
</div>
<anchor xml:id="P025"/>
<div>
<!-- more text here -->
</div>
</body> |
Schematron |
<sch:rule context="tei:addSpan">
<sch:assert test="@spanTo">The @spanTo attribute of <sch:name/> is required.</sch:assert>
</sch:rule> |
Schematron |
<sch:rule context="tei:addSpan">
<sch:assert test="@spanTo">L'attribut spanTo est requis.</sch:assert>
</sch:rule> |
Content model | <content> <empty/> </content> ⚓ |
Schema Declaration | element addSpan { tei_att.global.attributes, tei_att.dimensions.attributes, tei_att.spanning.attributes, tei_att.transcriptional.attributes, tei_att.typed.attributes, empty }⚓ |
<appInfo> (application information) records information about an application which has edited the TEI file. [2.3.11. The Application Information Element] | |
Module | header — Formal specification |
Attributes |
|
Member of | |
Contained by | header: encodingDesc |
May contain | header: application |
Example | <appInfo>
<application version="1.24" ident="Xaira">
<label>XAIRA Indexer</label>
<ptr target="#P1"/>
</application>
</appInfo> |
Content model | <content> <classRef key="model.applicationLike" minOccurs="1" maxOccurs="unbounded"/> </content> ⚓ |
Schema Declaration | element appInfo { tei_att.global.attributes, tei_model.applicationLike+ }⚓ |
<application> provides information about an application which has acted upon the document. [2.3.11. The Application Information Element] | |||||||||||||
Module | header — Formal specification | ||||||||||||
Attributes |
| ||||||||||||
Member of | |||||||||||||
Contained by | header: appInfo | ||||||||||||
May contain | |||||||||||||
Example | <appInfo>
<application version="1.5"
ident="ImageMarkupTool1" notAfter="2006-06-01">
<label>Image Markup Tool</label>
<ptr target="#P1"/>
<ptr target="#P2"/>
</application>
</appInfo> This example shows an appInfo element documenting the fact that version 1.5 of the Image Markup Tool1 application has an interest in two parts of a document which was last saved on June 6 2006. The parts concerned are accessible at the URLs given as target for the two <ptr> elements. | ||||||||||||
Content model | <content> <sequence> <classRef key="model.labelLike" minOccurs="1" maxOccurs="unbounded"/> <alternate> <classRef key="model.ptrLike" minOccurs="0" maxOccurs="unbounded"/> <classRef key="model.pLike" minOccurs="0" maxOccurs="unbounded"/> </alternate> </sequence> </content> ⚓ | ||||||||||||
Schema Declaration | element application { tei_att.global.attributes, tei_att.datable.attributes, tei_att.typed.attributes, attribute ident { text }, attribute version { text }, ( tei_model.labelLike+, ( tei_model.ptrLike* | tei_model.pLike* ) ) }⚓ |
<availability> (availability) supplies information about the availability of a text, for example any restrictions on its use or distribution, its copyright status, any licence applying to it, etc. [2.2.4. Publication, Distribution, Licensing, etc.] | |||||||||
Module | header — Formal specification | ||||||||
Attributes |
| ||||||||
Member of | |||||||||
Contained by | core: bibl header: publicationStmt | ||||||||
May contain | |||||||||
Note | A consistent format should be adopted | ||||||||
Example | <availability status="restricted">
<p>Available for academic research purposes only.</p>
</availability>
<availability status="free">
<p>In the public domain</p>
</availability>
<availability status="restricted">
<p>Available under licence from the publishers.</p>
</availability> | ||||||||
Example | <availability>
<licence target="http://opensource.org/licenses/MIT">
<p>The MIT License
applies to this document.</p>
<p>Copyright (C) 2011 by The University of Victoria</p>
<p>Permission is hereby granted, free of charge, to any person obtaining a copy
of this software and associated documentation files (the "Software"), to deal
in the Software without restriction, including without limitation the rights
to use, copy, modify, merge, publish, distribute, sublicense, and/or sell
copies of the Software, and to permit persons to whom the Software is
furnished to do so, subject to the following conditions:</p>
<p>The above copyright notice and this permission notice shall be included in
all copies or substantial portions of the Software.</p>
<p>THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR
IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY,
FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE
AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER
LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM,
OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN
THE SOFTWARE.</p>
</licence>
</availability> | ||||||||
Schematron |
<sch:pattern is-a="declarable">
<sch:param name="tde"
value="tei:availability"/>
</sch:pattern> | ||||||||
Content model | <content> <alternate minOccurs="1" maxOccurs="unbounded"> <classRef key="model.availabilityPart"/> <classRef key="model.pLike"/> </alternate> </content> ⚓ | ||||||||
Schema Declaration | element availability { tei_att.global.attributes, tei_att.declarable.attributes, attribute status { "free" | "unknown" | "restricted" }?, ( tei_model.availabilityPart | tei_model.pLike )+ }⚓ |
<back> (back matter) contains any appendixes, etc. following the main part of a text. [4.7. Back Matter 4. Default Text Structure] | |
Module | textstructure — Formal specification |
Attributes |
|
Contained by | |
May contain | |
Note | Because cultural conventions differ as to which elements are grouped as back matter and which as front matter, the content models for the <back> and <front> elements are identical. |
Example | <back>
<div type="appendix">
<head>The Golden Dream or, the Ingenuous Confession</head>
<p>TO shew the Depravity of human Nature, and how apt the Mind is to be misled by Trinkets
and false Appearances, Mrs. Two-Shoes does acknowledge, that after she became rich, she
had like to have been, too fond of Money
<!-- .... -->
</p>
</div>
<!-- ... -->
<div type="epistle">
<head>A letter from the Printer, which he desires may be inserted</head>
<salute>Sir.</salute>
<p>I have done with your Copy, so you may return it to the Vatican, if you please;
<!-- ... -->
</p>
</div>
<div type="advert">
<head>The Books usually read by the Scholars of Mrs Two-Shoes are these and are sold at Mr
Newbery's at the Bible and Sun in St Paul's Church-yard.</head>
<list>
<item n="1">The Christmas Box, Price 1d.</item>
<item n="2">The History of Giles Gingerbread, 1d.</item>
<!-- ... -->
<item n="42">A Curious Collection of Travels, selected from the Writers of all Nations,
10 Vol, Pr. bound 1l.</item>
</list>
</div>
<div type="advert">
<head>By the KING's Royal Patent, Are sold by J. NEWBERY, at the Bible and Sun in St.
Paul's Church-Yard.</head>
<list>
<item n="1">Dr. James's Powders for Fevers, the Small-Pox, Measles, Colds, &c. 2s.
6d</item>
<item n="2">Dr. Hooper's Female Pills, 1s.</item>
<!-- ... -->
</list>
</div>
</back> |
Content model | <content> <sequence> <alternate minOccurs="0" maxOccurs="unbounded"> <classRef key="model.frontPart"/> <classRef key="model.pLike.front"/> <classRef key="model.pLike"/> <classRef key="model.listLike"/> <classRef key="model.global"/> </alternate> <alternate minOccurs="0"> <sequence> <classRef key="model.div1Like"/> <alternate minOccurs="0" maxOccurs="unbounded"> <classRef key="model.frontPart"/> <classRef key="model.div1Like"/> <classRef key="model.global"/> </alternate> </sequence> <sequence> <classRef key="model.divLike"/> <alternate minOccurs="0" maxOccurs="unbounded"> <classRef key="model.frontPart"/> <classRef key="model.divLike"/> <classRef key="model.global"/> </alternate> </sequence> </alternate> <sequence minOccurs="0"> <classRef key="model.divBottomPart"/> <alternate minOccurs="0" maxOccurs="unbounded"> <classRef key="model.divBottomPart"/> <classRef key="model.global"/> </alternate> </sequence> </sequence> </content> ⚓ |
Schema Declaration | element back { tei_att.global.attributes, tei_att.declaring.attributes, ( ( tei_model.frontPart | tei_model.pLike.front | tei_model.pLike | tei_model.listLike | tei_model.global )*, ( ( tei_model.div1Like, ( tei_model.frontPart | tei_model.div1Like | tei_model.global )* ) | ( tei_model.divLike, ( tei_model.frontPart | tei_model.divLike | tei_model.global )* ) )?, ( ( tei_model.divBottomPart, ( tei_model.divBottomPart | tei_model.global )* )? ) ) }⚓ |
<bibl> (bibliographic citation) contains a loosely-structured bibliographic citation of which the sub-components may or may not be explicitly tagged. [3.12.1. Methods of Encoding Bibliographic References and Lists of References 2.2.7. The Source Description 16.3.2. Declarable Elements] | |
Module | core — Formal specification |
Attributes |
|
Member of | |
Contained by | |
May contain | core: bibl date email gap measure meeting name note num pb pubPlace publisher ref respStmt term time title unit gaiji: g header: availability edition extent funder idno transcr: addSpan damage damageSpan delSpan ex fw handShift listTranspose metamark mod redo restore retrace secl space subst substJoin supplied surplus undo character data |
Note | Contains phrase-level elements, together with any combination of elements from the model.biblPart class |
Example | <bibl>Blain, Clements and Grundy: Feminist Companion to Literature in English (Yale,
1990)</bibl> |
Example | <bibl>
<title level="a">The Interesting story of the Children in the Wood</title>. In
<author>Victor E Neuberg</author>, <title>The Penny Histories</title>.
<publisher>OUP</publisher>
<date>1968</date>.
</bibl> |
Example | <bibl type="article" subtype="book_chapter"
xml:id="carlin_2003">
<author>
<name>
<surname>Carlin</surname>
(<forename>Claire</forename>)</name>
</author>,
<title level="a">The Staging of Impotence : France’s last
congrès</title> dans
<bibl type="monogr">
<title level="m">Theatrum mundi : studies in honor of Ronald W.
Tobin</title>, éd.
<editor>
<name>
<forename>Claire</forename>
<surname>Carlin</surname>
</name>
</editor> et
<editor>
<name>
<forename>Kathleen</forename>
<surname>Wine</surname>
</name>
</editor>,
<pubPlace>Charlottesville, Va.</pubPlace>,
<publisher>Rookwood Press</publisher>,
<date when="2003">2003</date>.
</bibl>
</bibl> |
Schematron |
<sch:pattern is-a="declarable">
<sch:param name="tde" value="tei:bibl"/>
</sch:pattern> |
Content model | <content> <alternate minOccurs="0" maxOccurs="unbounded"> <textNode/> <classRef key="model.gLike"/> <classRef key="model.highlighted"/> <classRef key="model.pPart.data"/> <classRef key="model.pPart.edit"/> <classRef key="model.segLike"/> <classRef key="model.ptrLike"/> <classRef key="model.biblPart"/> <classRef key="model.global"/> </alternate> </content> ⚓ |
Schema Declaration | element bibl { tei_att.global.attributes, tei_att.canonical.attributes, tei_att.cmc.attributes, tei_att.declarable.attributes, tei_att.docStatus.attributes, tei_att.sortable.attributes, tei_att.typed.attributes, ( text | tei_model.gLike | tei_model.highlighted | tei_model.pPart.data | tei_model.pPart.edit | tei_model.segLike | tei_model.ptrLike | tei_model.biblPart | tei_model.global )* }⚓ |
<body> (text body) contains the whole body of a single unitary text, excluding any front or back matter. [4. Default Text Structure] | |
Module | textstructure — Formal specification |
Attributes |
|
Contained by | textstructure: text |
May contain | |
Example | <body>
<l>Nu scylun hergan hefaenricaes uard</l>
<l>metudæs maecti end his modgidanc</l>
<l>uerc uuldurfadur sue he uundra gihuaes</l>
<l>eci dryctin or astelidæ</l>
<l>he aerist scop aelda barnum</l>
<l>heben til hrofe haleg scepen.</l>
<l>tha middungeard moncynnæs uard</l>
<l>eci dryctin æfter tiadæ</l>
<l>firum foldu frea allmectig</l>
<trailer>primo cantauit Cædmon istud carmen.</trailer>
</body> |
Content model | <content> <sequence> <classRef key="model.global" minOccurs="0" maxOccurs="unbounded"/> <sequence minOccurs="0"> <classRef key="model.divTop"/> <alternate minOccurs="0" maxOccurs="unbounded"> <classRef key="model.global"/> <classRef key="model.divTop"/> </alternate> </sequence> <sequence minOccurs="0"> <classRef key="model.divGenLike"/> <alternate minOccurs="0" maxOccurs="unbounded"> <classRef key="model.global"/> <classRef key="model.divGenLike"/> </alternate> </sequence> <alternate> <sequence minOccurs="1" maxOccurs="unbounded"> <classRef key="model.divLike"/> <alternate minOccurs="0" maxOccurs="unbounded"> <classRef key="model.global"/> <classRef key="model.divGenLike"/> </alternate> </sequence> <sequence minOccurs="1" maxOccurs="unbounded"> <classRef key="model.div1Like"/> <alternate minOccurs="0" maxOccurs="unbounded"> <classRef key="model.global"/> <classRef key="model.divGenLike"/> </alternate> </sequence> <sequence> <sequence minOccurs="1" maxOccurs="unbounded"> <alternate minOccurs="1" maxOccurs="1"> <elementRef key="schemaSpec"/> <classRef key="model.common"/> </alternate> <classRef key="model.global" minOccurs="0" maxOccurs="unbounded"/> </sequence> <alternate minOccurs="0"> <sequence minOccurs="1" maxOccurs="unbounded"> <classRef key="model.divLike"/> <alternate minOccurs="0" maxOccurs="unbounded"> <classRef key="model.global"/> <classRef key="model.divGenLike"/> </alternate> </sequence> <sequence minOccurs="1" maxOccurs="unbounded"> <classRef key="model.div1Like"/> <alternate minOccurs="0" maxOccurs="unbounded"> <classRef key="model.global"/> <classRef key="model.divGenLike"/> </alternate> </sequence> </alternate> </sequence> </alternate> <sequence minOccurs="0" maxOccurs="unbounded"> <classRef key="model.divBottom"/> <classRef key="model.global" minOccurs="0" maxOccurs="unbounded"/> </sequence> </sequence> </content> ⚓ |
Schema Declaration | element body { tei_att.global.attributes, tei_att.declaring.attributes, ( tei_model.global*, ( ( tei_model.divTop, ( tei_model.global | tei_model.divTop )* )? ), ( ( tei_model.divGenLike, ( tei_model.global | tei_model.divGenLike )* )? ), ( ( ( tei_model.divLike, ( tei_model.global | tei_model.divGenLike )* )+ ) | ( ( tei_model.div1Like, ( tei_model.global | tei_model.divGenLike )* )+ ) | ( ( ( ( schemaSpec | tei_model.common ), tei_model.global* )+ ), ( ( ( tei_model.divLike, ( tei_model.global | tei_model.divGenLike )* )+ ) | ( ( tei_model.div1Like, ( tei_model.global | tei_model.divGenLike )* )+ ) )? ) ), ( ( tei_model.divBottom, tei_model.global* )* ) ) }⚓ |
<catDesc> (category description) describes some category within a taxonomy or text typology, either in the form of a brief prose description or in terms of the situational parameters used by the TEI formal <textDesc>. [2.3.7. The Classification Declaration] | |
Module | header — Formal specification |
Attributes |
|
Contained by | header: category |
May contain | |
Example | <catDesc>Prose reportage</catDesc> |
Example | <catDesc>
<textDesc n="novel">
<channel mode="w">print; part issues</channel>
<constitution type="single"/>
<derivation type="original"/>
<domain type="art"/>
<factuality type="fiction"/>
<interaction type="none"/>
<preparedness type="prepared"/>
<purpose type="entertain" degree="high"/>
<purpose type="inform" degree="medium"/>
</textDesc>
</catDesc> |
Content model | <content> <alternate minOccurs="0" maxOccurs="unbounded"> <textNode/> <classRef key="model.limitedPhrase"/> <classRef key="model.catDescPart"/> </alternate> </content> ⚓ |
Schema Declaration | element catDesc { tei_att.global.attributes, tei_att.canonical.attributes, ( text | tei_model.limitedPhrase | tei_model.catDescPart )* }⚓ |
<catRef> (category reference) specifies one or more defined categories within some taxonomy or text typology. [2.4.3. The Text Classification] | |||||||
Module | header — Formal specification | ||||||
Attributes |
| ||||||
Contained by | header: textClass | ||||||
May contain | Empty element | ||||||
Note | The scheme attribute needs to be supplied only if more than one taxonomy has been declared. | ||||||
Example | <catRef scheme="#myTopics"
target="#news #prov #sales2"/>
<!-- elsewhere -->
<taxonomy xml:id="myTopics">
<category xml:id="news">
<catDesc>Newspapers</catDesc>
</category>
<category xml:id="prov">
<catDesc>Provincial</catDesc>
</category>
<category xml:id="sales2">
<catDesc>Low to average annual sales</catDesc>
</category>
</taxonomy> | ||||||
Content model | <content> <empty/> </content> ⚓ | ||||||
Schema Declaration | element catRef { tei_att.global.attributes, tei_att.pointing.attributes, attribute scheme { text }?, empty }⚓ |
<category> (category) contains an individual descriptive category, possibly nested within a superordinate category, within a user-defined taxonomy. [2.3.7. The Classification Declaration] | |
Module | header — Formal specification |
Attributes |
|
Contained by | |
May contain | |
Example | <category xml:id="b1">
<catDesc>Prose reportage</catDesc>
</category> |
Example | <category xml:id="b2">
<catDesc>Prose </catDesc>
<category xml:id="b11">
<catDesc>journalism</catDesc>
</category>
<category xml:id="b12">
<catDesc>fiction</catDesc>
</category>
</category> |
Example | <category xml:id="LIT">
<catDesc xml:lang="pl">literatura piękna</catDesc>
<catDesc xml:lang="en">fiction</catDesc>
<category xml:id="LPROSE">
<catDesc xml:lang="pl">proza</catDesc>
<catDesc xml:lang="en">prose</catDesc>
</category>
<category xml:id="LPOETRY">
<catDesc xml:lang="pl">poezja</catDesc>
<catDesc xml:lang="en">poetry</catDesc>
</category>
<category xml:id="LDRAMA">
<catDesc xml:lang="pl">dramat</catDesc>
<catDesc xml:lang="en">drama</catDesc>
</category>
</category> |
Content model | <content> <sequence> <alternate> <elementRef key="catDesc" minOccurs="1" maxOccurs="unbounded"/> <alternate minOccurs="0" maxOccurs="unbounded"> <classRef key="model.descLike"/> <elementRef key="equiv"/> <elementRef key="gloss"/> </alternate> </alternate> <elementRef key="category" minOccurs="0" maxOccurs="unbounded"/> </sequence> </content> ⚓ |
Schema Declaration | element category { tei_att.global.attributes, tei_att.datcat.attributes, ( ( tei_catDesc+ | ( tei_model.descLike | equiv | gloss )* ), tei_category* ) }⚓ |
<change> (change) documents a change or set of changes made during the production of a source document, or during the revision of an electronic file. [2.6. The Revision Description 2.4.1. Creation 12.7. Identifying Changes and Revisions] | |||||||
Module | header — Formal specification | ||||||
Attributes |
| ||||||
Contained by | header: revisionDesc | ||||||
May contain | |||||||
Note | The who attribute may be used to point to any other element, but will typically specify a <respStmt> or <person> element elsewhere in the header, identifying the person responsible for the change and their role in making it. It is recommended that changes be recorded with the most recent first. The status attribute may be used to indicate the status of a document following the change documented. | ||||||
Example | <titleStmt>
<title> ... </title>
<editor xml:id="LDB">Lou Burnard</editor>
<respStmt xml:id="BZ">
<resp>copy editing</resp>
<name>Brett Zamir</name>
</respStmt>
</titleStmt>
<!-- ... -->
<revisionDesc status="published">
<change who="#BZ" when="2008-02-02"
status="public">Finished chapter 23</change>
<change who="#BZ" when="2008-01-02"
status="draft">Finished chapter 2</change>
<change n="P2.2" when="1991-12-21"
who="#LDB">Added examples to section 3</change>
<change when="1991-11-11" who="#MSM">Deleted chapter 10</change>
</revisionDesc> | ||||||
Example | <profileDesc>
<creation>
<listChange>
<change xml:id="DRAFT1">First draft in pencil</change>
<change xml:id="DRAFT2"
notBefore="1880-12-09">First revision, mostly
using green ink</change>
<change xml:id="DRAFT3"
notBefore="1881-02-13">Final corrections as
supplied to printer.</change>
</listChange>
</creation>
</profileDesc> | ||||||
Content model | <content> <macroRef key="macro.specialPara"/> </content> ⚓ | ||||||
Schema Declaration | element change { tei_att.global.attributes, tei_att.ascribed.attributes, tei_att.datable.attributes, tei_att.docStatus.attributes, tei_att.typed.attributes, attribute target { list { + } }?, tei_macro.specialPara }⚓ |
<char> (character) provides descriptive information about a character. [5.2. Markup Constructs for Representation of Characters and Glyphs] | |
Module | gaiji — Formal specification |
Attributes |
|
Contained by | gaiji: charDecl |
May contain | gaiji: localProp mapping unicodeProp unihanProp |
Example | <char xml:id="circledU4EBA">
<localProp name="Name"
value="CIRCLED IDEOGRAPH 4EBA"/>
<localProp name="daikanwa" value="36"/>
<unicodeProp name="Decomposition_Mapping"
value="circle"/>
<mapping type="standard">人</mapping>
</char> |
Content model | <content> <alternate minOccurs="0" maxOccurs="unbounded"> <elementRef key="unicodeProp"/> <elementRef key="unihanProp"/> <elementRef key="localProp"/> <elementRef key="mapping"/> <elementRef key="figure"/> <classRef key="model.graphicLike"/> <classRef key="model.noteLike"/> <classRef key="model.descLike"/> </alternate> </content> ⚓ |
Schema Declaration | element char { tei_att.global.attributes, ( tei_unicodeProp | tei_unihanProp | tei_localProp | tei_mapping | figure | tei_model.graphicLike | tei_model.noteLike | tei_model.descLike )* }⚓ |
<charDecl> (character declarations) provides information about nonstandard characters and glyphs. [5.2. Markup Constructs for Representation of Characters and Glyphs] | |
Module | gaiji — Formal specification |
Attributes |
|
Member of | |
Contained by | header: encodingDesc |
May contain | |
Example | <charDecl>
<char xml:id="aENL">
<unicodeProp name="Name"
value="LATIN LETTER ENLARGED SMALL A"/>
<mapping type="standard">a</mapping>
</char>
</charDecl> |
Content model | <content> <sequence> <elementRef key="desc" minOccurs="0"/> <alternate minOccurs="1" maxOccurs="unbounded"> <elementRef key="char"/> <elementRef key="glyph"/> </alternate> </sequence> </content> ⚓ |
Schema Declaration | element charDecl { tei_att.global.attributes, ( tei_desc?, ( tei_char | tei_glyph )+ ) }⚓ |
<classDecl> (classification declarations) contains one or more taxonomies defining any classificatory codes used elsewhere in the text. [2.3.7. The Classification Declaration 2.3. The Encoding Description] | |
Module | header — Formal specification |
Attributes |
|
Member of | |
Contained by | header: encodingDesc |
May contain | header: taxonomy |
Example | <classDecl>
<taxonomy xml:id="LCSH">
<bibl>Library of Congress Subject Headings</bibl>
</taxonomy>
</classDecl>
<!-- ... -->
<textClass>
<keywords scheme="#LCSH">
<term>Political science</term>
<term>United States -- Politics and government --
Revolution, 1775-1783</term>
</keywords>
</textClass> |
Content model | <content> <elementRef key="taxonomy" minOccurs="1" maxOccurs="unbounded"/> </content> ⚓ |
Schema Declaration | element classDecl { tei_att.global.attributes, tei_taxonomy+ }⚓ |
<correction> (correction principles) states how and under what circumstances corrections have been made in the text. [2.3.3. The Editorial Practices Declaration 16.3.2. Declarable Elements] | |||||||||||||||||
Module | header — Formal specification | ||||||||||||||||
Attributes |
| ||||||||||||||||
Member of | |||||||||||||||||
Contained by | header: editorialDecl | ||||||||||||||||
May contain | core: p | ||||||||||||||||
Note | May be used to note the results of proof reading the text against its original, indicating (for example) whether discrepancies have been silently rectified, or recorded using the editorial tags described in section 3.5. Simple Editorial Changes. | ||||||||||||||||
Example | <correction>
<p>Errors in transcription controlled by using the WordPerfect spelling checker, with a user
defined dictionary of 500 extra words taken from Chambers Twentieth Century
Dictionary.</p>
</correction> | ||||||||||||||||
Schematron |
<sch:pattern is-a="declarable">
<sch:param name="tde"
value="tei:correction"/>
</sch:pattern> | ||||||||||||||||
Content model | <content> <classRef key="model.pLike" minOccurs="1" maxOccurs="unbounded"/> </content> ⚓ | ||||||||||||||||
Schema Declaration | element correction { tei_att.global.attributes, tei_att.declarable.attributes, attribute status { "high" | "medium" | "low" | "unknown" }?, attribute method { "silent" | "markup" }?, tei_model.pLike+ }⚓ |
<damage> (damage) contains an area of damage to the text witness. [12.3.3.1. Damage, Illegibility, and Supplied Text] | |
Module | transcr — Formal specification |
Attributes |
|
Member of | |
Contained by | |
May contain | |
Note | Since damage to text witnesses frequently makes them harder to read, the <damage> element will often contain an <unclear> element. If the damaged area is not continuous (e.g. a stain affecting several strings of text), the group attribute may be used to group together several related <damage> elements; alternatively the <join> element may be used to indicate which <damage> and <unclear> elements are part of the same physical phenomenon. The <damage>, <gap>, <del>, <unclear> and <supplied> elements may be closely allied in use. See section 12.3.3.2. Use of the gap, del, damage, unclear, and supplied Elements in Combination for discussion of which element is appropriate for which circumstance. |
Example | <l>The Moving Finger wri<damage agent="water" group="1">es; and</damage> having writ,</l>
<l>Moves <damage agent="water" group="1">
<supplied>on: nor all your</supplied>
</damage> Piety nor Wit</l> |
Content model | <content> <macroRef key="macro.paraContent"/> </content> ⚓ |
Schema Declaration | element damage { tei_att.global.attributes, tei_att.damaged.attributes, tei_att.typed.attributes, tei_macro.paraContent }⚓ |
<damageSpan> (damaged span of text) marks the beginning of a longer sequence of text which is damaged in some way but still legible. [12.3.3.1. Damage, Illegibility, and Supplied Text] | |
Module | transcr — Formal specification |
Attributes |
|
Member of | |
Contained by | |
May contain | Empty element |
Note | Both the beginning and ending of the damaged sequence must be marked: the beginning by the <damageSpan> element, the ending by the target of the spanTo attribute: if no other element available, the <anchor> element may be used for this purpose. The damaged text must be at least partially legible, in order for the encoder to be able to transcribe it. If it is not legible at all, the <damageSpan> element should not be used. Rather, the <gap> or <unclear> element should be employed, with the value of the reason attribute giving the cause. See further sections 12.3.3.1. Damage, Illegibility, and Supplied Text and 12.3.3.2. Use of the gap, del, damage, unclear, and supplied Elements in Combination. |
Example | <p>Paragraph partially damaged. This is the undamaged
portion <damageSpan spanTo="#a34"/>and this the damaged
portion of the paragraph.</p>
<p>This paragraph is entirely damaged.</p>
<p>Paragraph partially damaged; in the middle of this
paragraph the damage ends and the anchor point marks
the start of the <anchor xml:id="a34"/> undamaged part of the text. ...</p> |
Schematron |
<sch:rule context="tei:damageSpan">
<sch:assert test="@spanTo">The @spanTo attribute of <sch:name/> is required.</sch:assert>
</sch:rule> |
Schematron |
<sch:rule context="tei:damageSpan">
<sch:assert test="@spanTo">L'attribut spanTo est requis.</sch:assert>
</sch:rule> |
Content model | <content> <empty/> </content> ⚓ |
Schema Declaration | element damageSpan { tei_att.global.attributes, tei_att.damaged.attributes, tei_att.spanning.attributes, tei_att.typed.attributes, empty }⚓ |
<date> (date) contains a date in any format. [3.6.4. Dates and Times 2.2.4. Publication, Distribution, Licensing, etc. 2.6. The Revision Description 3.12.2.4. Imprint, Size of a Document, and Reprint Information 16.2.3. The Setting Description 14.4. Dates] | |
Module | core — Formal specification |
Attributes |
|
Member of | |
Contained by | |
May contain | |
Example | <date when="1980-02">early February 1980</date> |
Example | Given on the <date when="1977-06-12">Twelfth Day
of June in the Year of Our Lord One Thousand Nine Hundred and Seventy-seven of the Republic
the Two Hundredth and first and of the University the Eighty-Sixth.</date> |
Example | <date when="1990-09">September 1990</date> |
Content model | <content> <alternate minOccurs="0" maxOccurs="unbounded"> <textNode/> <classRef key="model.gLike"/> <classRef key="model.phrase"/> <classRef key="model.global"/> </alternate> </content> ⚓ |
Schema Declaration | element date { tei_att.global.attributes, tei_att.calendarSystem.attributes, tei_att.canonical.attributes, tei_att.cmc.attributes, tei_att.datable.attributes, tei_att.dimensions.attributes, tei_att.editLike.attributes, tei_att.typed.attributes, ( text | tei_model.gLike | tei_model.phrase | tei_model.global )* }⚓ |
<delSpan> (deleted span of text) marks the beginning of a longer sequence of text deleted, marked as deleted, or otherwise signaled as superfluous or spurious by an author, scribe, annotator, or corrector. [12.3.1.4. Additions and Deletions] | |
Module | transcr — Formal specification |
Attributes |
|
Member of | |
Contained by | |
May contain | Empty element |
Note | Both the beginning and ending of the deleted sequence must be marked: the beginning by the <delSpan> element, the ending by the target of the spanTo attribute. The text deleted must be at least partially legible, in order for the encoder to be able to transcribe it. If it is not legible at all, the <delSpan> tag should not be used. Rather, the <gap> tag should be employed to signal that text cannot be transcribed, with the value of the reason attribute giving the cause for the omission from the transcription as deletion. If it is not fully legible, the <unclear> element should be used to signal the areas of text which cannot be read with confidence. See further sections 12.3.1.7. Text Omitted from or Supplied in the Transcription and, for the close association of the <delSpan> tag with the <gap>, <damage>, <unclear> and <supplied> elements, 12.3.3.2. Use of the gap, del, damage, unclear, and supplied Elements in Combination. The <delSpan> tag should not be used for deletions made by editors or encoders. In these cases, either the <corr> tag or the <gap> tag should be used. |
Example | <p>Paragraph partially deleted. This is the undeleted
portion <delSpan spanTo="#a23"/>and this the deleted
portion of the paragraph.</p>
<p>Paragraph deleted together with adjacent material.</p>
<p>Second fully deleted paragraph.</p>
<p>Paragraph partially deleted; in the middle of this
paragraph the deletion ends and the anchor point marks
the resumption <anchor xml:id="a23"/> of the text. ...</p> |
Schematron |
<sch:rule context="tei:delSpan">
<sch:assert test="@spanTo">The @spanTo attribute of <sch:name/> is required.</sch:assert>
</sch:rule> |
Schematron |
<sch:rule context="tei:delSpan">
<sch:assert test="@spanTo">L'attribut spanTo est requis.</sch:assert>
</sch:rule> |
Content model | <content> <empty/> </content> ⚓ |
Schema Declaration | element delSpan { tei_att.global.attributes, tei_att.dimensions.attributes, tei_att.spanning.attributes, tei_att.transcriptional.attributes, tei_att.typed.attributes, empty }⚓ |
<desc> (description) contains a short description of the purpose, function, or use of its parent element, or when the parent is a documentation element, describes or defines the object being documented. [23.4.1. Description of Components] | |||||||||||||
Module | core — Formal specification | ||||||||||||
Attributes |
| ||||||||||||
Member of | |||||||||||||
Contained by | |||||||||||||
May contain | |||||||||||||
Note | When used in a specification element such as <elementSpec>, TEI convention requires that this be expressed as a finite clause, begining with an active verb. | ||||||||||||
Example | Example of a <desc> element inside a documentation element. <dataSpec module="tei"
ident="teidata.point">
<desc versionDate="2010-10-17"
xml:lang="en">defines the data type used to express a point in cartesian space.</desc>
<content>
<dataRef name="token"
restriction="(-?[0-9]+(\.[0-9]+)?,-?[0-9]+(\.[0-9]+)?)"/>
</content>
<!-- ... -->
</dataSpec> | ||||||||||||
Example | Example of a <desc> element in a non-documentation element. <place xml:id="KERG2">
<placeName>Kerguelen Islands</placeName>
<!-- ... -->
<terrain>
<desc>antarctic tundra</desc>
</terrain>
<!-- ... -->
</place> | ||||||||||||
Schematron | A <desc> with a type of deprecationInfo should only occur when its parent element is being deprecated. Furthermore, it should always occur in an element that is being deprecated when <desc> is a valid child of that element.
<sch:rule context="tei:desc[ @type eq 'deprecationInfo']">
<sch:assert test="../@validUntil">Information about a
deprecation should only be present in a specification element
that is being deprecated: that is, only an element that has a
@validUntil attribute should have a child <desc
type="deprecationInfo">.</sch:assert>
</sch:rule> | ||||||||||||
Content model | <content> <macroRef key="macro.limitedContent"/> </content> ⚓ | ||||||||||||
Schema Declaration | element desc { tei_att.global.attributes, tei_att.cmc.attributes, tei_att.typed.attribute.subtype, attribute type { "deprecationInfo" }?, tei_macro.limitedContent }⚓ |
<div> (text division) contains a subdivision of the front, body, or back of a text. [4.1. Divisions of the Body] | |||||||
Module | textstructure — Formal specification | ||||||
Attributes |
| ||||||
Member of | |||||||
Contained by | |||||||
May contain | |||||||
Example | <body>
<div type="part">
<head>Fallacies of Authority</head>
<p>The subject of which is Authority in various shapes, and the object, to repress all
exercise of the reasoning faculty.</p>
<div n="1" type="chapter">
<head>The Nature of Authority</head>
<p>With reference to any proposed measures having for their object the greatest
happiness of the greatest number [...]</p>
<div n="1.1" type="section">
<head>Analysis of Authority</head>
<p>What on any given occasion is the legitimate weight or influence to be attached to
authority [...] </p>
</div>
<div n="1.2" type="section">
<head>Appeal to Authority, in What Cases Fallacious.</head>
<p>Reference to authority is open to the charge of fallacy when [...] </p>
</div>
</div>
</div>
</body> | ||||||
Schematron |
<sch:rule context="tei:l//tei:div">
<sch:assert test="ancestor::tei:floatingText"> Abstract model violation: Metrical lines may not contain higher-level structural elements such as div, unless div is a descendant of floatingText.
</sch:assert>
</sch:rule> | ||||||
Schematron |
<sch:rule context="tei:div">
<sch:report test="(ancestor::tei:p or ancestor::tei:ab) and not(ancestor::tei:floatingText)"> Abstract model violation: p and ab may not contain higher-level structural elements such as div, unless div is a descendant of floatingText.
</sch:report>
</sch:rule> | ||||||
Content model | <content> <sequence minOccurs="1" maxOccurs="1"> <alternate minOccurs="0" maxOccurs="unbounded"> <classRef key="model.divTop"/> <classRef key="model.global"/> </alternate> <sequence minOccurs="0" maxOccurs="1"> <alternate minOccurs="1" maxOccurs="1"> <sequence minOccurs="1" maxOccurs="unbounded"> <alternate minOccurs="1" maxOccurs="1"> <classRef key="model.divLike"/> <classRef key="model.divGenLike"/> </alternate> <classRef key="model.global" minOccurs="0" maxOccurs="unbounded"/> </sequence> <sequence minOccurs="1" maxOccurs="1"> <sequence minOccurs="1" maxOccurs="unbounded"> <alternate minOccurs="1" maxOccurs="1"> <elementRef key="schemaSpec"/> <classRef key="model.common"/> </alternate> <classRef key="model.global" minOccurs="0" maxOccurs="unbounded"/> </sequence> <sequence minOccurs="0" maxOccurs="unbounded"> <alternate minOccurs="1" maxOccurs="1"> <classRef key="model.divLike"/> <classRef key="model.divGenLike"/> </alternate> <classRef key="model.global" minOccurs="0" maxOccurs="unbounded"/> </sequence> </sequence> </alternate> <sequence minOccurs="0" maxOccurs="unbounded"> <classRef key="model.divBottom"/> <classRef key="model.global" minOccurs="0" maxOccurs="unbounded"/> </sequence> </sequence> </sequence> </content> ⚓ | ||||||
Schema Declaration | element div { tei_att.global.attributes, tei_att.declaring.attributes, tei_att.divLike.attributes, tei_att.placement.attributes, tei_att.typed.attribute.subtype, tei_att.written.attributes, attribute type { text }?, ( ( tei_model.divTop | tei_model.global )*, ( ( ( ( ( ( tei_model.divLike | tei_model.divGenLike ), tei_model.global* )+ ) | ( ( ( ( schemaSpec | tei_model.common ), tei_model.global* )+ ), ( ( ( tei_model.divLike | tei_model.divGenLike ), tei_model.global* )* ) ) ), ( ( tei_model.divBottom, tei_model.global* )* ) )? ) ) }⚓ |
<docDate> (document date) contains the date of a document, as given on a title page or in a dateline. [4.6. Title Pages] | |
Module | textstructure — Formal specification |
Attributes |
|
Member of | |
Contained by | |
May contain | |
Note | Cf. the general <date> element in the core tag set. This specialized element is provided for convenience in marking and processing the date of the documents, since it is likely to require specialized handling for many applications. It should be used only for the date of the entire document, not for any subset or part of it. |
Example | <docImprint>Oxford, Clarendon Press, <docDate>1987</docDate>
</docImprint> |
Content model | <content> <macroRef key="macro.phraseSeq"/> </content> ⚓ |
Schema Declaration | element docDate { tei_att.global.attributes, tei_att.calendarSystem.attributes, tei_att.cmc.attributes, tei_att.datable.attributes, tei_macro.phraseSeq }⚓ |
<edition> (edition) describes the particularities of one edition of a text. [2.2.2. The Edition Statement] | |
Module | header — Formal specification |
Attributes |
|
Member of | |
Contained by | core: bibl header: editionStmt |
May contain | |
Example | <edition>First edition <date>Oct 1990</date>
</edition>
<edition n="S2">Students' edition</edition> |
Content model | <content> <macroRef key="macro.phraseSeq"/> </content> ⚓ |
Schema Declaration | element edition { tei_att.global.attributes, tei_macro.phraseSeq }⚓ |
<editionStmt> (edition statement) groups information relating to one edition of a text. [2.2.2. The Edition Statement 2.2. The File Description] | |
Module | header — Formal specification |
Attributes |
|
Contained by | header: fileDesc |
May contain | |
Example | <editionStmt>
<edition n="S2">Students' edition</edition>
<respStmt>
<resp>Adapted by </resp>
<name>Elizabeth Kirk</name>
</respStmt>
</editionStmt> |
Example | <editionStmt>
<p>First edition, <date>Michaelmas Term, 1991.</date>
</p>
</editionStmt> |
Content model | <content> <alternate> <classRef key="model.pLike" minOccurs="1" maxOccurs="unbounded"/> <sequence> <elementRef key="edition"/> <classRef key="model.respLike" minOccurs="0" maxOccurs="unbounded"/> </sequence> </alternate> </content> ⚓ |
Schema Declaration | element editionStmt { tei_att.global.attributes, ( tei_model.pLike+ | ( tei_edition, tei_model.respLike* ) ) }⚓ |
<editorialDecl> (editorial practice declaration) provides details of editorial principles and practices applied during the encoding of a text. [2.3.3. The Editorial Practices Declaration 2.3. The Encoding Description 16.3.2. Declarable Elements] | |
Module | header — Formal specification |
Attributes |
|
Member of | |
Contained by | header: encodingDesc |
May contain | core: p |
Example | <editorialDecl>
<normalization>
<p>All words converted to Modern American spelling using
Websters 9th Collegiate dictionary
</p>
</normalization>
<quotation marks="all">
<p>All opening quotation marks converted to “ all closing
quotation marks converted to &cdq;.</p>
</quotation>
</editorialDecl> |
Schematron |
<sch:pattern is-a="declarable">
<sch:param name="tde"
value="tei:editorialDecl"/>
</sch:pattern> |
Content model | <content> <alternate minOccurs="1" maxOccurs="unbounded"> <classRef key="model.pLike"/> <classRef key="model.editorialDeclPart"/> </alternate> </content> ⚓ |
Schema Declaration | element editorialDecl { tei_att.global.attributes, tei_att.declarable.attributes, ( tei_model.pLike | tei_model.editorialDeclPart )+ }⚓ |
<email> (electronic mail address) contains an email address identifying a location to which email messages can be delivered. [3.6.2. Addresses] | |
Module | core — Formal specification |
Attributes |
|
Member of | |
Contained by | |
May contain | |
Note | The format of a modern Internet email address is defined in RFC 2822 |
Example | <email>membership@tei-c.org</email> |
Content model | <content> <macroRef key="macro.phraseSeq"/> </content> ⚓ |
Schema Declaration | element email { tei_att.global.attributes, tei_att.cmc.attributes, tei_macro.phraseSeq }⚓ |
<encodingDesc> (encoding description) documents the relationship between an electronic text and the source or sources from which it was derived. [2.3. The Encoding Description 2.1.1. The TEI Header and Its Components] | |
Module | header — Formal specification |
Attributes |
|
Member of | |
Contained by | header: teiHeader |
May contain | |
Example | <encodingDesc>
<p>Basic encoding, capturing lexical information only. All
hyphenation, punctuation, and variant spellings normalized. No
formatting or layout information preserved.</p>
</encodingDesc> |
Content model | <content> <alternate minOccurs="1" maxOccurs="unbounded"> <classRef key="model.encodingDescPart"/> <classRef key="model.pLike"/> </alternate> </content> ⚓ |
Schema Declaration | element encodingDesc { tei_att.global.attributes, ( tei_model.encodingDescPart | tei_model.pLike )+ }⚓ |
<ex> (editorial expansion) contains a sequence of letters added by an editor or transcriber when expanding an abbreviation. [12.3.1.2. Abbreviation and Expansion] | |
Module | transcr — Formal specification |
Attributes |
|
Member of | |
Contained by | |
May contain | gaiji: g character data |
Example | The address is Southmoor <choice>
<expan>R<ex>oa</ex>d</expan>
<abbr>Rd</abbr>
</choice> |
Content model | <content> <macroRef key="macro.xtext"/> </content> ⚓ |
Schema Declaration | element ex { tei_att.global.attributes, tei_att.dimensions.attributes, tei_att.editLike.attributes, tei_macro.xtext }⚓ |
<extent> (extent) describes the approximate size of a text stored on some carrier medium or of some other object, digital or non-digital, specified in any convenient units. [2.2.3. Type and Extent of File 2.2. The File Description 3.12.2.4. Imprint, Size of a Document, and Reprint Information 11.7.1. Object Description] | |
Module | header — Formal specification |
Attributes |
|
Member of | |
Contained by | |
May contain | |
Example | <extent>3200 sentences</extent>
<extent>between 10 and 20 Mb</extent>
<extent>ten 3.5 inch high density diskettes</extent> |
Example | The <measure> element may be used to supply normalized or machine tractable versions of the size or sizes concerned. <extent>
<measure unit="MiB" quantity="4.2">About four megabytes</measure>
<measure unit="pages" quantity="245">245 pages of source
material</measure>
</extent> |
Content model | <content> <macroRef key="macro.phraseSeq"/> </content> ⚓ |
Schema Declaration | element extent { tei_att.global.attributes, tei_macro.phraseSeq }⚓ |
<facsimile> contains a representation of some written source in the form of a set of images rather than as transcribed or encoded text. [12.1. Digital Facsimiles] | |
Module | transcr — Formal specification |
Attributes |
|
Member of | |
Contained by | |
May contain | |
Example | <facsimile>
<graphic url="page1.png"/>
<surface>
<graphic url="page2-highRes.png"/>
<graphic url="page2-lowRes.png"/>
</surface>
<graphic url="page3.png"/>
<graphic url="page4.png"/>
</facsimile> |
Example | <facsimile>
<surface ulx="0" uly="0" lrx="200" lry="300">
<graphic url="Bovelles-49r.png"/>
</surface>
</facsimile> |
Schematron |
<sch:rule context="tei:facsimile//tei:line | tei:facsimile//tei:zone">
<sch:report test="child::text()[ normalize-space(.) ne '']"> A facsimile element represents a text with images, thus
transcribed text should not be present within it.
</sch:report>
</sch:rule> |
Content model | <content> <sequence> <elementRef key="front" minOccurs="0"/> <alternate> <alternate minOccurs="1" maxOccurs="unbounded"> <classRef key="model.graphicLike"/> <elementRef key="surface"/> <elementRef key="surfaceGrp"/> </alternate> <elementRef key="facsimile" minOccurs="1" maxOccurs="unbounded"/> </alternate> <elementRef key="back" minOccurs="0"/> </sequence> </content> ⚓ |
Schema Declaration | element facsimile { tei_att.global.attributes, tei_att.declaring.attributes, ( tei_front?, ( ( tei_model.graphicLike | tei_surface | tei_surfaceGrp )+ | tei_facsimile+ ), tei_back? ) }⚓ |
<fileDesc> (file description) contains a full bibliographic description of an electronic file. [2.2. The File Description 2.1.1. The TEI Header and Its Components] | |
Module | header — Formal specification |
Attributes |
|
Contained by | header: teiHeader |
May contain | |
Note | The major source of information for those seeking to create a catalogue entry or bibliographic citation for an electronic file. As such, it provides a title and statements of responsibility together with details of the publication or distribution of the file, of any series to which it belongs, and detailed bibliographic notes for matters not addressed elsewhere in the header. It also contains a full bibliographic description for the source or sources from which the electronic text was derived. |
Example | <fileDesc>
<titleStmt>
<title>The shortest possible TEI document</title>
</titleStmt>
<publicationStmt>
<p>Distributed as part of TEI P5</p>
</publicationStmt>
<sourceDesc>
<p>No print source exists: this is an original digital text</p>
</sourceDesc>
</fileDesc> |
Content model | <content> <sequence> <sequence> <elementRef key="titleStmt"/> <elementRef key="editionStmt" minOccurs="0"/> <elementRef key="extent" minOccurs="0"/> <elementRef key="publicationStmt"/> <elementRef key="seriesStmt" minOccurs="0" maxOccurs="unbounded"/> <elementRef key="notesStmt" minOccurs="0"/> </sequence> <elementRef key="sourceDesc" minOccurs="1" maxOccurs="unbounded"/> </sequence> </content> ⚓ |
Schema Declaration | element fileDesc { tei_att.global.attributes, ( ( tei_titleStmt, tei_editionStmt?, tei_extent?, tei_publicationStmt, seriesStmt*, notesStmt? ), tei_sourceDesc+ ) }⚓ |
<front> (front matter) contains any prefatory matter (headers, abstracts, title page, prefaces, dedications, etc.) found at the start of a document, before the main body. [4.6. Title Pages 4. Default Text Structure] | |
Module | textstructure — Formal specification |
Attributes |
|
Contained by | |
May contain | |
Note | Because cultural conventions differ as to which elements are grouped as front matter and which as back matter, the content models for the <front> and <back> elements are identical. |
Example | <front>
<epigraph>
<quote>Nam Sibyllam quidem Cumis ego ipse oculis meis vidi in ampulla
pendere, et cum illi pueri dicerent: <q xml:lang="grc">Σίβυλλα τί
θέλεις</q>; respondebat illa: <q xml:lang="grc">ὰποθανεῖν θέλω.</q>
</quote>
</epigraph>
<div type="dedication">
<p>For Ezra Pound <q xml:lang="it">il miglior fabbro.</q>
</p>
</div>
</front> |
Example | <front>
<div type="dedication">
<p>To our three selves</p>
</div>
<div type="preface">
<head>Author's Note</head>
<p>All the characters in this book are purely imaginary, and if the
author has used names that may suggest a reference to living persons
she has done so inadvertently. ...</p>
</div>
</front> |
Example | <front>
<div type="abstract">
<div>
<head> BACKGROUND:</head>
<p>Food insecurity can put children at greater risk of obesity because
of altered food choices and nonuniform consumption patterns.</p>
</div>
<div>
<head> OBJECTIVE:</head>
<p>We examined the association between obesity and both child-level
food insecurity and personal food insecurity in US children.</p>
</div>
<div>
<head> DESIGN:</head>
<p>Data from 9,701 participants in the National Health and Nutrition
Examination Survey, 2001-2010, aged 2 to 11 years were analyzed.
Child-level food insecurity was assessed with the US Department of
Agriculture's Food Security Survey Module based on eight
child-specific questions. Personal food insecurity was assessed with
five additional questions. Obesity was defined, using physical
measurements, as body mass index (calculated as kg/m2) greater than
or equal to the age- and sex-specific 95th percentile of the Centers
for Disease Control and Prevention growth charts. Logistic
regressions adjusted for sex, race/ethnic group, poverty level, and
survey year were conducted to describe associations between obesity
and food insecurity.</p>
</div>
<div>
<head> RESULTS:</head>
<p>Obesity was significantly associated with personal food insecurity
for children aged 6 to 11 years (odds ratio=1.81; 95% CI 1.33 to
2.48), but not in children aged 2 to 5 years (odds ratio=0.88; 95%
CI 0.51 to 1.51). Child-level food insecurity was not associated
with obesity among 2- to 5-year-olds or 6- to 11-year-olds.</p>
</div>
<div>
<head> CONCLUSIONS:</head>
<p>Personal food insecurity is associated with an increased risk of
obesity only in children aged 6 to 11 years. Personal
food-insecurity measures may give different results than aggregate
food-insecurity measures in children.</p>
</div>
</div>
</front> |
Content model | <content> <sequence> <alternate minOccurs="0" maxOccurs="unbounded"> <classRef key="model.frontPart"/> <classRef key="model.pLike"/> <classRef key="model.pLike.front"/> <classRef key="model.global"/> </alternate> <sequence minOccurs="0"> <alternate> <sequence> <classRef key="model.div1Like"/> <alternate minOccurs="0" maxOccurs="unbounded"> <classRef key="model.div1Like"/> <classRef key="model.frontPart"/> <classRef key="model.global"/> </alternate> </sequence> <sequence> <classRef key="model.divLike"/> <alternate minOccurs="0" maxOccurs="unbounded"> <classRef key="model.divLike"/> <classRef key="model.frontPart"/> <classRef key="model.global"/> </alternate> </sequence> </alternate> <sequence minOccurs="0"> <classRef key="model.divBottom"/> <alternate minOccurs="0" maxOccurs="unbounded"> <classRef key="model.divBottom"/> <classRef key="model.global"/> </alternate> </sequence> </sequence> </sequence> </content> ⚓ |
Schema Declaration | element front { tei_att.global.attributes, tei_att.declaring.attributes, ( ( tei_model.frontPart | tei_model.pLike | tei_model.pLike.front | tei_model.global )*, ( ( ( ( tei_model.div1Like, ( tei_model.div1Like | tei_model.frontPart | tei_model.global )* ) | ( tei_model.divLike, ( tei_model.divLike | tei_model.frontPart | tei_model.global )* ) ), ( ( tei_model.divBottom, ( tei_model.divBottom | tei_model.global )* )? ) )? ) ) }⚓ |
<funder> (funding body) specifies the name of an individual, institution, or organization responsible for the funding of a project or text. [2.2.1. The Title Statement] | |
Module | header — Formal specification |
Attributes |
|
Member of | |
Contained by | core: bibl header: editionStmt titleStmt |
May contain | |
Note | Funders provide financial support for a project; they are distinct from sponsors (see element <sponsor>), who provide intellectual support and authority. |
Example | <funder>The National Endowment for the Humanities, an independent federal agency</funder>
<funder>Directorate General XIII of the Commission of the European Communities</funder>
<funder>The Andrew W. Mellon Foundation</funder>
<funder>The Social Sciences and Humanities Research Council of Canada</funder> |
Content model | <content> <macroRef key="macro.phraseSeq.limited"/> </content> ⚓ |
Schema Declaration | element funder { tei_att.global.attributes, tei_att.canonical.attributes, tei_att.datable.attributes, tei_macro.phraseSeq.limited }⚓ |
<fw> (forme work) contains a running head (e.g. a header, footer), catchword, or similar material appearing on the current page. [12.6. Headers, Footers, and Similar Matter] | |||||||||||
Module | transcr — Formal specification | ||||||||||
Attributes |
| ||||||||||
Member of | |||||||||||
Contained by | |||||||||||
May contain | |||||||||||
Note | Where running heads are consistent throughout a chapter or section, it is usually more convenient to relate them to the chapter or section, e.g. by use of the rend attribute. The <fw> element is intended for cases where the running head changes from page to page, or where details of page layout and the internal structure of the running heads are of paramount importance. | ||||||||||
Example | <fw type="sig" place="bottom">C3</fw> | ||||||||||
Content model | <content> <macroRef key="macro.phraseSeq"/> </content> ⚓ | ||||||||||
Schema Declaration | element fw { tei_att.global.attributes, tei_att.placement.attributes, tei_att.typed.attribute.subtype, tei_att.written.attributes, attribute type { text }?, tei_macro.phraseSeq }⚓ |
<g> (character or glyph) represents a glyph, or a non-standard character. [5. Characters, Glyphs, and Writing Modes] | |||||||
Module | gaiji — Formal specification | ||||||
Attributes |
| ||||||
Member of | |||||||
Contained by | |||||||
May contain | Character data only | ||||||
Note | The name g is short for gaiji, which is the Japanese term for a non-standardized character or glyph. | ||||||
Example | <g ref="#ctlig">ct</g> This example points to a <glyph> element with the identifier ctlig like the following: <glyph xml:id="ctlig">
<!-- here we describe the particular ct-ligature intended -->
</glyph> | ||||||
Example | <g ref="#per-glyph">per</g> The medieval brevigraph per could similarly be considered as an individual glyph, defined in a <glyph> element with the identifier per-glyph as follows: <glyph xml:id="per-glyph">
<!-- ... -->
</glyph> | ||||||
Content model | <content> <textNode/> </content> ⚓ | ||||||
Schema Declaration | element g { tei_att.global.attributes, tei_att.typed.attributes, attribute ref { text }?, text }⚓ |
<gap> (gap) indicates a point where material has been omitted in a transcription, whether for editorial reasons described in the TEI header, as part of sampling practice, or because the material is illegible, invisible, or inaudible. [3.5.3. Additions, Deletions, and Omissions] | |||||||||||||||||
Module | core — Formal specification | ||||||||||||||||
Attributes |
| ||||||||||||||||
Member of | |||||||||||||||||
Contained by | |||||||||||||||||
May contain | core: desc | ||||||||||||||||
Note | The <gap>, <unclear>, and <del> core tag elements may be closely allied in use with the <damage> and <supplied> elements, available when using the additional tagset for transcription of primary sources. See section 12.3.3.2. Use of the gap, del, damage, unclear, and supplied Elements in Combination for discussion of which element is appropriate for which circumstance. The <gap> tag simply signals the editors decision to omit or inability to transcribe a span of text. Other information, such as the interpretation that text was deliberately erased or covered, should be indicated using the relevant tags, such as <del> in the case of deliberate deletion. | ||||||||||||||||
Example | <gap quantity="4" unit="chars"
reason="illegible"/> | ||||||||||||||||
Example | <gap quantity="1" unit="essay"
reason="sampling"/> | ||||||||||||||||
Example | <del>
<gap atLeast="4" atMost="8" unit="chars"
reason="illegible"/>
</del> | ||||||||||||||||
Example | <gap extent="several lines" reason="lost"/> | ||||||||||||||||
Content model | <content> <alternate minOccurs="0" maxOccurs="unbounded"> <classRef key="model.descLike"/> <classRef key="model.certLike"/> </alternate> </content> ⚓ | ||||||||||||||||
Schema Declaration | element gap { tei_att.global.attributes, tei_att.cmc.attributes, tei_att.dimensions.attributes, tei_att.editLike.attributes, tei_att.timed.attributes, attribute reason { list { ( "cancelled" | "deleted" | "editorial" | "illegible" | "inaudible" | "irrelevant" | "sampling" )+ } }?, attribute agent { text }?, ( tei_model.descLike | tei_model.certLike )* }⚓ |
<glyph> (character glyph) provides descriptive information about a character glyph. [5.2. Markup Constructs for Representation of Characters and Glyphs] | |
Module | gaiji — Formal specification |
Attributes |
|
Contained by | gaiji: charDecl |
May contain | gaiji: localProp mapping unicodeProp unihanProp |
Example | <glyph xml:id="rstroke">
<localProp name="Name"
value="LATIN SMALL LETTER R WITH A FUNNY STROKE"/>
<localProp name="entity" value="rstroke"/>
<figure>
<graphic url="glyph-rstroke.png"/>
</figure>
</glyph> |
Content model | <content> <alternate minOccurs="0" maxOccurs="unbounded"> <elementRef key="unicodeProp"/> <elementRef key="unihanProp"/> <elementRef key="localProp"/> <elementRef key="mapping"/> <elementRef key="figure"/> <classRef key="model.graphicLike"/> <classRef key="model.noteLike"/> <classRef key="model.descLike"/> </alternate> </content> ⚓ |
Schema Declaration | element glyph { tei_att.global.attributes, ( tei_unicodeProp | tei_unihanProp | tei_localProp | tei_mapping | figure | tei_model.graphicLike | tei_model.noteLike | tei_model.descLike )* }⚓ |
<graphic> (graphic) indicates the location of a graphic or illustration, either forming part of a text, or providing an image of it. [3.10. Graphics and Other Non-textual Components 12.1. Digital Facsimiles] | |
Module | core — Formal specification |
Attributes |
|
Member of | |
Contained by | |
May contain | core: desc |
Note | The mimeType attribute should be used to supply the MIME media type of the image specified by the url attribute. Within the body of a text, a <graphic> element indicates the presence of a graphic component in the source itself. Within the context of a <facsimile> or <sourceDoc> element, however, a <graphic> element provides an additional digital representation of some part of the source being encoded. |
Example | <figure>
<graphic url="fig1.png"/>
<head>Figure One: The View from the Bridge</head>
<figDesc>A Whistleresque view showing four or five sailing boats in the foreground, and a
series of buoys strung out between them.</figDesc>
</figure> |
Example | <facsimile>
<surfaceGrp n="leaf1">
<surface>
<graphic url="page1.png"/>
</surface>
<surface>
<graphic url="page2-highRes.png"/>
<graphic url="page2-lowRes.png"/>
</surface>
</surfaceGrp>
</facsimile> |
Example | <facsimile>
<surfaceGrp n="leaf1" xml:id="spi001">
<surface xml:id="spi001r">
<graphic type="normal"
subtype="thumbnail" url="spi/thumb/001r.jpg"/>
<graphic type="normal" subtype="low-res"
url="spi/normal/lowRes/001r.jpg"/>
<graphic type="normal"
subtype="high-res" url="spi/normal/highRes/001r.jpg"/>
<graphic type="high-contrast"
subtype="low-res" url="spi/contrast/lowRes/001r.jpg"/>
<graphic type="high-contrast"
subtype="high-res" url="spi/contrast/highRes/001r.jpg"/>
</surface>
<surface xml:id="spi001v">
<graphic type="normal"
subtype="thumbnail" url="spi/thumb/001v.jpg"/>
<graphic type="normal" subtype="low-res"
url="spi/normal/lowRes/001v.jpg"/>
<graphic type="normal"
subtype="high-res" url="spi/normal/highRes/001v.jpg"/>
<graphic type="high-contrast"
subtype="low-res" url="spi/contrast/lowRes/001v.jpg"/>
<graphic type="high-contrast"
subtype="high-res" url="spi/contrast/highRes/001v.jpg"/>
<zone xml:id="spi001v_detail01">
<graphic type="normal"
subtype="thumbnail" url="spi/thumb/001v-detail01.jpg"/>
<graphic type="normal"
subtype="low-res"
url="spi/normal/lowRes/001v-detail01.jpg"/>
<graphic type="normal"
subtype="high-res"
url="spi/normal/highRes/001v-detail01.jpg"/>
<graphic type="high-contrast"
subtype="low-res"
url="spi/contrast/lowRes/001v-detail01.jpg"/>
<graphic type="high-contrast"
subtype="high-res"
url="spi/contrast/highRes/001v-detail01.jpg"/>
</zone>
</surface>
</surfaceGrp>
</facsimile> |
Content model | <content> <classRef key="model.descLike" minOccurs="0" maxOccurs="unbounded"/> </content> ⚓ |
Schema Declaration | element graphic { tei_att.global.attributes, tei_att.cmc.attributes, tei_att.declaring.attributes, tei_att.media.attributes, tei_att.resourced.attributes, tei_att.typed.attributes, tei_model.descLike* }⚓ |
<handNotes> contains one or more <handNote> elements documenting the different hands identified within the source texts. [12.3.2.1. Document Hands] | |
Module | transcr — Formal specification |
Attributes |
|
Member of | |
Contained by | header: profileDesc |
May contain | Empty element |
Example | <handNotes>
<handNote xml:id="H1" script="copperplate"
medium="brown-ink">Carefully written with regular descenders</handNote>
<handNote xml:id="H2" script="print"
medium="pencil">Unschooled scrawl</handNote>
</handNotes> |
Content model | <content> <elementRef key="handNote" minOccurs="1" maxOccurs="unbounded"/> </content> ⚓ |
Schema Declaration | element handNotes { tei_att.global.attributes, handNote+ }⚓ |
<handShift> (handwriting shift) marks the beginning of a sequence of text written in a new hand, or the beginning of a scribal stint. [12.3.2.1. Document Hands] | |||||||||
Module | transcr — Formal specification | ||||||||
Attributes |
| ||||||||
Member of | |||||||||
Contained by | |||||||||
May contain | Empty element | ||||||||
Note | The <handShift> element may be used either to denote a shift in the document hand (as from one scribe to another, on one writing style to another). Or, it may indicate a shift within a document hand, as a change of writing style, character or ink. Like other milestone elements, it should appear at the point of transition from some other state to the state which it describes. | ||||||||
Example | <l>When wolde the cat dwelle in his ynne</l>
<handShift medium="greenish-ink"/>
<l>And if the cattes skynne be slyk <handShift medium="black-ink"/> and gaye</l> | ||||||||
Content model | <content> <empty/> </content> ⚓ | ||||||||
Schema Declaration | element handShift { tei_att.global.attributes, tei_att.handFeatures.attributes, attribute new { text }?, empty }⚓ |
<head> (heading) contains any type of heading, for example the title of a section, or the heading of a list, glossary, manuscript description, etc. [4.2.1. Headings and Trailers] | |
Module | core — Formal specification |
Attributes |
|
Member of | |
Contained by | |
May contain | |
Note | The <head> element is used for headings at all levels; software which treats (e.g.) chapter headings, section headings, and list titles differently must determine the proper processing of a <head> element based on its structural position. A <head> occurring as the first element of a list is the title of that list; one occurring as the first element of a <div1> is the title of that chapter or section. |
Example | The most common use for the <head> element is to mark the headings of sections. In older writings, the headings or incipits may be rather longer than usual in modern works. If a section has an explicit ending as well as a heading, it should be marked as a <trailer>, as in this example: <div1 n="I" type="book">
<head>In the name of Christ here begins the first book of the ecclesiastical history of
Georgius Florentinus, known as Gregory, Bishop of Tours.</head>
<div2 type="section">
<head>In the name of Christ here begins Book I of the history.</head>
<p>Proposing as I do ...</p>
<p>From the Passion of our Lord until the death of Saint Martin four hundred and twelve
years passed.</p>
<trailer>Here ends the first Book, which covers five thousand, five hundred and ninety-six
years from the beginning of the world down to the death of Saint Martin.</trailer>
</div2>
</div1> |
Example | When headings are not inline with the running text (see e.g. the heading "Secunda conclusio") they might however be encoded as if. The actual placement in the source document can be captured with the place attribute. <div type="subsection">
<head place="margin">Secunda conclusio</head>
<p>
<lb n="1251"/>
<hi rend="large">Potencia: habitus: et actus: recipiunt speciem ab obiectis<supplied>.</supplied>
</hi>
<lb n="1252"/>Probatur sic. Omne importans necessariam habitudinem ad proprium
[...]
</p>
</div> |
Example | The <head> element is also used to mark headings of other units, such as lists: With a few exceptions, connectives are equally
useful in all kinds of discourse: description, narration, exposition, argument. <list rend="bulleted">
<head>Connectives</head>
<item>above</item>
<item>accordingly</item>
<item>across from</item>
<item>adjacent to</item>
<item>again</item>
<item>
<!-- ... -->
</item>
</list> |
Content model | <content> <alternate minOccurs="0" maxOccurs="unbounded"> <textNode/> <elementRef key="lg"/> <classRef key="model.gLike"/> <classRef key="model.phrase"/> <classRef key="model.inter"/> <classRef key="model.lLike"/> <classRef key="model.global"/> </alternate> </content> ⚓ |
Schema Declaration | element head { tei_att.global.attributes, tei_att.cmc.attributes, tei_att.placement.attributes, tei_att.typed.attributes, tei_att.written.attributes, ( text | lg | tei_model.gLike | tei_model.phrase | tei_model.inter | tei_model.lLike | tei_model.global )* }⚓ |
<hyphenation> (hyphenation) summarizes the way in which hyphenation in a source text has been treated in an encoded version of it. [2.3.3. The Editorial Practices Declaration 16.3.2. Declarable Elements] | |||||||||
Module | header — Formal specification | ||||||||
Attributes |
| ||||||||
Member of | |||||||||
Contained by | header: editorialDecl | ||||||||
May contain | core: p | ||||||||
Example | <hyphenation eol="some">
<p>End-of-line hyphenation silently removed where appropriate</p>
</hyphenation> | ||||||||
Schematron |
<sch:pattern is-a="declarable">
<sch:param name="tde"
value="tei:hyphenation"/>
</sch:pattern> | ||||||||
Content model | <content> <classRef key="model.pLike" minOccurs="1" maxOccurs="unbounded"/> </content> ⚓ | ||||||||
Schema Declaration | element hyphenation { tei_att.global.attributes, tei_att.declarable.attributes, attribute eol { "all" | "some" | "hard" | "none" }?, tei_model.pLike+ }⚓ |
<idno> (identifier) supplies any form of identifier used to identify some object, such as a bibliographic item, a person, a title, an organization, etc. in a standardized way. [14.3.1. Basic Principles 2.2.4. Publication, Distribution, Licensing, etc. 2.2.5. The Series Statement 3.12.2.4. Imprint, Size of a Document, and Reprint Information] | |||||||||||
Module | header — Formal specification | ||||||||||
Attributes |
| ||||||||||
Member of | |||||||||||
Contained by | |||||||||||
May contain | |||||||||||
Note | <idno> should be used for labels which identify an object or concept in a formal cataloguing system such as a database or an RDF store, or in a distributed system such as the World Wide Web. Some suggested values for type on <idno> are ISBN, ISSN, DOI, and URI. | ||||||||||
Example | <idno type="ISBN">978-1-906964-22-1</idno>
<idno type="ISSN">0143-3385</idno>
<idno type="DOI">10.1000/123</idno>
<idno type="URI">http://www.worldcat.org/oclc/185922478</idno>
<idno type="URI">http://authority.nzetc.org/463/</idno>
<idno type="LT">Thomason Tract E.537(17)</idno>
<idno type="Wing">C695</idno>
<idno type="oldCat">
<g ref="#sym"/>345
</idno> In the last case, the identifier includes a non-Unicode character which is defined elsewhere by means of a <glyph> or <char> element referenced here as #sym . | ||||||||||
Content model | <content> <alternate minOccurs="0" maxOccurs="unbounded"> <textNode/> <classRef key="model.gLike"/> <elementRef key="idno"/> </alternate> </content> ⚓ | ||||||||||
Schema Declaration | element idno { tei_att.global.attributes, tei_att.cmc.attributes, tei_att.datable.attributes, tei_att.sortable.attributes, tei_att.typed.attribute.subtype, attribute type { "ISBN" | "ISSN" | "DOI" | "URI" | "VIAF" | "ESTC" | "OCLC" }?, ( text | tei_model.gLike | tei_idno )* }⚓ |
<label> (label) contains any label or heading used to identify part of a text, typically but not exclusively in a list or glossary. [3.8. Lists] | |
Module | core — Formal specification |
Attributes |
|
Member of | |
Contained by | |
May contain | |
Example | Labels are commonly used for the headwords in glossary lists; note the use of the global xml:lang attribute to set the default language of the glossary list to Middle English, and identify the glosses and headings as modern English or Latin: <list type="gloss" xml:lang="enm">
<head xml:lang="en">Vocabulary</head>
<headLabel xml:lang="en">Middle English</headLabel>
<headItem xml:lang="en">New English</headItem>
<label>nu</label>
<item xml:lang="en">now</item>
<label>lhude</label>
<item xml:lang="en">loudly</item>
<label>bloweth</label>
<item xml:lang="en">blooms</item>
<label>med</label>
<item xml:lang="en">meadow</item>
<label>wude</label>
<item xml:lang="en">wood</item>
<label>awe</label>
<item xml:lang="en">ewe</item>
<label>lhouth</label>
<item xml:lang="en">lows</item>
<label>sterteth</label>
<item xml:lang="en">bounds, frisks (cf. <cit>
<ref>Chaucer, K.T.644</ref>
<quote>a courser, <term>sterting</term>as the fyr</quote>
</cit>
</item>
<label>verteth</label>
<item xml:lang="la">pedit</item>
<label>murie</label>
<item xml:lang="en">merrily</item>
<label>swik</label>
<item xml:lang="en">cease</item>
<label>naver</label>
<item xml:lang="en">never</item>
</list> |
Example | Labels may also be used to record explicitly the numbers or letters which mark list items in ordered lists, as in this extract from Gibbon's Autobiography. In this usage the <label> element is synonymous with the n attribute on the <item> element: I will add two facts, which have seldom occurred
in the composition of six, or at least of five quartos. <list rend="runon" type="ordered">
<label>(1)</label>
<item>My first rough manuscript, without any intermediate copy, has been sent to the press.</item>
<label>(2) </label>
<item>Not a sheet has been seen by any human eyes, excepting those of the author and the
printer: the faults and the merits are exclusively my own.</item>
</list> |
Example | Labels may also be used for other structured list items, as in this extract from the journal of Edward Gibbon: <list type="gloss">
<label>March 1757.</label>
<item>I wrote some critical observations upon Plautus.</item>
<label>March 8th.</label>
<item>I wrote a long dissertation upon some lines of Virgil.</item>
<label>June.</label>
<item>I saw Mademoiselle Curchod — <quote xml:lang="la">Omnia vincit amor, et nos cedamus
amori.</quote>
</item>
<label>August.</label>
<item>I went to Crassy, and staid two days.</item>
</list> Note that the <label> might also appear within the <item> rather than as its sibling. Though syntactically valid, this usage is not recommended TEI practice. |
Example | Labels may also be used to represent a label or heading attached to a paragraph or sequence of paragraphs not treated as a structural division, or to a group of verse lines. Note that, in this case, the <label> element appears within the <p> or <lg> element, rather than as a preceding sibling of it. <p>[...]
<lb/>& n’entrer en mauuais & mal-heu-
<lb/>ré meſnage. Or des que le conſente-
<lb/>ment des parties y eſt le mariage eſt
<lb/> arreſté, quoy que de faict il ne ſoit
<label place="margin">Puiſſance maritale
entre les Romains.</label>
<lb/> conſommé. Depuis la conſomma-
<lb/>tion du mariage la femme eſt ſoubs
<lb/> la puiſſance du mary, s’il n’eſt eſcla-
<lb/>ue ou enfant de famille : car en ce
<lb/> cas, la femme, qui a eſpouſé vn en-
<lb/>fant de famille, eſt ſous la puiſſance
[...]</p> In this example the text of the label appears in the right hand margin of the original source, next to the paragraph it describes, but approximately in the middle of it. If so desired the type attribute may be used to distinguish different categories of label. |
Content model | <content> <macroRef key="macro.phraseSeq"/> </content> ⚓ |
Schema Declaration | element label { tei_att.global.attributes, tei_att.cmc.attributes, tei_att.placement.attributes, tei_att.typed.attributes, tei_att.written.attributes, tei_macro.phraseSeq }⚓ |
<langUsage> (language usage) describes the languages, sublanguages, registers, dialects, etc. represented within a text. [2.4.2. Language Usage 2.4. The Profile Description 16.3.2. Declarable Elements] | |
Module | header — Formal specification |
Attributes |
|
Member of | |
Contained by | header: profileDesc |
May contain | |
Example | <langUsage>
<language ident="fr-CA" usage="60">Québecois</language>
<language ident="en-CA" usage="20">Canadian business English</language>
<language ident="en-GB" usage="20">British English</language>
</langUsage> |
Schematron |
<sch:pattern is-a="declarable">
<sch:param name="tde" value="tei:langUsage"/>
</sch:pattern> |
Content model | <content> <alternate> <classRef key="model.pLike" minOccurs="1" maxOccurs="unbounded"/> <elementRef key="language" minOccurs="1" maxOccurs="unbounded"/> </alternate> </content> ⚓ |
Schema Declaration | element langUsage { tei_att.global.attributes, tei_att.declarable.attributes, ( tei_model.pLike+ | tei_language+ ) }⚓ |
<language> (language) characterizes a single language or sublanguage used within a text. [2.4.2. Language Usage] | |||||||||||||
Module | header — Formal specification | ||||||||||||
Attributes |
| ||||||||||||
Contained by | header: langUsage | ||||||||||||
May contain | |||||||||||||
Note | Particularly for sublanguages, an informal prose characterization should be supplied as content for the element. | ||||||||||||
Example | <langUsage>
<language ident="en-US" usage="75">modern American English</language>
<language ident="az-Arab" usage="20">Azerbaijani in Arabic script</language>
<language ident="x-lap" usage="05">Pig Latin</language>
</langUsage> | ||||||||||||
Content model | <content> <macroRef key="macro.phraseSeq.limited"/> </content> ⚓ | ||||||||||||
Schema Declaration | element language { tei_att.global.attributes, tei_att.scope.attributes, attribute ident { text }, attribute usage { text }?, tei_macro.phraseSeq.limited }⚓ |
<licence> contains information about a licence or other legal agreement applicable to the text. [2.2.4. Publication, Distribution, Licensing, etc.] | |
Module | header — Formal specification |
Attributes |
|
Member of | |
Contained by | header: availability |
May contain | |
Note | A <licence> element should be supplied for each licence agreement applicable to the text in question. The target attribute may be used to reference a full version of the licence. The when, notBefore, notAfter, from or to attributes may be used in combination to indicate the date or dates of applicability of the licence. |
Example | <licence target="http://www.nzetc.org/tm/scholarly/tei-NZETC-Help.html#licensing"> Licence: Creative Commons Attribution-Share Alike 3.0 New Zealand Licence
</licence> |
Example | <availability>
<licence target="http://creativecommons.org/licenses/by/3.0/"
notBefore="2013-01-01">
<p>The Creative Commons Attribution 3.0 Unported (CC BY 3.0) Licence
applies to this document.</p>
<p>The licence was added on January 1, 2013.</p>
</licence>
</availability> |
Content model | <content> <macroRef key="macro.specialPara"/> </content> ⚓ |
Schema Declaration | element licence { tei_att.global.attributes, tei_att.datable.attributes, tei_att.pointing.attributes, tei_macro.specialPara }⚓ |
<line> contains the transcription of a topographic line in the source document. [12.2.2. Embedded Transcription] | |
Module | transcr — Formal specification |
Attributes |
|
Member of | |
Contained by | |
May contain | |
Note | This element should be used only to mark up writing which is topographically organized as a series of lines, horizontal or vertical. It should not be used to mark lines of verse (for which use <l>) nor to mark line beginnings within text which has been encoded using structural elements such as <p> (for which use <lb>). |
Example | This example shows topographical lines as a means of preserving the visual appearance of a poem: <surface>
<zone>
<line>Poem</line>
<line>As in Visions of — at</line>
<line>night —</line>
<line>All sorts of fancies running through</line>
<line>the head</line>
</zone>
</surface> |
Example | <surface>
<zone>
<line>Hope you enjoyed</line>
<line>Wales, as they
said</line>
<line>to Mrs FitzHerbert</line>
<line>Mama</line>
</zone>
<zone>
<line>Printed in England</line>
</zone>
</surface> |
Content model | <content> <alternate minOccurs="0" maxOccurs="unbounded"> <textNode/> <classRef key="model.global"/> <classRef key="model.gLike"/> <classRef key="model.linePart"/> </alternate> </content> ⚓ |
Schema Declaration | element line { tei_att.global.attributes, tei_att.coordinated.attributes, tei_att.typed.attributes, tei_att.written.attributes, ( text | tei_model.global | tei_model.gLike | tei_model.linePart )* }⚓ |
<listPrefixDef> (list of prefix definitions) contains a list of definitions of prefixing schemes used in teidata.pointer values, showing how abbreviated URIs using each scheme may be expanded into full URIs. [17.2.3. Using Abbreviated Pointers] | |
Module | header — Formal specification |
Attributes |
|
Member of | |
Contained by | header: encodingDesc listPrefixDef |
May contain | core: desc header: listPrefixDef prefixDef |
Example | In this example, two private URI scheme prefixes are defined and patterns are provided for dereferencing them. Each prefix is also supplied with a human-readable explanation in a <p> element. <listPrefixDef>
<prefixDef ident="psn"
matchPattern="([A-Z]+)"
replacementPattern="personography.xml#$1">
<p> Private URIs using the <code>psn</code>
prefix are pointers to <gi>person</gi>
elements in the personography.xml file.
For example, <code>psn:MDH</code>
dereferences to <code>personography.xml#MDH</code>.
</p>
</prefixDef>
<prefixDef ident="bibl"
matchPattern="([a-z]+[a-z0-9]*)"
replacementPattern="http://www.example.com/getBibl.xql?id=$1">
<p> Private URIs using the <code>bibl</code> prefix can be
expanded to form URIs which retrieve the relevant
bibliographical reference from www.example.com.
</p>
</prefixDef>
</listPrefixDef> |
Content model | <content> <sequence> <elementRef key="desc" minOccurs="0" maxOccurs="unbounded"/> <alternate minOccurs="1" maxOccurs="unbounded"> <elementRef key="prefixDef"/> <elementRef key="listPrefixDef"/> </alternate> </sequence> </content> ⚓ |
Schema Declaration | element listPrefixDef { tei_att.global.attributes, ( tei_desc*, ( tei_prefixDef | tei_listPrefixDef )+ ) }⚓ |
<listTranspose> supplies a list of transpositions, each of which is indicated at some point in a document typically by means of metamarks. [12.3.4.5. Transpositions] | |
Module | transcr — Formal specification |
Attributes |
|
Member of | |
Contained by | |
May contain | |
Example | <listTranspose>
<transpose>
<ptr target="#ib02"/>
<ptr target="#ib01"/>
</transpose>
</listTranspose> This example might be used for a source document which indicates in some way that the elements identified by ib02 and code ib01 should be read in that order (ib02 followed by ib01), rather than in the reading order in which they are presented in the source. |
Content model | <content> <sequence> <elementRef key="desc" minOccurs="0" maxOccurs="unbounded"/> <elementRef key="transpose" minOccurs="1" maxOccurs="unbounded"/> </sequence> </content> ⚓ |
Schema Declaration | element listTranspose { tei_att.global.attributes, ( tei_desc*, tei_transpose+ ) }⚓ |
<localProp> (locally defined property) provides a locally defined character (or glyph) property. [5.2.1. Character Properties] | |
Module | gaiji — Formal specification |
Attributes |
|
Contained by | |
May contain | Empty element |
Note | No definitive list of local names is proposed. However, the name entity is recommended as a means of naming the property identifying the recommended character entity name for this character or glyph. |
Example | <char xml:id="daikanwaU4EBA">
<localProp name="name"
value="CIRCLED IDEOGRAPH 4EBA"/>
<localProp name="entity" value="daikanwa"/>
<unicodeProp name="Decomposition_Mapping"
value="circle"/>
<mapping type="standard">人</mapping>
</char> |
Content model | <content> <empty/> </content> ⚓ |
Schema Declaration | element localProp { tei_att.global.attributes, tei_att.gaijiProp.attributes, empty }⚓ |
<mapping> (character mapping) contains one or more characters which are related to the parent character or glyph in some respect, as specified by the type attribute. [5.2. Markup Constructs for Representation of Characters and Glyphs] | |
Module | gaiji — Formal specification |
Attributes |
|
Contained by | |
May contain | gaiji: g character data |
Note | Suggested values for the type attribute include exact for exact equivalences, uppercase for uppercase equivalences, lowercase for lowercase equivalences, and simplified for simplified characters. The <g> elements contained by this element can point to either another <char> or <glyph> element or contain a character that is intended to be the target of this mapping. |
Example | <mapping type="modern">r</mapping>
<mapping type="standard">人</mapping> |
Content model | <content> <macroRef key="macro.xtext"/> </content> ⚓ |
Schema Declaration | element mapping { tei_att.global.attributes, tei_att.datable.attributes, tei_att.typed.attributes, tei_macro.xtext }⚓ |
<measure> (measure) contains a word or phrase referring to some quantity of an object or commodity, usually comprising a number, a unit, and a commodity name. [3.6.3. Numbers and Measures] | |||||||||
Module | core — Formal specification | ||||||||
Attributes |
| ||||||||
Member of | |||||||||
Contained by | |||||||||
May contain | |||||||||
Example | This example references a definition of a measurement unit declared in the TEI header: <measure type="weight">
<num>2</num> pounds of flesh
</measure>
<measure type="currency">£10-11-6d</measure>
<measure type="area" unitRef="#merk">2 <unit>merks</unit> of old extent</measure>
<!-- In the TEI Header: -->
<encodingDesc>
<unitDecl>
<unitDef xml:id="merk" type="area">
<label>merk</label>
<placeName ref="#Scotland"/>
<desc>A merk was an area of land determined variably by its agricultural
productivity.</desc>
</unitDef>
</unitDecl>
</encodingDesc> | ||||||||
Example | <measure quantity="40" unit="hogshead"
commodity="rum">2 score hh rum</measure>
<measure quantity="12" unit="count"
commodity="roses">1 doz. roses</measure>
<measure quantity="1" unit="count"
commodity="tulips">a yellow tulip</measure> | ||||||||
Example | <head>Long papers.</head>
<p>Speakers will be given 30 minutes each: 20 minutes for
presentation, 10 minutes for discussion. Proposals should not
exceed <measure max="500" unit="count"
commodity="words">500
words</measure>. This presentation type is suitable for
substantial research, theoretical or critical discussions.</p> | ||||||||
Content model | <content> <macroRef key="macro.phraseSeq"/> </content> ⚓ | ||||||||
Schema Declaration | element measure { tei_att.global.attributes, tei_att.cmc.attributes, tei_att.measurement.attributes, tei_att.ranging.attributes, tei_att.typed.attribute.subtype, attribute type { text }?, tei_macro.phraseSeq }⚓ |
<media> indicates the location of any form of external media such as an audio or video clip etc. [3.10. Graphics and Other Non-textual Components] | |||||||||
Module | core — Formal specification | ||||||||
Attributes |
| ||||||||
Member of | |||||||||
Contained by | |||||||||
May contain | core: desc | ||||||||
Note | The attributes available for this element are not appropriate in all cases. For example, it makes no sense to specify the temporal duration of a graphic. Such errors are not currently detected. The mimeType attribute must be used to specify the MIME media type of the resource specified by the url attribute. | ||||||||
Example | <figure>
<media mimeType="image/png" url="fig1.png"/>
<head>Figure One: The View from the Bridge</head>
<figDesc>A Whistleresque view showing four or five sailing boats in the foreground, and a
series of buoys strung out between them.</figDesc>
</figure> | ||||||||
Example | <media mimeType="audio/wav"
url="dingDong.wav" dur="PT10S">
<desc>Ten seconds of bellringing sound</desc>
</media> | ||||||||
Example | <media mimeType="video/mp4"
url="clip45.mp4" dur="PT45M" width="500px">
<desc>A 45 minute video clip to be displayed in a window 500
px wide</desc>
</media> | ||||||||
Content model | <content> <classRef key="model.descLike" minOccurs="0" maxOccurs="unbounded"/> </content> ⚓ | ||||||||
Schema Declaration | element media { tei_att.global.attributes, tei_att.cmc.attributes, tei_att.declaring.attributes, tei_att.media.attribute.width, tei_att.media.attribute.height, tei_att.media.attribute.scale, tei_att.resourced.attributes, tei_att.timed.attributes, tei_att.typed.attributes, attribute mimeType { list { + } }, tei_model.descLike* }⚓ |
<meeting> contains the formalized descriptive title for a meeting or conference, for use in a bibliographic description for an item derived from such a meeting, or as a heading or preamble to publications emanating from it. [3.12.2.2. Titles, Authors, and Editors] | |
Module | core — Formal specification |
Attributes |
|
Member of | |
Contained by | |
May contain | |
Example | <div>
<meeting>Ninth International Conference on Middle High German Textual Criticism, Aachen,
June 1998.</meeting>
<list type="attendance">
<head>List of Participants</head>
<item>
<persName>...</persName>
</item>
<item>
<persName>...</persName>
</item>
<!--...-->
</list>
<p>...</p>
</div> |
Content model | <content> <macroRef key="macro.limitedContent"/> </content> ⚓ |
Schema Declaration | element meeting { tei_att.global.attributes, tei_att.canonical.attributes, tei_att.cmc.attributes, tei_att.datable.attributes, tei_macro.limitedContent }⚓ |
<metamark> contains or describes any kind of graphic or written signal within a document the function of which is to determine how it should be read rather than forming part of the actual content of the document. [12.3.4.2. Metamarks] | |||||||||||||
Module | transcr — Formal specification | ||||||||||||
Attributes |
| ||||||||||||
Member of | |||||||||||||
Contained by | |||||||||||||
May contain | |||||||||||||
Example | <surface>
<metamark function="used" rend="line"
target="#X2"/>
<zone xml:id="zone-X2">
<line>I am that halfgrown <add>angry</add> boy, fallen asleep</line>
<line>The tears of foolish passion yet undried</line>
<line>upon my cheeks.</line>
<!-- ... -->
<line>I pass through <add>the</add> travels and <del>fortunes</del> of
<retrace>thirty</retrace>
</line>
<line>years and become old,</line>
<line>Each in its due order comes and goes,</line>
<line>And thus a message for me comes.</line>
<line>The</line>
</zone>
<metamark function="used"
target="#zone-X2">Entered - Yes</metamark>
</surface> | ||||||||||||
Content model | <content> <macroRef key="macro.specialPara"/> </content> ⚓ | ||||||||||||
Schema Declaration | element metamark { tei_att.global.attributes, tei_att.placement.attributes, tei_att.spanning.attributes, attribute function { text }?, attribute target { list { + } }?, tei_macro.specialPara }⚓ |
<mod> represents any kind of modification identified within a single document. [12.3.4.1. Generic Modification] | |
Module | transcr — Formal specification |
Attributes |
|
Member of | |
Contained by | |
May contain | |
Example | <mod type="subst">
<add>pleasing</add>
<del>agreable</del>
</mod> |
Content model | <content> <macroRef key="macro.paraContent"/> </content> ⚓ |
Schema Declaration | element mod { tei_att.global.attributes, tei_att.dimensions.attributes, tei_att.spanning.attributes, tei_att.transcriptional.attributes, tei_att.typed.attributes, tei_macro.paraContent }⚓ |
<name> (name, proper noun) contains a proper noun or noun phrase. [3.6.1. Referring Strings] | |
Module | core — Formal specification |
Attributes |
|
Member of | |
Contained by | |
May contain | |
Note | Proper nouns referring to people, places, and organizations may be tagged instead with <persName>, <placeName>, or <orgName>, when the TEI module for names and dates is included. |
Example | <name type="person">Thomas Hoccleve</name>
<name type="place">Villingaholt</name>
<name type="org">Vetus Latina Institut</name>
<name type="person" ref="#HOC001">Occleve</name> |
Content model | <content> <macroRef key="macro.phraseSeq"/> </content> ⚓ |
Schema Declaration | element name { tei_att.global.attributes, tei_att.cmc.attributes, tei_att.datable.attributes, tei_att.editLike.attributes, tei_att.personal.attributes, tei_att.typed.attributes, tei_macro.phraseSeq }⚓ |
<namespace> (namespace) supplies the formal name of the namespace to which the elements documented by its children belong. [2.3.4. The Tagging Declaration] | |||||||
Module | header — Formal specification | ||||||
Attributes |
| ||||||
Contained by | header: tagsDecl | ||||||
May contain | header: tagUsage | ||||||
Example | <namespace name="http://www.tei-c.org/ns/1.0">
<tagUsage gi="hi" occurs="28" withId="2"> Used only to mark English words
italicized in the copy text </tagUsage>
</namespace> | ||||||
Content model | <content> <elementRef key="tagUsage" minOccurs="1" maxOccurs="unbounded"/> </content> ⚓ | ||||||
Schema Declaration | element namespace { tei_att.global.attributes, attribute name { ? }, tei_tagUsage+ }⚓ |
<normalization> (normalization) indicates the extent of normalization or regularization of the original source carried out in converting it to electronic form. [2.3.3. The Editorial Practices Declaration 16.3.2. Declarable Elements] | |||||||||
Module | header — Formal specification | ||||||||
Attributes |
| ||||||||
Member of | |||||||||
Contained by | header: editorialDecl | ||||||||
May contain | core: p | ||||||||
Example | <editorialDecl>
<normalization method="markup">
<p>Where both upper- and lower-case i, j, u, v, and vv have been normalized, to modern
20th century typographical practice, the <gi>choice</gi> element has been used to
enclose <gi>orig</gi> and <gi>reg</gi> elements giving the original and new values
respectively. ... </p>
</normalization>
<normalization method="silent">
<p>Spacing between words and following punctuation has been regularized to zero spaces;
spacing between words has been regularized to one space.</p>
</normalization>
<normalization source="http://www.dict.sztaki.hu/webster">
<p>Spelling converted throughout to Modern American usage, based on Websters 9th
Collegiate dictionary.</p>
</normalization>
</editorialDecl> | ||||||||
Schematron |
<sch:pattern is-a="declarable">
<sch:param name="tde"
value="tei:normalization"/>
</sch:pattern> | ||||||||
Content model | <content> <classRef key="model.pLike" minOccurs="1" maxOccurs="unbounded"/> </content> ⚓ | ||||||||
Schema Declaration | element normalization { tei_att.global.attributes, tei_att.declarable.attributes, attribute method { "silent" | "markup" }?, tei_model.pLike+ }⚓ |
<note> (note) contains a note or annotation. [3.9.1. Notes and Simple Annotation 2.2.6. The Notes Statement 3.12.2.8. Notes and Statement of Language 10.3.5.4. Notes within Entries] | |
Module | core — Formal specification |
Attributes |
|
Member of | |
Contained by | |
May contain | |
Example | In the following example, the translator has supplied a footnote containing an explanation of the term translated as "painterly": And yet it is not only
in the great line of Italian renaissance art, but even in the
painterly <note place="bottom" type="gloss"
resp="#MDMH">
<term xml:lang="de">Malerisch</term>. This word has, in the German, two
distinct meanings, one objective, a quality residing in the object,
the other subjective, a mode of apprehension and creation. To avoid
confusion, they have been distinguished in English as
<mentioned>picturesque</mentioned> and
<mentioned>painterly</mentioned> respectively.
</note> style of the
Dutch genre painters of the seventeenth century that drapery has this
psychological significance.
<!-- elsewhere in the document -->
<respStmt xml:id="MDMH">
<resp>translation from German to English</resp>
<name>Hottinger, Marie Donald Mackie</name>
</respStmt> For this example to be valid, the code MDMH must be defined elsewhere, for example by means of a responsibility statement in the associated TEI header. |
Example | The global n attribute may be used to supply the symbol or number used to mark the note's point of attachment in the source text, as in the following example: Mevorakh b. Saadya's mother, the matriarch of the
family during the second half of the eleventh century, <note n="126" anchored="true"> The
alleged mention of Judah Nagid's mother in a letter from 1071 is, in fact, a reference to
Judah's children; cf. above, nn. 111 and 54. </note> is well known from Geniza documents
published by Jacob Mann. However, if notes are numbered in sequence and their numbering can be reconstructed automatically by processing software, it may well be considered unnecessary to record the note numbers. |
Content model | <content> <macroRef key="macro.specialPara"/> </content> ⚓ |
Schema Declaration | element note { tei_att.global.attributes, tei_att.anchoring.attributes, tei_att.cmc.attributes, tei_att.placement.attributes, tei_att.pointing.attributes, tei_att.typed.attributes, tei_att.written.attributes, tei_macro.specialPara }⚓ |
<num> (number) contains a number, written in any form. [3.6.3. Numbers and Measures] | |||||||||||||||||||||||
Module | core — Formal specification | ||||||||||||||||||||||
Attributes |
| ||||||||||||||||||||||
Member of | |||||||||||||||||||||||
Contained by | |||||||||||||||||||||||
May contain | |||||||||||||||||||||||
Note | Detailed analyses of quantities and units of measure in historical documents may also use the feature structure mechanism described in chapter 19. Feature Structures. The <num> element is intended for use in simple applications. | ||||||||||||||||||||||
Example | <p>I reached <num type="cardinal" value="21">twenty-one</num> on
my <num type="ordinal" value="21">twenty-first</num> birthday</p>
<p>Light travels at <num value="3E10">3×10<hi rend="sup">10</hi>
</num> cm per second.</p> | ||||||||||||||||||||||
Content model | <content> <macroRef key="macro.phraseSeq"/> </content> ⚓ | ||||||||||||||||||||||
Schema Declaration | element num { tei_att.global.attributes, tei_att.cmc.attributes, tei_att.ranging.attributes, tei_att.typed.attribute.subtype, attribute type { "cardinal" | "ordinal" | "fraction" | "percentage" }?, attribute value { text }?, tei_macro.phraseSeq }⚓ |
<p> (paragraph) marks paragraphs in prose. [3.1. Paragraphs 7.2.5. Speech Contents] | |
Module | core — Formal specification |
Attributes |
|
Member of | |
Contained by | |
May contain | |
Example | <p>Hallgerd was outside. <q>There is blood on your axe,</q> she said. <q>What have you
done?</q>
</p>
<p>
<q>I have now arranged that you can be married a second time,</q> replied Thjostolf.
</p>
<p>
<q>Then you must mean that Thorvald is dead,</q> she said.
</p>
<p>
<q>Yes,</q> said Thjostolf. <q>And now you must think up some plan for me.</q>
</p> |
Schematron |
<sch:rule context="tei:p">
<sch:report test="(ancestor::tei:ab or ancestor::tei:p) and not( ancestor::tei:floatingText
| parent::tei:exemplum | parent::tei:item | parent::tei:note | parent::tei:q
| parent::tei:quote | parent::tei:remarks | parent::tei:said | parent::tei:sp
| parent::tei:stage | parent::tei:cell | parent::tei:figure )"> Abstract model violation: Paragraphs may not occur inside other paragraphs or ab elements.
</sch:report>
</sch:rule> |
Schematron |
<sch:rule context="tei:l//tei:p">
<sch:assert test="ancestor::tei:floatingText | parent::tei:figure | parent::tei:note"> Abstract model violation: Metrical lines may not contain higher-level structural elements such as div, p, or ab, unless p is a child of figure or note, or is a descendant of floatingText.
</sch:assert>
</sch:rule> |
Content model | <content> <macroRef key="macro.paraContent"/> </content> ⚓ |
Schema Declaration | element p { tei_att.global.attributes, tei_att.cmc.attributes, tei_att.declaring.attributes, tei_att.fragmentable.attributes, tei_att.written.attributes, tei_macro.paraContent }⚓ |
<particDesc> (participation description) describes the identifiable speakers, voices, or other participants in any kind of text or other persons named or otherwise referred to in a text, edition, or metadata. [16.2. Contextual Information] | |
Module | corpus — Formal specification |
Attributes |
|
Member of | |
Contained by | header: profileDesc |
May contain | core: p |
Note | May contain a prose description organized as paragraphs, or a structured list of persons and person groups, with an optional formal specification of any relationships amongst them. |
Example | <particDesc>
<listPerson>
<person xml:id="P-1234" sex="2" age="mid">
<p>Female informant, well-educated, born in
Shropshire UK, 12 Jan 1950, of unknown occupation. Speaks French fluently.
Socio-Economic status B2.</p>
</person>
<person xml:id="P-4332" sex="1">
<persName>
<surname>Hancock</surname>
<forename>Antony</forename>
<forename>Aloysius</forename>
<forename>St John</forename>
</persName>
<residence notAfter="1959">
<address>
<street>Railway Cuttings</street>
<settlement>East Cheam</settlement>
</address>
</residence>
<occupation>comedian</occupation>
</person>
<listRelation>
<relation type="personal" name="spouse"
mutual="#P-1234 #P-4332"/>
</listRelation>
</listPerson>
</particDesc> This example shows both a very simple person description, and a very detailed one, using some of the more specialized elements from the module for Names and Dates. |
Schematron |
<sch:pattern is-a="declarable">
<sch:param name="tde"
value="tei:particDesc"/>
</sch:pattern> |
Content model | <content> <alternate> <classRef key="model.pLike" minOccurs="1" maxOccurs="unbounded"/> <alternate minOccurs="1" maxOccurs="unbounded"> <classRef key="model.personLike"/> <elementRef key="listPerson"/> <elementRef key="listOrg"/> </alternate> </alternate> </content> ⚓ |
Schema Declaration | element particDesc { tei_att.global.attributes, tei_att.declarable.attributes, ( tei_model.pLike+ | ( tei_model.personLike | listPerson | listOrg )+ ) }⚓ |
<path> (path) defines any line passing through two or more points within a <surface> element. [12.1. Digital Facsimiles 12.2.2. Embedded Transcription] | |||||||||
Module | transcr — Formal specification | ||||||||
Attributes |
| ||||||||
Member of | |||||||||
Contained by | |||||||||
May contain | Empty element | ||||||||
Note | Although the simplest form of a path is a straight line between two points, a line with more than two points may bend at any point. The order of coordinates in points is significant, because the line follows the coordinate sequence. To specify a closed polygon, use the <zone> element rather than the <path> element. | ||||||||
Example | <surface ulx="0" uly="0" lrx="443" lry="272">
<graphic url="facs-fig3.jpg"/>
<path points="74,73 171,244"/>
<path points="71,203 173,116"/>
</surface> | ||||||||
Schematron | Since a <path> represents a line with distinct start and end points, the last coordinate should not be the same as the first coordinate.
<sch:rule context="tei:path[@points]">
<sch:let name="firstPair"
value="tokenize( normalize-space( @points ), ' ')[1]"/>
<sch:let name="lastPair"
value="tokenize( normalize-space( @points ), ' ')[last()]"/>
<sch:let name="firstX"
value="xs:float( substring-before( $firstPair, ',') )"/>
<sch:let name="firstY"
value="xs:float( substring-after( $firstPair, ',') )"/>
<sch:let name="lastX"
value="xs:float( substring-before( $lastPair, ',') )"/>
<sch:let name="lastY"
value="xs:float( substring-after( $lastPair, ',') )"/>
<sch:report test="$firstX eq $lastX and $firstY eq $lastY">The first and
last elements of this path are the same. To specify a closed polygon, use
the zone element rather than the path element. </sch:report>
</sch:rule> | ||||||||
Content model | <content> <empty/> </content> ⚓ | ||||||||
Schema Declaration | element path { tei_att.global.attributes, tei_att.coordinated.attribute.start, tei_att.coordinated.attribute.ulx, tei_att.coordinated.attribute.uly, tei_att.coordinated.attribute.lrx, tei_att.coordinated.attribute.lry, tei_att.typed.attributes, tei_att.written.attributes, attribute points { list { * } }?, empty }⚓ |
<pb> (page beginning) marks the beginning of a new page in a paginated document. [3.11.3. Milestone Elements] | |
Module | core — Formal specification |
Attributes |
|
Member of | |
Contained by | |
May contain | Empty element |
Note | A <pb> element should appear at the start of the page which it identifies. The global n attribute indicates the number or other value associated with this page. This will normally be the page number or signature printed on it, since the physical sequence number is implicit in the presence of the <pb> element itself. The type attribute may be used to characterize the page beginning in any respect. The more specialized attributes break, ed, or edRef should be preferred when the intent is to indicate whether or not the page beginning is word-breaking, or to note the source from which it derives. |
Example | Page numbers may vary in different editions of a text. <p> ... <pb n="145" ed="ed2"/>
<!-- Page 145 in edition "ed2" starts here --> ... <pb n="283" ed="ed1"/>
<!-- Page 283 in edition "ed1" starts here--> ... </p> |
Example | A page beginning may be associated with a facsimile image of the page it introduces by means of the facs attribute <body>
<pb n="1" facs="page1.png"/>
<!-- page1.png contains an image of the page;
the text it contains is encoded here -->
<p>
<!-- ... -->
</p>
<pb n="2" facs="page2.png"/>
<!-- similarly, for page 2 -->
<p>
<!-- ... -->
</p>
</body> |
Content model | <content> <empty/> </content> ⚓ |
Schema Declaration | element pb { tei_att.global.attributes, tei_att.breaking.attributes, tei_att.cmc.attributes, tei_att.edition.attributes, tei_att.spanning.attributes, tei_att.typed.attributes, empty }⚓ |
<pc> (punctuation character) contains a character or string of characters regarded as constituting a single punctuation mark. [18.1.2. Below the Word Level 18.4.2. Lightweight Linguistic Annotation] | |||||||||||||||||||||
Module | analysis — Formal specification | ||||||||||||||||||||
Attributes |
| ||||||||||||||||||||
Member of | |||||||||||||||||||||
Contained by | |||||||||||||||||||||
May contain | |||||||||||||||||||||
Example | <phr>
<w>do</w>
<w>you</w>
<w>understand</w>
<pc type="interrogative">?</pc>
</phr> | ||||||||||||||||||||
Example | Example encoding of the German sentence Wir fahren in den Urlaub., encoded with attributes from att.linguistic discussed in section [[undefined AILALW]]. <s>
<w pos="PPER" msd="1.Pl.*.Nom">Wir</w>
<w pos="VVFIN" msd="1.Pl.Pres.Ind">fahren</w>
<w pos="APPR" msd="--">in</w>
<w pos="ART" msd="Def.Masc.Akk.Sg.">den</w>
<w pos="NN" msd="Masc.Akk.Sg.">Urlaub</w>
<pc pos="$." msd="--" join="left">.</pc>
</s> | ||||||||||||||||||||
Content model | <content> <alternate minOccurs="0" maxOccurs="unbounded"> <textNode/> <classRef key="model.gLike"/> <elementRef key="c"/> <classRef key="model.pPart.edit"/> </alternate> </content> ⚓ | ||||||||||||||||||||
Schema Declaration | element pc { tei_att.global.attributes, tei_att.cmc.attributes, tei_att.linguistic.attributes, tei_att.segLike.attributes, tei_att.typed.attributes, attribute force { "strong" | "weak" | "inter" }?, attribute unit { text }?, attribute pre { text }?, ( text | tei_model.gLike | c | tei_model.pPart.edit )* }⚓ |
<prefixDef> (prefix definition) defines a prefixing scheme used in teidata.pointer values, showing how abbreviated URIs using the scheme may be expanded into full URIs. [17.2.3. Using Abbreviated Pointers] | |||||||||
Module | header — Formal specification | ||||||||
Attributes |
| ||||||||
Contained by | header: listPrefixDef | ||||||||
May contain | core: p | ||||||||
Note | The abbreviated pointer may be dereferenced to produce either an absolute or a relative URI reference. In the latter case it is combined with the value of xml:base in force at the place where the pointing attribute occurs to form an absolute URI in the usual manner as prescribed by XML Base. | ||||||||
Example | <prefixDef ident="ref"
matchPattern="([a-z]+)"
replacementPattern="../../references/references.xml#$1">
<p> In the context of this project, private URIs with
the prefix "ref" point to <gi>div</gi> elements in
the project's global references.xml file.
</p>
</prefixDef> | ||||||||
Content model | <content> <classRef key="model.pLike" minOccurs="0" maxOccurs="unbounded"/> </content> ⚓ | ||||||||
Schema Declaration | element prefixDef { tei_att.global.attributes, tei_att.patternReplacement.attributes, attribute ident { text }, tei_model.pLike* }⚓ |
<profileDesc> (text-profile description) provides a detailed description of non-bibliographic aspects of a text, specifically the languages and sublanguages used, the situation in which it was produced, the participants and their setting. [2.4. The Profile Description 2.1.1. The TEI Header and Its Components] | |
Module | header — Formal specification |
Attributes |
|
Member of | |
Contained by | header: teiHeader |
May contain | |
Note | Although the content model permits it, it is rarely meaningful to supply multiple occurrences for any of the child elements of <profileDesc> unless these are documenting multiple texts. |
Example | <profileDesc>
<langUsage>
<language ident="fr">French</language>
</langUsage>
<textDesc n="novel">
<channel mode="w">print; part issues</channel>
<constitution type="single"/>
<derivation type="original"/>
<domain type="art"/>
<factuality type="fiction"/>
<interaction type="none"/>
<preparedness type="prepared"/>
<purpose type="entertain" degree="high"/>
<purpose type="inform" degree="medium"/>
</textDesc>
<settingDesc>
<setting>
<name>Paris, France</name>
<time>Late 19th century</time>
</setting>
</settingDesc>
</profileDesc> |
Content model | <content> <classRef key="model.profileDescPart" minOccurs="0" maxOccurs="unbounded"/> </content> ⚓ |
Schema Declaration | element profileDesc { tei_att.global.attributes, tei_model.profileDescPart* }⚓ |
<projectDesc> (project description) describes in detail the aim or purpose for which an electronic file was encoded, together with any other relevant information concerning the process by which it was assembled or collected. [2.3.1. The Project Description 2.3. The Encoding Description 16.3.2. Declarable Elements] | |
Module | header — Formal specification |
Attributes |
|
Member of | |
Contained by | header: encodingDesc |
May contain | core: p |
Example | <projectDesc>
<p>Texts collected for use in the Claremont Shakespeare Clinic, June 1990</p>
</projectDesc> |
Schematron |
<sch:pattern is-a="declarable">
<sch:param name="tde"
value="tei:projectDesc"/>
</sch:pattern> |
Content model | <content> <classRef key="model.pLike" minOccurs="1" maxOccurs="unbounded"/> </content> ⚓ |
Schema Declaration | element projectDesc { tei_att.global.attributes, tei_att.declarable.attributes, tei_model.pLike+ }⚓ |
<pubPlace> (publication place) contains the name of the place where a bibliographic item was published. [3.12.2.4. Imprint, Size of a Document, and Reprint Information] | |
Module | core — Formal specification |
Attributes |
|
Member of | |
Contained by | core: bibl header: publicationStmt |
May contain | |
Example | <publicationStmt>
<publisher>Oxford University Press</publisher>
<pubPlace>Oxford</pubPlace>
<date>1989</date>
</publicationStmt> |
Content model | <content> <macroRef key="macro.phraseSeq"/> </content> ⚓ |
Schema Declaration | element pubPlace { tei_att.global.attributes, tei_att.naming.attributes, tei_macro.phraseSeq }⚓ |
<publicationStmt> (publication statement) groups information concerning the publication or distribution of an electronic or other text. [2.2.4. Publication, Distribution, Licensing, etc. 2.2. The File Description] | |
Module | header — Formal specification |
Attributes |
|
Contained by | header: fileDesc |
May contain | |
Note | Where a publication statement contains several members of the model.publicationStmtPart.agency or model.publicationStmtPart.detail classes rather than one or more paragraphs or anonymous blocks, care should be taken to ensure that the repeated elements are presented in a meaningful order. It is a conformance requirement that elements supplying information about publication place, address, identifier, availability, and date be given following the name of the publisher, distributor, or authority concerned, and preferably in that order. |
Example | <publicationStmt>
<publisher>C. Muquardt </publisher>
<pubPlace>Bruxelles & Leipzig</pubPlace>
<date when="1846"/>
</publicationStmt> |
Example | <publicationStmt>
<publisher>Chadwyck Healey</publisher>
<pubPlace>Cambridge</pubPlace>
<availability>
<p>Available under licence only</p>
</availability>
<date when="1992">1992</date>
</publicationStmt> |
Example | <publicationStmt>
<publisher>Zea Books</publisher>
<pubPlace>Lincoln, NE</pubPlace>
<date>2017</date>
<availability>
<p>This is an open access work licensed under a Creative Commons Attribution 4.0 International license.</p>
</availability>
<ptr target="http://digitalcommons.unl.edu/zeabook/55"/>
</publicationStmt> |
Content model | <content> <alternate> <sequence minOccurs="1" maxOccurs="unbounded"> <classRef key="model.publicationStmtPart.agency"/> <classRef key="model.publicationStmtPart.detail" minOccurs="0" maxOccurs="unbounded"/> </sequence> <classRef key="model.pLike" minOccurs="1" maxOccurs="unbounded"/> </alternate> </content> ⚓ |
Schema Declaration | element publicationStmt { tei_att.global.attributes, ( ( ( tei_model.publicationStmtPart.agency, tei_model.publicationStmtPart.detail* )+ ) | tei_model.pLike+ ) }⚓ |
<publisher> (publisher) provides the name of the organization responsible for the publication or distribution of a bibliographic item. [3.12.2.4. Imprint, Size of a Document, and Reprint Information 2.2.4. Publication, Distribution, Licensing, etc.] | |
Module | core — Formal specification |
Attributes |
|
Member of | |
Contained by | core: bibl header: publicationStmt |
May contain | |
Note | Use the full form of the name by which a company is usually referred to, rather than any abbreviation of it which may appear on a title page |
Example | <imprint>
<pubPlace>Oxford</pubPlace>
<publisher>Clarendon Press</publisher>
<date>1987</date>
</imprint> |
Content model | <content> <macroRef key="macro.phraseSeq"/> </content> ⚓ |
Schema Declaration | element publisher { tei_att.global.attributes, tei_att.canonical.attributes, tei_macro.phraseSeq }⚓ |
<quotation> (quotation) specifies editorial practice adopted with respect to quotation marks in the original. [2.3.3. The Editorial Practices Declaration 16.3.2. Declarable Elements] | |||||||||
Module | header — Formal specification | ||||||||
Attributes |
| ||||||||
Member of | |||||||||
Contained by | header: editorialDecl | ||||||||
May contain | core: p | ||||||||
Example | <quotation marks="none">
<p>No quotation marks have been retained. Instead, the <att>rend</att> attribute on the
<gi>q</gi> element is used to specify what kinds of quotation mark was used, according
to the following list: <list type="gloss">
<label>dq</label>
<item>double quotes, open and close</item>
<label>sq</label>
<item>single quotes, open and close</item>
<label>dash</label>
<item>long dash open, no close</item>
<label>dg</label>
<item>double guillemets, open and close</item>
</list>
</p>
</quotation> | ||||||||
Example | <quotation marks="all">
<p>All quotation marks are retained in the text and are represented by appropriate Unicode
characters.</p>
</quotation> | ||||||||
Schematron |
<sch:pattern is-a="declarable">
<sch:param name="tde" value="tei:quotation"/>
</sch:pattern> | ||||||||
Schematron |
<sch:rule context="tei:quotation">
<sch:report test="not( @marks ) and not( tei:p )"> On <sch:name/>, either the @marks attribute should be used, or a paragraph of description provided
</sch:report>
</sch:rule> | ||||||||
Content model | <content> <classRef key="model.pLike" minOccurs="0" maxOccurs="unbounded"/> </content> ⚓ | ||||||||
Schema Declaration | element quotation { tei_att.global.attributes, tei_att.declarable.attributes, attribute marks { "none" | "some" | "all" }?, tei_model.pLike* }⚓ |
<redo> indicates one or more cancelled interventions in a document which have subsequently been marked as reaffirmed or repeated. [12.3.4.4. Confirmation, Cancellation, and Reinstatement of Modifications] | |||||||
Module | transcr — Formal specification | ||||||
Attributes |
| ||||||
Member of | |||||||
Contained by | |||||||
May contain | Empty element | ||||||
Example | <line>
<redo hand="#g_t" target="#redo-1"
cause="fix"/>
<mod xml:id="redo-1" rend="strikethrough"
spanTo="#anchor-1" hand="#g_bl"/>Ihr hagren, triſten, krummgezog<mod rend="strikethrough">nen</mod>ener Nacken
</line>
<line>Wenn ihr nur piepſet iſt die Welt ſchon matt.<anchor xml:id="anchor-1"/>
</line> This encoding represents the following sequence of events:
| ||||||
Content model | <content> <empty/> </content> ⚓ | ||||||
Schema Declaration | element redo { tei_att.global.attributes, tei_att.dimensions.attributes, tei_att.spanning.attributes, tei_att.transcriptional.attributes, attribute target { list { + } }?, empty }⚓ |
<ref> (reference) defines a reference to another location, possibly modified by additional text or comment. [3.7. Simple Links and Cross-References 17.1. Links] | |
Module | core — Formal specification |
Attributes |
|
Member of | |
Contained by | |
May contain | |
Note | The target and cRef attributes are mutually exclusive. |
Example | See especially <ref target="http://www.natcorp.ox.ac.uk/Texts/A02.xml#s2">the second
sentence</ref> |
Example | See also <ref target="#locution">s.v. <term>locution</term>
</ref>. |
Schematron |
<sch:rule context="tei:ref">
<sch:report test="@target and @cRef">Only one of the attributes @target and @cRef may be supplied on <sch:name/>.</sch:report>
</sch:rule> |
Content model | <content> <macroRef key="macro.paraContent"/> </content> ⚓ |
Schema Declaration | element ref { tei_att.global.attributes, tei_att.cReferencing.attributes, tei_att.cmc.attributes, tei_att.declaring.attributes, tei_att.internetMedia.attributes, tei_att.pointing.attributes, tei_att.typed.attributes, tei_macro.paraContent }⚓ |
<resp> (responsibility) contains a phrase describing the nature of a person's intellectual responsibility, or an organization's role in the production or distribution of a work. [3.12.2.2. Titles, Authors, and Editors 2.2.1. The Title Statement 2.2.2. The Edition Statement 2.2.5. The Series Statement] | |
Module | core — Formal specification |
Attributes |
|
Contained by | core: respStmt |
May contain | |
Note | The attribute ref, inherited from the class att.canonical may be used to indicate the kind of responsibility in a normalized form by referring directly to a standardized list of responsibility types, such as that maintained by a naming authority, for example the list maintained at http://www.loc.gov/marc/relators/relacode.html for bibliographic usage. |
Example | <respStmt>
<resp ref="http://id.loc.gov/vocabulary/relators/com.html">compiler</resp>
<name>Edward Child</name>
</respStmt> |
Content model | <content> <macroRef key="macro.phraseSeq.limited"/> </content> ⚓ |
Schema Declaration | element resp { tei_att.global.attributes, tei_att.canonical.attributes, tei_att.datable.attributes, tei_macro.phraseSeq.limited }⚓ |
<respStmt> (statement of responsibility) supplies a statement of responsibility for the intellectual content of a text, edition, recording, or series, where the specialized elements for authors, editors, etc. do not suffice or do not apply. May also be used to encode information about individuals or organizations which have played a role in the production or distribution of a bibliographic work. [3.12.2.2. Titles, Authors, and Editors 2.2.1. The Title Statement 2.2.2. The Edition Statement 2.2.5. The Series Statement] | |
Module | core — Formal specification |
Attributes |
|
Member of | |
Contained by | core: bibl header: editionStmt titleStmt |
May contain | |
Example | <respStmt>
<resp>transcribed from original ms</resp>
<persName>Claus Huitfeldt</persName>
</respStmt> |
Example | <respStmt>
<resp>converted to XML encoding</resp>
<name>Alan Morrison</name>
</respStmt> |
Content model | <content> <sequence> <alternate> <sequence> <elementRef key="resp" minOccurs="1" maxOccurs="unbounded"/> <classRef key="model.nameLike.agent" minOccurs="1" maxOccurs="unbounded"/> </sequence> <sequence> <classRef key="model.nameLike.agent" minOccurs="1" maxOccurs="unbounded"/> <elementRef key="resp" minOccurs="1" maxOccurs="unbounded"/> </sequence> </alternate> <elementRef key="note" minOccurs="0" maxOccurs="unbounded"/> </sequence> </content> ⚓ |
Schema Declaration | element respStmt { tei_att.global.attributes, tei_att.canonical.attributes, ( ( ( tei_resp+, tei_model.nameLike.agent+ ) | ( tei_model.nameLike.agent+, tei_resp+ ) ), tei_note* ) }⚓ |
<restore> (restore) indicates restoration of text to an earlier state by cancellation of an editorial or authorial marking or instruction. [12.3.1.6. Cancellation of Deletions and Other Markings] | |
Module | transcr — Formal specification |
Attributes |
|
Member of | |
Contained by | |
May contain | |
Note | On this element, the type attribute categorizes the way that the cancelled intervention has been indicated in some way, for example by means of a marginal note, over-inking, additional markup, etc. |
Example | For I hate this
<restore hand="#dhl"
type="marginalStetNote">
<del>my</del>
</restore> body |
Content model | <content> <macroRef key="macro.paraContent"/> </content> ⚓ |
Schema Declaration | element restore { tei_att.global.attributes, tei_att.dimensions.attributes, tei_att.transcriptional.attributes, tei_att.typed.attributes, tei_macro.paraContent }⚓ |
<retrace> contains a sequence of writing which has been retraced, for example by over-inking, to clarify or fix it. [12.3.4.3. Fixation and Clarification] | |
Module | transcr — Formal specification |
Attributes |
|
Member of | |
Contained by | |
May contain | |
Note | Multiple retraces are indicated by nesting one <retrace> within another. In principle, a retrace differs from a substitution in that second and subsequent rewrites do not materially alter the content of an element. Where minor changes have been made during the retracing action however these may be marked up using <del>, <add>, etc. with an appropriate value for the change attribute. |
Content model | <content> <macroRef key="macro.paraContent"/> </content> ⚓ |
Schema Declaration | element retrace { tei_att.global.attributes, tei_att.dimensions.attributes, tei_att.spanning.attributes, tei_att.transcriptional.attributes, tei_macro.paraContent }⚓ |
<revisionDesc> (revision description) summarizes the revision history for a file. [2.6. The Revision Description 2.1.1. The TEI Header and Its Components] | |
Module | header — Formal specification |
Attributes |
|
Contained by | header: teiHeader |
May contain | header: change |
Note | If present on this element, the status attribute should indicate the current status of the document. The same attribute may appear on any <change> to record the status at the time of that change. Conventionally <change> elements should be given in reverse date order, with the most recent change at the start of the list. |
Example | <revisionDesc status="embargoed">
<change when="1991-11-11" who="#LB"> deleted chapter 10 </change>
</revisionDesc> |
Content model | <content> <alternate> <elementRef key="list" minOccurs="1" maxOccurs="unbounded"/> <elementRef key="listChange" minOccurs="1" maxOccurs="unbounded"/> <elementRef key="change" minOccurs="1" maxOccurs="unbounded"/> </alternate> </content> ⚓ |
Schema Declaration | element revisionDesc { tei_att.global.attributes, tei_att.docStatus.attributes, ( list+ | listChange+ | tei_change+ ) }⚓ |
<s> (s-unit) contains a sentence-like division of a text. [18.1. Linguistic Segment Categories 8.4.1. Segmentation] | |
Module | analysis — Formal specification |
Attributes |
|
Member of | |
Contained by | |
May contain | |
Note | The <s> element may be used to mark orthographic sentences, or any other segmentation of a text, provided that the segmentation is end-to-end, complete, and non-nesting. For segmentation which is partial or recursive, the <seg> should be used instead. The type attribute may be used to indicate the type of segmentation intended, according to any convenient typology. |
Example | <head>
<s>A short affair</s>
</head>
<s>When are you leaving?</s>
<s>Tomorrow.</s> |
Schematron |
<sch:rule context="tei:s">
<sch:report test="tei:s">You may not nest one s element within another: use seg instead</sch:report>
</sch:rule> |
Content model | <content> <macroRef key="macro.phraseSeq"/> </content> ⚓ |
Schema Declaration | element s { tei_att.global.attributes, tei_att.cmc.attributes, tei_att.notated.attributes, tei_att.segLike.attributes, tei_att.typed.attributes, tei_macro.phraseSeq }⚓ |
<secl> (secluded text) Secluded. Marks text present in the source which the editor believes to be genuine but out of its original place (which is unknown). [12.3.1.7. Text Omitted from or Supplied in the Transcription] | |||||||
Module | transcr — Formal specification | ||||||
Attributes |
| ||||||
Member of | |||||||
Contained by | |||||||
May contain | |||||||
Example | <rdg source="#Pescani">
<secl>
<l n="15" xml:id="l15">Alphesiboea suos ulta est pro coniuge fratres,</l>
<l n="16" xml:id="l16">sanguinis et cari vincula rupit amor.</l>
</secl>
</rdg>
<note>secl. Pescani</note> | ||||||
Content model | <content> <macroRef key="macro.paraContent"/> </content> ⚓ | ||||||
Schema Declaration | element secl { tei_att.global.attributes, tei_att.dimensions.attributes, tei_att.editLike.attributes, attribute reason { list { + } }?, tei_macro.paraContent }⚓ |
<segmentation> (segmentation) describes the principles according to which the text has been segmented, for example into sentences, tone-units, graphemic strata, etc. [2.3.3. The Editorial Practices Declaration 16.3.2. Declarable Elements] | |
Module | header — Formal specification |
Attributes |
|
Member of | |
Contained by | header: editorialDecl |
May contain | core: p |
Example | <segmentation>
<p>
<gi>s</gi> elements mark orthographic sentences and are numbered sequentially within
their parent <gi>div</gi> element </p>
</segmentation> |
Example | <p>
<gi>seg</gi> elements are used to mark functional constituents of various types within each
<gi>s</gi>; the typology used is defined by a <gi>taxonomy</gi> element in the corpus
header <gi>classDecl</gi>
</p> |
Schematron |
<sch:pattern is-a="declarable">
<sch:param name="tde"
value="tei:segmentation"/>
</sch:pattern> |
Content model | <content> <classRef key="model.pLike" minOccurs="1" maxOccurs="unbounded"/> </content> ⚓ |
Schema Declaration | element segmentation { tei_att.global.attributes, tei_att.declarable.attributes, tei_model.pLike+ }⚓ |
<setting> describes one particular setting in which a language interaction takes place. [16.2.3. The Setting Description] | |
Module | corpus — Formal specification |
Attributes |
|
Contained by | corpus: settingDesc |
May contain | |
Note | If the who attribute is not supplied, the setting is assumed to be that of all participants in the language interaction. |
Example | <setting>
<placeName>New York City, US</placeName>
<date>1989</date>
<locale>on a park bench</locale>
<activity>feeding birds</activity>
</setting> |
Content model | <content> <alternate> <classRef key="model.pLike" minOccurs="1" maxOccurs="unbounded"/> <alternate minOccurs="0" maxOccurs="unbounded"> <classRef key="model.nameLike.agent"/> <classRef key="model.dateLike"/> <classRef key="model.settingPart"/> </alternate> </alternate> </content> ⚓ |
Schema Declaration | element setting { tei_att.global.attributes, tei_att.ascribed.attributes, ( tei_model.pLike+ | ( tei_model.nameLike.agent | tei_model.dateLike | tei_model.settingPart )* ) }⚓ |
<settingDesc> (setting description) describes the setting or settings within which a language interaction takes place, or other places otherwise referred to in a text, edition, or metadata. [16.2. Contextual Information 2.4. The Profile Description] | |
Module | corpus — Formal specification |
Attributes |
|
Member of | |
Contained by | header: profileDesc |
May contain | |
Note | May contain a prose description organized as paragraphs, or a series of <setting> elements. If used to record not settings of language interactions, but other places mentioned in the text, then <place> optionally grouped by <listPlace> inside <standOff> should be preferred. |
Example | <settingDesc>
<p>Texts recorded in the
Canadian Parliament building in Ottawa, between April and November 1988 </p>
</settingDesc> |
Schematron |
<sch:pattern is-a="declarable">
<sch:param name="tde"
value="tei:settingDesc"/>
</sch:pattern> |
Content model | <content> <alternate> <classRef key="model.pLike" minOccurs="1" maxOccurs="unbounded"/> <alternate minOccurs="1" maxOccurs="unbounded"> <elementRef key="setting"/> <classRef key="model.placeLike"/> <elementRef key="listPlace"/> </alternate> </alternate> </content> ⚓ |
Schema Declaration | element settingDesc { tei_att.global.attributes, tei_att.declarable.attributes, ( tei_model.pLike+ | ( tei_setting | tei_model.placeLike | listPlace )+ ) }⚓ |
<sourceDesc> (source description) describes the source(s) from which an electronic text was derived or generated, typically a bibliographic description in the case of a digitized text, or a phrase such as ‘born digital’ for a text which has no previous existence. [2.2.7. The Source Description] | |
Module | header — Formal specification |
Attributes |
|
Contained by | header: fileDesc |
May contain | |
Example | <sourceDesc>
<bibl>
<title level="a">The Interesting story of the Children in the Wood</title>. In
<author>Victor E Neuberg</author>, <title>The Penny Histories</title>.
<publisher>OUP</publisher>
<date>1968</date>. </bibl>
</sourceDesc> |
Example | <sourceDesc>
<p>Born digital: no previous source exists.</p>
</sourceDesc> |
Schematron |
<sch:pattern is-a="declarable">
<sch:param name="tde"
value="tei:sourceDesc"/>
</sch:pattern> |
Content model | <content> <alternate> <classRef key="model.pLike" minOccurs="1" maxOccurs="unbounded"/> <alternate minOccurs="1" maxOccurs="unbounded"> <classRef key="model.biblLike"/> <classRef key="model.sourceDescPart"/> <classRef key="model.listLike"/> </alternate> </alternate> </content> ⚓ |
Schema Declaration | element sourceDesc { tei_att.global.attributes, tei_att.declarable.attributes, ( tei_model.pLike+ | ( tei_model.biblLike | tei_model.sourceDescPart | tei_model.listLike )+ ) }⚓ |
<sourceDoc> contains a transcription or other representation of a single source document potentially forming part of a dossier génétique or collection of sources. [12.1. Digital Facsimiles 12.2.2. Embedded Transcription] | |
Module | transcr — Formal specification |
Attributes |
|
Member of | |
Contained by | |
May contain | |
Note | This element may be used as an alternative to <facsimile> for TEI documents containing only page images, or for documents containing both images and transcriptions. Transcriptions may be provided within the <surface> elements making up a source document, in parallel with them as part of a <text> element, or in both places if the encoder wishes to distinguish these two modes of transcription. |
Example | <sourceDoc>
<surfaceGrp n="leaf1">
<surface facs="page1.png">
<zone>All the writing on page 1</zone>
</surface>
<surface>
<graphic url="page2-highRes.png"/>
<graphic url="page2-lowRes.png"/>
<zone>
<line>A line of writing on page 2</line>
<line>Another line of writing on page 2</line>
</zone>
</surface>
</surfaceGrp>
</sourceDoc> |
Content model | <content> <alternate minOccurs="1" maxOccurs="unbounded"> <classRef key="model.global"/> <classRef key="model.graphicLike"/> <elementRef key="surface"/> <elementRef key="surfaceGrp"/> </alternate> </content> ⚓ |
Schema Declaration | element sourceDoc { tei_att.global.attributes, tei_att.declaring.attributes, ( tei_model.global | tei_model.graphicLike | tei_surface | tei_surfaceGrp )+ }⚓ |
<space> (space) indicates the location of a significant space in the text. [12.4.1. Space] | |||||||||||||||||||
Module | transcr — Formal specification | ||||||||||||||||||
Attributes |
| ||||||||||||||||||
Member of | |||||||||||||||||||
Contained by | |||||||||||||||||||
May contain | core: desc | ||||||||||||||||||
Note | This element should be used wherever it is desired to record an unusual space in the source text, e.g. space left for a word to be filled in later, for later rubrication, etc. It is not intended to be used to mark normal inter-word space or the like. | ||||||||||||||||||
Example | By god if wommen had writen storyes
As <space quantity="7" unit="minims"/> han within her oratoryes | ||||||||||||||||||
Example | στρατηλάτ<space quantity="1" unit="chars"/>ου | ||||||||||||||||||
Content model | <content> <alternate minOccurs="0" maxOccurs="unbounded"> <classRef key="model.descLike"/> <classRef key="model.certLike"/> </alternate> </content> ⚓ | ||||||||||||||||||
Schema Declaration | element space { tei_att.global.attribute.xmlid, tei_att.global.attribute.n, tei_att.global.attribute.xmllang, tei_att.global.attribute.xmlbase, tei_att.global.attribute.xmlspace, tei_att.global.analytic.attribute.ana, tei_att.global.change.attribute.change, tei_att.global.facs.attribute.facs, tei_att.global.rendition.attribute.rend, tei_att.global.rendition.attribute.style, tei_att.global.rendition.attribute.rendition, tei_att.global.responsibility.attribute.cert, tei_att.global.source.attribute.source, tei_att.dimensions.attributes, tei_att.typed.attributes, attribute resp { list { + } }?, attribute dim { "horizontal" | "vertical" }?, ( tei_model.descLike | tei_model.certLike )* }⚓ |
<subst> (substitution) groups one or more deletions (or surplus text) with one or more additions when the combination is to be regarded as a single intervention in the text. [12.3.1.5. Substitutions] | |
Module | transcr — Formal specification |
Attributes |
|
Member of | |
Contained by | |
May contain | |
Example | ... are all included. <del hand="#RG">It is</del>
<subst>
<add>T</add>
<del>t</del>
</subst>he expressed
|
Example | that he and his Sister Miſs D — <lb/>who always lived with him, wd. be <subst>
<del>very</del>
<lb/>
<add>principally</add>
</subst> remembered in her Will.
|
Example | <ab>τ<subst>
<add place="above">ῶν</add>
<del>α</del>
</subst>
συνκυρόντ<subst>
<add place="above">ων</add>
<del>α</del>
</subst>
ἐργαστηρί<subst>
<add place="above">ων</add>
<del>α</del>
</subst>
</ab> |
Example | <subst>
<del>
<gap reason="illegible" quantity="5"
unit="character"/>
</del>
<add>apple</add>
</subst> |
Schematron |
<sch:rule context="tei:subst">
<sch:assert test="child::tei:add and (child::tei:del or child::tei:surplus)">
<sch:name/> must have at least one child add and at least one child del or surplus</sch:assert>
</sch:rule> |
Content model | <content> <alternate minOccurs="1" maxOccurs="unbounded"> <elementRef key="add"/> <elementRef key="surplus"/> <elementRef key="del"/> <classRef key="model.milestoneLike"/> </alternate> </content> ⚓ |
Schema Declaration | element subst { tei_att.global.attributes, tei_att.dimensions.attributes, tei_att.transcriptional.attributes, ( add | tei_surplus | del | tei_model.milestoneLike )+ }⚓ |
<substJoin> (substitution join) identifies a series of possibly fragmented additions, deletions, or other revisions on a manuscript that combine to make up a single intervention in the text. [12.3.1.5. Substitutions] | |
Module | transcr — Formal specification |
Attributes |
|
Member of | |
Contained by | |
May contain | core: desc |
Example | While <del xml:id="r112">pondering</del> thus <add xml:id="r113">she mus'd</add>, her pinions fann'd
<substJoin target="#r112 #r113"/> |
Content model | <content> <alternate minOccurs="0" maxOccurs="unbounded"> <classRef key="model.descLike"/> <classRef key="model.certLike"/> </alternate> </content> ⚓ |
Schema Declaration | element substJoin { tei_att.global.attributes, tei_att.dimensions.attributes, tei_att.pointing.attributes, tei_att.transcriptional.attributes, ( tei_model.descLike | tei_model.certLike )* }⚓ |
<supplied> (supplied) signifies text supplied by the transcriber or editor for any reason; for example because the original cannot be read due to physical damage, or because of an obvious omission by the author or scribe. [12.3.3.1. Damage, Illegibility, and Supplied Text] | |||||||
Module | transcr — Formal specification | ||||||
Attributes |
| ||||||
Member of | |||||||
Contained by | |||||||
May contain | |||||||
Note | The <damage>, <gap>, <del>, <unclear> and <supplied> elements may be closely allied in use. See section 12.3.3.2. Use of the gap, del, damage, unclear, and supplied Elements in Combination for discussion of which element is appropriate for which circumstance. | ||||||
Example | I am dr Sr yr
<supplied reason="illegible"
source="#amanuensis_copy">very humble Servt</supplied>
Sydney Smith | ||||||
Example | <supplied reason="omitted-in-original">Dedication</supplied> to the duke of Bejar | ||||||
Content model | <content> <macroRef key="macro.paraContent"/> </content> ⚓ | ||||||
Schema Declaration | element supplied { tei_att.global.attributes, tei_att.dimensions.attributes, tei_att.editLike.attributes, attribute reason { list { + } }?, tei_macro.paraContent }⚓ |
<surface> defines a written surface as a two-dimensional coordinate space, optionally grouping one or more graphic representations of that space, zones of interest within that space, and, when using an embedded transcription approach, transcriptions of the writing within them. [12.1. Digital Facsimiles 12.2.2. Embedded Transcription] | |||||||||||||||
Module | transcr — Formal specification | ||||||||||||||
Attributes |
| ||||||||||||||
Contained by | transcr: facsimile sourceDoc surface surfaceGrp zone | ||||||||||||||
May contain | |||||||||||||||
Note | The <surface> element represents any two-dimensional space on some physical surface forming part of the source material, such as a piece of paper, a face of a monument, a billboard, a scroll, a leaf etc. The coordinate space defined by this element may be thought of as a grid lrx - ulx units wide and uly - lry units high. The <surface> element may contain graphic representations or transcriptions of written zones, or both. The coordinate values used by every <zone> element contained by this element are to be understood with reference to the same grid. Where it is useful or meaningful to do so, any grouping of multiple <surface> elements may be indicated using the <surfaceGrp> element. | ||||||||||||||
Example | <facsimile>
<surface ulx="0" uly="0" lrx="200" lry="300">
<graphic url="Bovelles-49r.png"/>
</surface>
</facsimile> | ||||||||||||||
Content model | <content> <sequence> <alternate minOccurs="0" maxOccurs="unbounded"> <classRef key="model.global"/> <classRef key="model.labelLike"/> <classRef key="model.graphicLike"/> </alternate> <sequence minOccurs="0" maxOccurs="unbounded"> <alternate> <elementRef key="zone"/> <elementRef key="line"/> <elementRef key="path"/> <elementRef key="surface"/> <elementRef key="surfaceGrp"/> </alternate> <classRef key="model.global" minOccurs="0" maxOccurs="unbounded"/> </sequence> </sequence> </content> ⚓ | ||||||||||||||
Schema Declaration | element surface { tei_att.global.attributes, tei_att.coordinated.attributes, tei_att.declaring.attributes, tei_att.typed.attributes, attribute attachment { text }?, attribute flipping { text }?, ( ( tei_model.global | tei_model.labelLike | tei_model.graphicLike )*, ( ( ( tei_zone | tei_line | tei_path | tei_surface | tei_surfaceGrp ), tei_model.global* )* ) ) }⚓ |
<surfaceGrp> (surface group) defines any kind of useful grouping of written surfaces, for example the recto and verso of a single leaf, which the encoder wishes to treat as a single unit. [12.1. Digital Facsimiles] | |
Module | transcr — Formal specification |
Attributes |
|
Contained by | transcr: facsimile sourceDoc surface surfaceGrp |
May contain | |
Note | Where it is useful or meaningful to do so, any grouping of multiple <surface> elements may be indicated using the <surfaceGrp> elements. |
Example | <sourceDoc>
<surfaceGrp>
<surface ulx="0" uly="0" lrx="200"
lry="300">
<graphic url="Bovelles-49r.png"/>
</surface>
<surface ulx="0" uly="0" lrx="200"
lry="300">
<graphic url="Bovelles-49v.png"/>
</surface>
</surfaceGrp>
</sourceDoc> |
Content model | <content> <alternate minOccurs="1" maxOccurs="unbounded"> <classRef key="model.global"/> <elementRef key="surface"/> <elementRef key="surfaceGrp"/> </alternate> </content> ⚓ |
Schema Declaration | element surfaceGrp { tei_att.global.attributes, tei_att.declaring.attributes, tei_att.typed.attributes, ( tei_model.global | tei_surface | tei_surfaceGrp )+ }⚓ |
<surplus> (surplus) marks text present in the source which the editor believes to be superfluous or redundant. [12.3.3.1. Damage, Illegibility, and Supplied Text] | |||||||
Module | transcr — Formal specification | ||||||
Attributes |
| ||||||
Member of | |||||||
Contained by | |||||||
May contain | |||||||
Example | I am dr Sr yrs
<surplus reason="repeated">yrs</surplus>
Sydney Smith | ||||||
Content model | <content> <macroRef key="macro.paraContent"/> </content> ⚓ | ||||||
Schema Declaration | element surplus { tei_att.global.attributes, tei_att.dimensions.attributes, tei_att.editLike.attributes, attribute reason { list { + } }?, tei_macro.paraContent }⚓ |
<tagUsage> (element usage) documents the usage of a specific element within a specified document. [2.3.4. The Tagging Declaration] | |||||||||||||||||||
Module | header — Formal specification | ||||||||||||||||||
Attributes |
| ||||||||||||||||||
Contained by | header: namespace | ||||||||||||||||||
May contain | |||||||||||||||||||
Example | <tagsDecl partial="true">
<rendition xml:id="it" scheme="css"
selector="foreign, hi"> font-style: italic; </rendition>
<!-- ... -->
<namespace name="http://www.tei-c.org/ns/1.0">
<tagUsage gi="hi" occurs="28" withId="2"> Used to mark English words italicized in the copy text.</tagUsage>
<tagUsage gi="foreign">Used to mark non-English words in the copy text.</tagUsage>
<!-- ... -->
</namespace>
</tagsDecl> | ||||||||||||||||||
Content model | <content> <macroRef key="macro.limitedContent"/> </content> ⚓ | ||||||||||||||||||
Schema Declaration | element tagUsage { tei_att.global.attributes, tei_att.datcat.attributes, attribute gi { text }, attribute occurs { text }?, attribute withId { text }?, tei_macro.limitedContent }⚓ |
<tagsDecl> (tagging declaration) provides detailed information about the tagging applied to a document. [2.3.4. The Tagging Declaration 2.3. The Encoding Description] | |||||||||
Module | header — Formal specification | ||||||||
Attributes |
| ||||||||
Member of | |||||||||
Contained by | header: encodingDesc | ||||||||
May contain | header: namespace | ||||||||
Example | partial attribute were not specified here, the implication would be that the document in question contains only <hi>, <title>, and <para> elements. | If the||||||||
Content model | <content> <sequence> <elementRef key="rendition" minOccurs="0" maxOccurs="unbounded"/> <elementRef key="namespace" minOccurs="0" maxOccurs="unbounded"/> </sequence> </content> ⚓ | ||||||||
Schema Declaration | element tagsDecl { tei_att.global.attributes, attribute partial { text }?, ( rendition*, tei_namespace* ) }⚓ |
<taxonomy> (taxonomy) defines a typology either implicitly, by means of a bibliographic citation, or explicitly by a structured taxonomy. [2.3.7. The Classification Declaration] | |
Module | header — Formal specification |
Attributes |
|
Contained by | |
May contain | |
Note | Nested taxonomies are common in many fields, so the <taxonomy> element can be nested. |
Example | <taxonomy xml:id="tax.b">
<bibl>Brown Corpus</bibl>
<category xml:id="tax.b.a">
<catDesc>Press Reportage</catDesc>
<category xml:id="tax.b.a1">
<catDesc>Daily</catDesc>
</category>
<category xml:id="tax.b.a2">
<catDesc>Sunday</catDesc>
</category>
<category xml:id="tax.b.a3">
<catDesc>National</catDesc>
</category>
<category xml:id="tax.b.a4">
<catDesc>Provincial</catDesc>
</category>
<category xml:id="tax.b.a5">
<catDesc>Political</catDesc>
</category>
<category xml:id="tax.b.a6">
<catDesc>Sports</catDesc>
</category>
</category>
<category xml:id="tax.b.d">
<catDesc>Religion</catDesc>
<category xml:id="tax.b.d1">
<catDesc>Books</catDesc>
</category>
<category xml:id="tax.b.d2">
<catDesc>Periodicals and tracts</catDesc>
</category>
</category>
</taxonomy> |
Example | <taxonomy>
<category xml:id="literature">
<catDesc>Literature</catDesc>
<category xml:id="poetry">
<catDesc>Poetry</catDesc>
<category xml:id="sonnet">
<catDesc>Sonnet</catDesc>
<category xml:id="shakesSonnet">
<catDesc>Shakespearean Sonnet</catDesc>
</category>
<category xml:id="petraSonnet">
<catDesc>Petrarchan Sonnet</catDesc>
</category>
</category>
<category xml:id="haiku">
<catDesc>Haiku</catDesc>
</category>
</category>
<category xml:id="drama">
<catDesc>Drama</catDesc>
</category>
</category>
<category xml:id="meter">
<catDesc>Metrical Categories</catDesc>
<category xml:id="feet">
<catDesc>Metrical Feet</catDesc>
<category xml:id="iambic">
<catDesc>Iambic</catDesc>
</category>
<category xml:id="trochaic">
<catDesc>trochaic</catDesc>
</category>
</category>
<category xml:id="feetNumber">
<catDesc>Number of feet</catDesc>
<category xml:id="pentameter">
<catDesc>>Pentameter</catDesc>
</category>
<category xml:id="tetrameter">
<catDesc>>Tetrameter</catDesc>
</category>
</category>
</category>
</taxonomy>
<!-- elsewhere in document -->
<lg ana="#shakesSonnet #iambic #pentameter">
<l>Shall I compare thee to a summer's day</l>
<!-- ... -->
</lg> |
Content model | <content> <alternate> <alternate> <alternate minOccurs="1" maxOccurs="unbounded"> <elementRef key="category"/> <elementRef key="taxonomy"/> </alternate> <sequence> <alternate minOccurs="1" maxOccurs="unbounded"> <classRef key="model.descLike" minOccurs="1" maxOccurs="1"/> <elementRef key="equiv" minOccurs="1" maxOccurs="1"/> <elementRef key="gloss" minOccurs="1" maxOccurs="1"/> </alternate> <alternate minOccurs="0" maxOccurs="unbounded"> <elementRef key="category"/> <elementRef key="taxonomy"/> </alternate> </sequence> </alternate> <sequence> <classRef key="model.biblLike"/> <alternate minOccurs="0" maxOccurs="unbounded"> <elementRef key="category"/> <elementRef key="taxonomy"/> </alternate> </sequence> </alternate> </content> ⚓ |
Schema Declaration | element taxonomy { tei_att.global.attributes, tei_att.datcat.attributes, ( ( ( tei_category | tei_taxonomy )+ | ( ( tei_model.descLike | equiv | gloss )+, ( tei_category | tei_taxonomy )* ) ) | ( tei_model.biblLike, ( tei_category | tei_taxonomy )* ) ) }⚓ |
<teiCorpus> (TEI corpus) contains the whole of a TEI encoded corpus, comprising a single corpus header and one or more <TEI> elements, each containing a single text header and a text. [4. Default Text Structure 16.1. Varieties of Composite Text] | |||||||||
Module | core — Formal specification | ||||||||
Attributes |
| ||||||||
Member of | |||||||||
Contained by | core: teiCorpus | ||||||||
May contain | |||||||||
Note | Should contain one <teiHeader> for the corpus, and a series of <TEI> elements, one for each text. As with all elements in the TEI scheme (except <egXML>) this element is in the TEI namespace (see 5.7.2. Namespaces). Thus, when it is used as the outermost element of a TEI document, it is necessary to specify the TEI namespace on it. This is customarily achieved by including http://www.tei-c.org/ns/1.0 as the value of the XML namespace declaration (xmlns), without indicating a prefix, and then not using a prefix on TEI elements in the rest of the document. For example: <teiCorpus version="4.8.1" xml:lang="en" xmlns="http://www.tei-c.org/ns/1.0">. | ||||||||
Example | <teiCorpus version="3.3.0" xmlns="http://www.tei-c.org/ns/1.0">
<teiHeader>
<!-- header for corpus -->
</teiHeader>
<TEI>
<teiHeader>
<!-- header for first text -->
</teiHeader>
<text>
<!-- content of first text -->
</text>
</TEI>
<TEI>
<teiHeader>
<!-- header for second text -->
</teiHeader>
<text>
<!-- content of second text -->
</text>
</TEI>
<!-- more TEI elements here -->
</teiCorpus> | ||||||||
Content model | <content> <sequence> <elementRef key="teiHeader"/> <classRef key="model.resource" minOccurs="0" maxOccurs="unbounded"/> <classRef key="model.describedResource" minOccurs="1" maxOccurs="unbounded"/> </sequence> </content> ⚓ | ||||||||
Schema Declaration | element teiCorpus { tei_att.global.attributes, tei_att.typed.attributes, attribute version { text }?, ( tei_teiHeader, tei_model.resource*, tei_model.describedResource+ ) }⚓ |
<teiHeader> (TEI header) supplies descriptive and declarative metadata associated with a digital resource or set of resources. [2.1.1. The TEI Header and Its Components 16.1. Varieties of Composite Text] | |
Module | header — Formal specification |
Attributes |
|
Contained by | |
May contain | header: encodingDesc fileDesc profileDesc revisionDesc |
Note | One of the few elements unconditionally required in any TEI document. |
Example | <teiHeader>
<fileDesc>
<titleStmt>
<title>Shakespeare: the first folio (1623) in electronic form</title>
<author>Shakespeare, William (1564–1616)</author>
<respStmt>
<resp>Originally prepared by</resp>
<name>Trevor Howard-Hill</name>
</respStmt>
<respStmt>
<resp>Revised and edited by</resp>
<name>Christine Avern-Carr</name>
</respStmt>
</titleStmt>
<publicationStmt>
<distributor>Oxford Text Archive</distributor>
<address>
<addrLine>13 Banbury Road, Oxford OX2 6NN, UK</addrLine>
</address>
<idno type="OTA">119</idno>
<availability>
<p>Freely available on a non-commercial basis.</p>
</availability>
<date when="1968">1968</date>
</publicationStmt>
<sourceDesc>
<bibl>The first folio of Shakespeare, prepared by Charlton Hinman (The Norton Facsimile,
1968)</bibl>
</sourceDesc>
</fileDesc>
<encodingDesc>
<projectDesc>
<p>Originally prepared for use in the production of a series of old-spelling
concordances in 1968, this text was extensively checked and revised for use during the
editing of the new Oxford Shakespeare (Wells and Taylor, 1989).</p>
</projectDesc>
<editorialDecl>
<correction>
<p>Turned letters are silently corrected.</p>
</correction>
<normalization>
<p>Original spelling and typography is retained, except that long s and ligatured
forms are not encoded.</p>
</normalization>
</editorialDecl>
<refsDecl xml:id="ASLREF">
<cRefPattern matchPattern="(\S+) ([^.]+)\.(.*)"
replacementPattern="#xpath(//div1[@n='$1']/div2/[@n='$2']//lb[@n='$3'])">
<p>A reference is created by assembling the following, in the reverse order as that
listed here: <list>
<item>the <att>n</att> value of the preceding <gi>lb</gi>
</item>
<item>a period</item>
<item>the <att>n</att> value of the ancestor <gi>div2</gi>
</item>
<item>a space</item>
<item>the <att>n</att> value of the parent <gi>div1</gi>
</item>
</list>
</p>
</cRefPattern>
</refsDecl>
</encodingDesc>
<revisionDesc>
<list>
<item>
<date when="1989-04-12">12 Apr 89</date> Last checked by CAC</item>
<item>
<date when="1989-03-01">1 Mar 89</date> LB made new file</item>
</list>
</revisionDesc>
</teiHeader> |
Content model | <content> <sequence> <elementRef key="fileDesc"/> <classRef key="model.teiHeaderPart" minOccurs="0" maxOccurs="unbounded"/> <elementRef key="revisionDesc" minOccurs="0"/> </sequence> </content> ⚓ |
Schema Declaration | element teiHeader { tei_att.global.attributes, ( tei_fileDesc, tei_model.teiHeaderPart*, tei_revisionDesc? ) }⚓ |
<term> (term) contains a single-word, multi-word, or symbolic designation which is regarded as a technical term. [3.4.1. Terms and Glosses] | |
Module | core — Formal specification |
Attributes |
|
Member of | |
Contained by | |
May contain | |
Note | When this element appears within an <index> element, it is understood to supply the form under which an index entry is to be made for that location. Elsewhere, it is understood simply to indicate that its content is to be regarded as a technical or specialised term. It may be associated with a <gloss> element by means of its ref attribute; alternatively a <gloss> element may point to a <term> element by means of its target attribute. In formal terminological work, there is frequently discussion over whether terms must be atomic or may include multi-word lexical items, symbolic designations, or phraseological units. The <term> element may be used to mark any of these. No position is taken on the philosophical issue of what a term can be; the looser definition simply allows the <term> element to be used by practitioners of any persuasion. As with other members of the att.canonical class, instances of this element occuring in a text may be associated with a canonical definition, either by means of a URI (using the ref attribute), or by means of some system-specific code value (using the key attribute). Because the mutually exclusive target and cRef attributes overlap with the function of the ref attribute, they are deprecated and may be removed at a subsequent release. |
Example | A computational device that infers structure
from grammatical strings of words is known as a <term>parser</term>, and much of the history
of NLP over the last 20 years has been occupied with the design of parsers. |
Example | We may define <term xml:id="TDPV1" rend="sc">discoursal point of view</term> as
<gloss target="#TDPV1">the relationship, expressed
through discourse structure, between the implied author or some other addresser, and the
fiction.</gloss> |
Example | We may define <term ref="#TDPV2" rend="sc">discoursal point of view</term> as
<gloss xml:id="TDPV2">the relationship, expressed
through discourse structure, between the implied author or some other addresser, and the
fiction.</gloss> |
Example | We discuss Leech's concept of <term ref="myGlossary.xml#TDPV2" rend="sc">discoursal point of view</term> below. |
Content model | <content> <macroRef key="macro.phraseSeq"/> </content> ⚓ |
Schema Declaration | element term { tei_att.global.attributes, tei_att.cReferencing.attributes, tei_att.canonical.attributes, tei_att.cmc.attributes, tei_att.declaring.attributes, tei_att.pointing.attributes, tei_att.sortable.attributes, tei_att.typed.attributes, tei_macro.phraseSeq }⚓ |
<text> (text) contains a single text of any kind, whether unitary or composite, for example a poem or drama, a collection of essays, a novel, a dictionary, or a corpus sample. [4. Default Text Structure 16.1. Varieties of Composite Text] | |
Module | textstructure — Formal specification |
Attributes |
|
Member of | |
Contained by | |
May contain | |
Note | This element should not be used to represent a text which is inserted at an arbitrary point within the structure of another, for example as in an embedded or quoted narrative; the <floatingText> is provided for this purpose. |
Example | <text>
<front>
<docTitle>
<titlePart>Autumn Haze</titlePart>
</docTitle>
</front>
<body>
<l>Is it a dragonfly or a maple leaf</l>
<l>That settles softly down upon the water?</l>
</body>
</text> |
Example | The body of a text may be replaced by a group of nested texts, as in the following schematic: <text>
<front>
<!-- front matter for the whole group -->
</front>
<group>
<text>
<!-- first text -->
</text>
<text>
<!-- second text -->
</text>
</group>
</text> |
Content model | <content> <sequence> <classRef key="model.global" minOccurs="0" maxOccurs="unbounded"/> <sequence minOccurs="0"> <elementRef key="front"/> <classRef key="model.global" minOccurs="0" maxOccurs="unbounded"/> </sequence> <alternate> <elementRef key="body"/> <elementRef key="group"/> </alternate> <classRef key="model.global" minOccurs="0" maxOccurs="unbounded"/> <sequence minOccurs="0"> <elementRef key="back"/> <classRef key="model.global" minOccurs="0" maxOccurs="unbounded"/> </sequence> </sequence> </content> ⚓ |
Schema Declaration | element text { tei_att.global.attributes, tei_att.declaring.attributes, tei_att.typed.attributes, tei_att.written.attributes, ( tei_model.global*, ( ( tei_front, tei_model.global* )? ), ( tei_body | group ), tei_model.global*, ( ( tei_back, tei_model.global* )? ) ) }⚓ |
<textClass> (text classification) groups information which describes the nature or topic of a text in terms of a standard classification scheme, thesaurus, etc. [2.4.3. The Text Classification] | |
Module | header — Formal specification |
Attributes |
|
Member of | |
Contained by | header: profileDesc |
May contain | header: catRef |
Example | <taxonomy>
<category xml:id="acprose">
<catDesc>Academic prose</catDesc>
</category>
<!-- other categories here -->
</taxonomy>
<!-- ... -->
<textClass>
<catRef target="#acprose"/>
<classCode scheme="http://www.udcc.org">001.9</classCode>
<keywords scheme="http://authorities.loc.gov">
<list>
<item>End of the world</item>
<item>History - philosophy</item>
</list>
</keywords>
</textClass> |
Schematron |
<sch:pattern is-a="declarable">
<sch:param name="tde" value="tei:textClass"/>
</sch:pattern> |
Content model | <content> <alternate minOccurs="0" maxOccurs="unbounded"> <elementRef key="classCode"/> <elementRef key="catRef"/> <elementRef key="keywords"/> </alternate> </content> ⚓ |
Schema Declaration | element textClass { tei_att.global.attributes, tei_att.declarable.attributes, ( classCode | tei_catRef | keywords )* }⚓ |
<time> (time) contains a phrase defining a time of day in any format. [3.6.4. Dates and Times] | |
Module | core — Formal specification |
Attributes |
|
Member of | |
Contained by | |
May contain | |
Example | As he sat smiling, the
quarter struck — <time when="11:45:00">the quarter to twelve</time>. |
Content model | <content> <alternate minOccurs="0" maxOccurs="unbounded"> <textNode/> <classRef key="model.gLike"/> <classRef key="model.phrase"/> <classRef key="model.global"/> </alternate> </content> ⚓ |
Schema Declaration | element time { tei_att.global.attributes, tei_att.calendarSystem.attributes, tei_att.canonical.attributes, tei_att.cmc.attributes, tei_att.datable.attributes, tei_att.dimensions.attributes, tei_att.editLike.attributes, tei_att.typed.attributes, ( text | tei_model.gLike | tei_model.phrase | tei_model.global )* }⚓ |
<title> (title) contains a title for any kind of work. [3.12.2.2. Titles, Authors, and Editors 2.2.1. The Title Statement 2.2.5. The Series Statement] | |||||||||||||||||||||||
Module | core — Formal specification | ||||||||||||||||||||||
Attributes |
| ||||||||||||||||||||||
Member of | |||||||||||||||||||||||
Contained by | |||||||||||||||||||||||
May contain | |||||||||||||||||||||||
Note | The attributes key and ref, inherited from the class att.canonical may be used to indicate the canonical form for the title; the former, by supplying (for example) the identifier of a record in some external library system; the latter by pointing to an XML element somewhere containing the canonical form of the title. | ||||||||||||||||||||||
Example | <title>Information Technology and the Research Process: Proceedings of
a conference held at Cranfield Institute of Technology, UK,
18–21 July 1989</title> | ||||||||||||||||||||||
Example | <title>Hardy's Tess of the D'Urbervilles: a machine readable
edition</title> | ||||||||||||||||||||||
Example | <title type="full">
<title type="main">Synthèse</title>
<title type="sub">an international journal for
epistemology, methodology and history of
science</title>
</title> | ||||||||||||||||||||||
Content model | <content> <macroRef key="macro.paraContent"/> </content> ⚓ | ||||||||||||||||||||||
Schema Declaration | element title { tei_att.global.attributes, tei_att.canonical.attributes, tei_att.cmc.attributes, tei_att.datable.attributes, tei_att.typed.attribute.subtype, attribute type { text }?, attribute level { "a" | "m" | "j" | "s" | "u" }?, tei_macro.paraContent }⚓ |
<titleStmt> (title statement) groups information about the title of a work and those responsible for its content. [2.2.1. The Title Statement 2.2. The File Description] | |
Module | header — Formal specification |
Attributes |
|
Contained by | header: fileDesc |
May contain | |
Example | <titleStmt>
<title>Capgrave's Life of St. John Norbert: a machine-readable transcription</title>
<respStmt>
<resp>compiled by</resp>
<name>P.J. Lucas</name>
</respStmt>
</titleStmt> |
Content model | <content> <sequence> <elementRef key="title" minOccurs="1" maxOccurs="unbounded"/> <classRef key="model.respLike" minOccurs="0" maxOccurs="unbounded"/> </sequence> </content> ⚓ |
Schema Declaration | element titleStmt { tei_att.global.attributes, ( tei_title+, tei_model.respLike* ) }⚓ |
<transpose> describes a single textual transposition as an ordered list of at least two pointers specifying the order in which the elements indicated should be re-combined. [12.3.4.5. Transpositions] | |
Module | transcr — Formal specification |
Attributes |
|
Contained by | transcr: listTranspose |
May contain | Empty element |
Note | Transposition is usually indicated in a document by a metamark such as a wavy line or numbering. The order in which <ptr> elements appear within a <transpose> element should correspond with the desired order, as indicated by the metamark. |
Example | <transpose>
<ptr target="#ib02"/>
<ptr target="#ib01"/>
</transpose> The transposition recorded here indicates that the content of the element with identifier ib02 should appear before the content of the element with identifier ib01 . |
Content model | <content> <elementRef key="ptr" minOccurs="2" maxOccurs="unbounded"/> </content> ⚓ |
Schema Declaration | element transpose { tei_att.global.attributes, ( ptr, ptr, ptr* ) }⚓ |
<undo> indicates one or more marked-up interventions in a document which have subsequently been marked for cancellation. [12.3.4.4. Confirmation, Cancellation, and Reinstatement of Modifications] | |||||||
Module | transcr — Formal specification | ||||||
Attributes |
| ||||||
Member of | |||||||
Contained by | |||||||
May contain | Empty element | ||||||
Example | <line>This is <del change="#s2" rend="overstrike">
<seg xml:id="undo-a">just some</seg>
sample <seg xml:id="undo-b">text</seg>,
we need</del>
<add change="#s2">not</add>
a real example.</line>
<undo target="#undo-a #undo-b"
rend="dotted" change="#s3"/> This encoding represents the following sequence of events:
| ||||||
Content model | <content> <empty/> </content> ⚓ | ||||||
Schema Declaration | element undo { tei_att.global.attributes, tei_att.dimensions.attributes, tei_att.spanning.attributes, tei_att.transcriptional.attributes, attribute target { list { + } }?, empty }⚓ |
<unicodeProp> (unicode property) provides a Unicode property for a character (or glyph). [5.2.1. Character Properties] | |||||||||||||||
Module | gaiji — Formal specification | ||||||||||||||
Attributes |
| ||||||||||||||
Contained by | |||||||||||||||
May contain | Empty element | ||||||||||||||
Note | A definitive list of current Unicode property names is provided in The Unicode Standard. | ||||||||||||||
Example | <char xml:id="U4EBA_circled">
<unicodeProp name="Decomposition_Mapping"
value="circle" version="12.1"/>
<localProp name="Name"
value="CIRCLED IDEOGRAPH 4EBA"/>
<localProp name="daikanwa" value="36"/>
<mapping type="standard">人</mapping>
</char> | ||||||||||||||
Content model | <content> <empty/> </content> ⚓ | ||||||||||||||
Schema Declaration | element unicodeProp { tei_att.global.attributes, tei_att.gaijiProp.attribute.version, tei_att.gaijiProp.attribute.scheme, tei_att.datable.attribute.period, tei_att.datable.w3c.attribute.when, tei_att.datable.w3c.attribute.notBefore, tei_att.datable.w3c.attribute.notAfter, tei_att.datable.w3c.attribute.from, tei_att.datable.w3c.attribute.to, attribute name { "Age" | "AHex" | "Alpha" | "Alphabetic" | "ASCII_Hex_Digit" | "bc" | "Bidi_C" | "Bidi_Class" | "Bidi_Control" | "Bidi_M" | "Bidi_Mirrored" | "Bidi_Mirroring_Glyph" | "Bidi_Paired_Bracket" | "Bidi_Paired_Bracket_Type" | "blk" | "Block" | "bmg" | "bpb" | "bpt" | "Canonical_Combining_Class" | "Case_Folding" | "Case_Ignorable" | "Cased" | "ccc" | "CE" | "cf" | "Changes_When_Casefolded" | "Changes_When_Casemapped" | "Changes_When_Lowercased" | "Changes_When_NFKC_Casefolded" | "Changes_When_Titlecased" | "Changes_When_Uppercased" | "CI" | "Comp_Ex" | "Composition_Exclusion" | "CWCF" | "CWCM" | "CWKCF" | "CWL" | "CWT" | "CWU" | "Dash" | "Decomposition_Mapping" | "Decomposition_Type" | "Default_Ignorable_Code_Point" | "Dep" | "Deprecated" | "DI" | "Dia" | "Diacritic" | "dm" | "dt" | "ea" | "East_Asian_Width" | "EqUIdeo" | "Equivalent_Unified_Ideograph" | "Expands_On_NFC" | "Expands_On_NFD" | "Expands_On_NFKC" | "Expands_On_NFKD" | "Ext" | "Extender" | "FC_NFKC" | "FC_NFKC_Closure" | "Full_Composition_Exclusion" | "gc" | "GCB" | "General_Category" | "Gr_Base" | "Gr_Ext" | "Gr_Link" | "Grapheme_Base" | "Grapheme_Cluster_Break" | "Grapheme_Extend" | "Grapheme_Link" | "Hangul_Syllable_Type" | "Hex" | "Hex_Digit" | "hst" | "Hyphen" | "ID_Continue" | "ID_Start" | "IDC" | "Ideo" | "Ideographic" | "IDS" | "IDS_Binary_Operator" | "IDS_Trinary_Operator" | "IDSB" | "IDST" | "Indic_Positional_Category" | "Indic_Syllabic_Category" | "InPC" | "InSC" | "isc" | "ISO_Comment" | "Jamo_Short_Name" | "jg" | "Join_C" | "Join_Control" | "Joining_Group" | "Joining_Type" | "JSN" | "jt" | "kAccountingNumeric" | "kCompatibilityVariant" | "kIICore" | "kIRG_GSource" | "kIRG_HSource" | "kIRG_JSource" | "kIRG_KPSource" | "kIRG_KSource" | "kIRG_MSource" | "kIRG_TSource" | "kIRG_USource" | "kIRG_VSource" | "kOtherNumeric" | "kPrimaryNumeric" | "kRSUnicode" | "lb" | "lc" | "Line_Break" | "LOE" | "Logical_Order_Exception" | "Lower" | "Lowercase" | "Lowercase_Mapping" | "Math" | "na" | "na1" | "Name" | "Name_Alias" | "NChar" | "NFC_QC" | "NFC_Quick_Check" | "NFD_QC" | "NFD_Quick_Check" | "NFKC_Casefold" | "NFKC_CF" | "NFKC_QC" | "NFKC_Quick_Check" | "NFKD_QC" | "NFKD_Quick_Check" | "Noncharacter_Code_Point" | "nt" | "Numeric_Type" | "Numeric_Value" | "nv" | "OAlpha" | "ODI" | "OGr_Ext" | "OIDC" | "OIDS" | "OLower" | "OMath" | "Other_Alphabetic" | "Other_Default_Ignorable_Code_Point" | "Other_Grapheme_Extend" | "Other_ID_Continue" | "Other_ID_Start" | "Other_Lowercase" | "Other_Math" | "Other_Uppercase" | "OUpper" | "Pat_Syn" | "Pat_WS" | "Pattern_Syntax" | "Pattern_White_Space" | "PCM" | "Prepended_Concatenation_Mark" | "QMark" | "Quotation_Mark" | "Radical" | "Regional_Indicator" | "RI" | "SB" | "sc" | "scf" | "Script" | "Script_Extensions" | "scx" | "SD" | "Sentence_Break" | "Sentence_Terminal" | "Simple_Case_Folding" | "Simple_Lowercase_Mapping" | "Simple_Titlecase_Mapping" | "Simple_Uppercase_Mapping" | "slc" | "Soft_Dotted" | "stc" | "STerm" | "suc" | "tc" | "Term" | "Terminal_Punctuation" | "Titlecase_Mapping" | "uc" | "UIdeo" | "Unicode_1_Name" | "Unified_Ideograph" | "Upper" | "Uppercase" | "Uppercase_Mapping" | "Variation_Selector" | "Vertical_Orientation" | "vo" | "VS" | "WB" | "White_Space" | "Word_Break" | "WSpace" | "XID_Continue" | "XID_Start" | "XIDC" | "XIDS" | "XO_NFC" | "XO_NFD" | "XO_NFKC" | "XO_NFKD" }, attribute value { text }, empty }⚓ |
<unihanProp> (unihan property) holds the name and value of a normative or informative Unihan character (or glyph) property as part of its attributes. [5.2.1. Character Properties] | |||||||||||||||
Module | gaiji — Formal specification | ||||||||||||||
Attributes |
| ||||||||||||||
Contained by | |||||||||||||||
May contain | Empty element | ||||||||||||||
Note | A definitive list of current Unihan property names is provided in the Unicode Han Database. | ||||||||||||||
Example | <unihanProp name="kRSKangXi" value="120.5"
version="12.1"/> | ||||||||||||||
Content model | <content> <empty/> </content> ⚓ | ||||||||||||||
Schema Declaration | element unihanProp { tei_att.global.attributes, tei_att.gaijiProp.attribute.version, tei_att.gaijiProp.attribute.scheme, tei_att.datable.attribute.period, tei_att.datable.w3c.attribute.when, tei_att.datable.w3c.attribute.notBefore, tei_att.datable.w3c.attribute.notAfter, tei_att.datable.w3c.attribute.from, tei_att.datable.w3c.attribute.to, attribute name { "kZVariant" | "kAccountingNumeric" | "kBigFive" | "kCCCII" | "kCNS1986" | "kCNS1992" | "kCangjie" | "kCantonese" | "kCheungBauer" | "kCheungBauerIndex" | "kCihaiT" | "kCompatibilityVariant" | "kCowles" | "kDaeJaweon" | "kDefinition" | "kEACC" | "kFenn" | "kFennIndex" | "kFourCornerCode" | "kFrequency" | "kGB0" | "kGB1" | "kGB3" | "kGB5" | "kGB7" | "kGB8" | "kGSR" | "kGradeLevel" | "kHDZRadBreak" | "kHKGlyph" | "kHKSCS" | "kHanYu" | "kHangul" | "kHanyuPinlu" | "kHanyuPinyin" | "kIBMJapan" | "kIICore" | "kIRGDaeJaweon" | "kIRGDaiKanwaZiten" | "kIRGHanyuDaZidian" | "kIRGKangXi" | "kIRG_GSource" | "kIRG_HSource" | "kIRG_JSource" | "kIRG_KPSource" | "kIRG_KSource" | "kIRG_MSource" | "kIRG_TSource" | "kIRG_USource" | "kIRG_VSource" | "kJIS0213" | "kJa" | "kJapaneseKun" | "kJapaneseOn" | "kJinmeiyoKanji" | "kJis0" | "kJis1" | "kJoyoKanji" | "kKPS0" | "kKPS1" | "kKSC0" | "kKSC1" | "kKangXi" | "kKarlgren" | "kKorean" | "kKoreanEducationHanja" | "kKoreanName" | "kLau" | "kMainlandTelegraph" | "kMandarin" | "kMatthews" | "kMeyerWempe" | "kMorohashi" | "kNelson" | "kOtherNumeric" | "kPhonetic" | "kPrimaryNumeric" | "kPseudoGB1" | "kRSAdobe_Japan1_6" | "kRSJapanese" | "kRSKanWa" | "kRSKangXi" | "kRSKorean" | "kRSUnicode" | "kSBGY" | "kSemanticVariant" | "kSimplifiedVariant" | "kSpecializedSemanticVariant" | "kTGH" | "kTaiwanTelegraph" | "kTang" | "kTotalStrokes" | "kTraditionalVariant" | "kVietnamese" | "kXHC1983" | "kXerox" }, attribute value { text }, empty }⚓ |
<unit> contains a symbol, a word or a phrase referring to a unit of measurement in any kind of formal or informal system. [3.6.3. Numbers and Measures] | |
Module | core — Formal specification |
Attributes |
|
Member of | |
Contained by | |
May contain | |
Example | Here is an example of a <unit> element holding a unitRef attribute that points to a definition of the unit in the TEI header. <measure>
<num>3</num>
<unit unitRef="#ell">ells</unit>
</measure>
<!-- In the TEI Header: -->
<encodingDesc>
<unitDecl>
<unitDef xml:id="ell">
<label>ell</label>
<placeName ref="#iceland"/>
<desc>A unit of measure for cloth, roughly equivalent to 18 inches, or from an adult male’s elbow to the tip of the middle finger.</desc>
</unitDef>
</unitDecl>
</encodingDesc> |
Example | <measure>
<num>2</num>
<unit>kg</unit>
</measure> |
Example | <measure type="value">
<num>3</num>
<unit type="time" unit="min">minute</unit>
</measure> |
Example | <measure type="interval">
<num atLeast="1.2">1.2</num> to <num atMost="5.6">5.6</num>
<unit type="velocity" unit="km/h">km/h</unit>
</measure> |
Example | <p>Light travels at <num value="3E10">3×10^10</num>
<unit type="rate" unit="cm/s">
<unit type="space">cm</unit> per <unit type="time">second</unit>
</unit>.</p> |
Content model | <content> <macroRef key="macro.phraseSeq"/> </content> ⚓ |
Schema Declaration | element unit { tei_att.global.attributes, tei_att.cmc.attributes, tei_att.measurement.attributes, tei_att.typed.attributes, tei_macro.phraseSeq }⚓ |
<w> (word) represents a grammatical (not necessarily orthographic) word. [18.1. Linguistic Segment Categories 18.4.2. Lightweight Linguistic Annotation] | |
Module | analysis — Formal specification |
Attributes |
|
Member of | |
Contained by | |
May contain | |
Example | This example is adapted from the Folger Library’s Early Modern English Drama version of The Wits: a Comedy by William Davenant. <l>
<w lemma="it" pos="pn"
xml:id="A19883-003-a-0100">IT</w>
<w lemma="have" pos="vvz"
xml:id="A19883-003-a-0110">hath</w>
<w lemma="be" pos="vvn"
xml:id="A19883-003-a-0120">been</w>
<w lemma="say" pos="vvn"
xml:id="A19883-003-a-0130">said</w>
<w lemma="of" pos="acp-p"
xml:id="A19883-003-a-0140">of</w>
<w lemma="old" pos="j"
xml:id="A19883-003-a-0150">old</w>
<pc xml:id="A19883-003-a-0160">,</pc>
<w lemma="that" pos="cs"
xml:id="A19883-003-a-0170">that</w>
<w lemma="play" pos="vvz"
xml:id="A19883-003-a-0180">
<choice>
<orig>Playes</orig>
<reg>Plays</reg>
</choice>
</w>
<w lemma="be" pos="vvb"
xml:id="A19883-003-a-0190">are</w>
<w lemma="feast" pos="n2"
xml:id="A19883-003-a-0200">Feasts</w>
<pc xml:id="A19883-003-a-0210">,</pc>
</l>
<l xml:id="A19883-e100220">
<w lemma="poet" pos="n2"
xml:id="A19883-003-a-0220">Poets</w>
<w lemma="the" pos="d"
xml:id="A19883-003-a-0230">the</w>
<w lemma="cook" pos="n2"
xml:id="A19883-003-a-0240">
<choice>
<orig>Cookes</orig>
<reg>Cooks</reg>
</choice>
</w>
<pc xml:id="A19883-003-a-0250">,</pc>
<w lemma="and" pos="cc"
xml:id="A19883-003-a-0260">and</w>
<w lemma="the" pos="d"
xml:id="A19883-003-a-0270">the</w>
<w lemma="spectator" pos="n2"
xml:id="A19883-003-a-0280">Spectators</w>
<w lemma="guest" pos="n2"
xml:id="A19883-003-a-0290">Guests</w>
<pc xml:id="A19883-003-a-0300">,</pc>
</l>
<l xml:id="A19883-e100230">
<w lemma="the" pos="d"
xml:id="A19883-003-a-0310">The</w>
<w lemma="actor" pos="n2"
xml:id="A19883-003-a-0320">Actors</w>
<w lemma="waiter" pos="n2"
xml:id="A19883-003-a-0330">Waiters</w>
<pc xml:id="A19883-003-a-0340">:</pc>
<!-- ... -->
</l> |
Content model | <content> <alternate minOccurs="0" maxOccurs="unbounded"> <textNode/> <classRef key="model.gLike"/> <elementRef key="seg"/> <elementRef key="w"/> <elementRef key="m"/> <elementRef key="c"/> <elementRef key="pc"/> <classRef key="model.global"/> <classRef key="model.lPart"/> <classRef key="model.hiLike"/> <classRef key="model.pPart.edit"/> </alternate> </content> ⚓ |
Schema Declaration | element w { tei_att.global.attributes, tei_att.cmc.attributes, tei_att.linguistic.attributes, tei_att.notated.attributes, tei_att.segLike.attributes, tei_att.typed.attributes, ( text | tei_model.gLike | seg | tei_w | m | c | tei_pc | tei_model.global | tei_model.lPart | tei_model.hiLike | tei_model.pPart.edit )* }⚓ |
<zone> defines any two-dimensional area within a <surface> element. [12.1. Digital Facsimiles 12.2.2. Embedded Transcription] | |||||||||
Module | transcr — Formal specification | ||||||||
Attributes |
| ||||||||
Member of | |||||||||
Contained by | |||||||||
May contain | |||||||||
Note | The position of every zone for a given surface is always defined by reference to the coordinate system defined for that surface. A graphic element contained by a zone represents the whole of the zone. A zone may be of any shape. The attribute points may be used to define a polygonal zone, using the coordinate system defined by its parent surface. A zone is always a closed polygon. Repeating the initial coordinate at the end of the sequence is optional. To encode an unclosed path, use the <path> element. | ||||||||
Example | <surface ulx="14.54" uly="16.14" lrx="0"
lry="0">
<graphic url="stone.jpg"/>
<zone points="4.6,6.3 5.25,5.85 6.2,6.6 8.19222,7.4125 9.89222,6.5875 10.9422,6.1375
11.4422,6.7125 8.21722,8.3125 6.2,7.65"/>
</surface> This example defines a non-rectangular zone: see the illustration in section [[undefined PH-surfzone]]. | ||||||||
Example | <facsimile>
<surface ulx="50" uly="20" lrx="400"
lry="280">
<zone ulx="0" uly="0" lrx="500" lry="321">
<graphic url="graphic.png"/>
</zone>
</surface>
</facsimile> This example defines a zone which has been defined as larger than its parent surface in order to match the dimensions of the graphic it contains. | ||||||||
Content model | <content> <alternate minOccurs="0" maxOccurs="unbounded"> <textNode/> <classRef key="model.gLike"/> <classRef key="model.graphicLike"/> <classRef key="model.global"/> <elementRef key="surface"/> <classRef key="model.linePart"/> </alternate> </content> ⚓ | ||||||||
Schema Declaration | element zone { tei_att.global.attributes, tei_att.coordinated.attributes, tei_att.typed.attributes, tei_att.written.attributes, attribute rotate { text }?, ( text | tei_model.gLike | tei_model.graphicLike | tei_model.global | tei_surface | tei_model.linePart )* }⚓ |
model.addressLike groups elements used to represent a postal or email address. [1. The TEI Infrastructure] | |
Module | tei — Formal specification |
Used by | |
Members |
model.applicationLike groups elements used to record application-specific information about a document in its header. | |
Module | tei — Formal specification |
Used by | |
Members | application |
model.attributable groups elements that contain a word or phrase that can be attributed to a source. [3.3.3. Quotation 4.3.2. Floating Texts] | |
Module | tei — Formal specification |
Used by | |
Members | model.quoteLike |
model.availabilityPart groups elements such as licences and paragraphs of text which may appear as part of an availability statement. [2.2.4. Publication, Distribution, Licensing, etc.] | |
Module | tei — Formal specification |
Used by | |
Members | licence |
model.biblLike groups elements containing a bibliographic description. [3.12. Bibliographic Citations and References] | |
Module | tei — Formal specification |
Used by | |
Members | bibl |
model.biblPart groups elements which represent components of a bibliographic description. [3.12. Bibliographic Citations and References] | |
Module | tei — Formal specification |
Used by | |
Members | model.imprintPart[pubPlace publisher] model.respLike[funder meeting respStmt] availability bibl edition extent |
model.common groups common chunk- and inter-level elements. [1.3. The TEI Class System] | |
Module | tei — Formal specification |
Used by | |
Members | model.cmc model.divPart[model.lLike model.pLike[p]] model.inter[model.attributable[model.quoteLike] model.biblLike[bibl] model.egLike model.labelLike[desc label] model.listLike model.oddDecl model.stageLike] |
Note | This class defines the set of chunk- and inter-level elements; it is used in many content models, including those for textual divisions. |
model.dateLike groups elements containing temporal expressions. [3.6.4. Dates and Times 14.4. Dates] | |
Module | tei — Formal specification |
Used by | |
Members | date time |
model.describedResource groups elements which contain the content of a digital resource and its metadata; these elements may serve as the outermost or ‘root’ element of a TEI-conformant document. [1.3. The TEI Class System] | |
Module | tei — Formal specification |
Used by | |
Members | TEI teiCorpus |
model.divBottom groups elements appearing at the end of a text division. [4.2. Elements Common to All Divisions] | |
Module | tei — Formal specification |
Used by | |
Members | model.divBottomPart model.divWrapper[docDate meeting] |
model.divPart groups paragraph-level elements appearing directly within divisions. [1.3. The TEI Class System] | |
Module | tei — Formal specification |
Used by | |
Members | model.lLike model.pLike[p] |
Note | Note that this element class does not include members of the model.inter class, which can appear either within or between paragraph-level items. |
model.divTop groups elements appearing at the beginning of a text division. [4.2. Elements Common to All Divisions] | |
Module | tei — Formal specification |
Used by | |
Members | model.divTopPart[model.headLike[head]] model.divWrapper[docDate meeting] |
model.divTopPart groups elements which can occur only at the beginning of a text division. [4.6. Title Pages] | |
Module | tei — Formal specification |
Used by | |
Members | model.headLike[head] |
model.divWrapper groups elements which can appear at either top or bottom of a textual division. [4.2. Elements Common to All Divisions] | |
Module | tei — Formal specification |
Used by | |
Members | docDate meeting |
model.editorialDeclPart groups elements which may be used inside <editorialDecl> and appear multiple times. | |
Module | tei — Formal specification |
Used by | |
Members | correction hyphenation normalization quotation segmentation |
model.emphLike groups phrase-level elements which are typographically distinct and to which a specific function can be attributed. [3.3. Highlighting and Quotation] | |
Module | tei — Formal specification |
Used by | |
Members | term title |
model.encodingDescPart groups elements which may be used inside <encodingDesc> and appear multiple times. | |
Module | tei — Formal specification |
Used by | |
Members | appInfo charDecl classDecl editorialDecl listPrefixDef projectDesc tagsDecl |
model.frontPart groups elements which appear at the level of divisions within front or back matter. [7.1. Front and Back Matter ] | |
Module | tei — Formal specification |
Used by | |
Members | model.frontPart.drama |
model.gLike groups elements used to represent individual non-Unicode characters or glyphs. | |
Module | tei — Formal specification |
Used by | |
Members | g |
model.global groups elements which may appear at any point within a TEI text. [1.3. The TEI Class System] | |
Module | tei — Formal specification |
Used by | |
Members | model.global.edit[addSpan damageSpan delSpan gap space] model.global.meta[listTranspose substJoin] model.milestoneLike[fw pb] model.noteLike[note] metamark |
model.global.edit groups globally available elements which perform a specifically editorial function. [1.3. The TEI Class System] | |
Module | tei — Formal specification |
Used by | |
Members | addSpan damageSpan delSpan gap space |
model.global.meta groups globally available elements which describe the status of other elements. [1.3. The TEI Class System] | |
Module | tei — Formal specification |
Used by | |
Members | listTranspose substJoin |
Note | Elements in this class are typically used to hold groups of links or of abstract interpretations, or by provide indications of certainty etc. It may find be convenient to localize all metadata elements, for example to contain them within the same divison as the elements that they relate to; or to locate them all to a division of their own. They may however appear at any point in a TEI text. |
model.graphicLike groups elements containing images, formulae, and similar objects. [3.10. Graphics and Other Non-textual Components] | |
Module | tei — Formal specification |
Used by | |
Members | graphic media |
model.headLike groups elements used to provide a title or heading at the start of a text division. | |
Module | tei — Formal specification |
Used by | |
Members | head |
model.highlighted groups phrase-level elements which are typographically distinct. [3.3. Highlighting and Quotation] | |
Module | tei — Formal specification |
Used by | |
Members | model.emphLike[term title] model.hiLike |
model.imprintPart groups the bibliographic elements which occur inside imprints. [3.12. Bibliographic Citations and References] | |
Module | tei — Formal specification |
Used by | |
Members | pubPlace publisher |
model.inter groups elements which can appear either within or between paragraph-like elements. [1.3. The TEI Class System] | |
Module | tei — Formal specification |
Used by | |
Members | model.attributable[model.quoteLike] model.biblLike[bibl] model.egLike model.labelLike[desc label] model.listLike model.oddDecl model.stageLike |
model.labelLike groups elements used to gloss or explain other parts of a document. | |
Module | tei — Formal specification |
Used by | |
Members | desc label |
model.limitedPhrase groups phrase-level elements excluding those elements primarily intended for transcription of existing sources. [1.3. The TEI Class System] | |
Module | tei — Formal specification |
Used by | |
Members | model.emphLike[term title] model.hiLike model.pPart.data[model.addressLike[email] model.dateLike[date time] model.measureLike[measure num unit] model.nameLike[model.nameLike.agent[name] model.offsetLike model.placeStateLike[model.placeNamePart] idno]] model.pPart.editorial[ex subst] model.pPart.msdesc model.phrase.xml model.ptrLike[ref] |
model.measureLike groups elements which denote a number, a quantity, a measurement, or similar piece of text that conveys some numerical meaning. [3.6.3. Numbers and Measures] | |
Module | tei — Formal specification |
Used by | |
Members | measure num unit |
model.milestoneLike groups milestone-style elements used to represent reference systems. [1.3. The TEI Class System 3.11.3. Milestone Elements] | |
Module | tei — Formal specification |
Used by | |
Members | fw pb |
model.nameLike groups elements which name or refer to a person, place, or organization. | |
Module | tei — Formal specification |
Used by | |
Members | model.nameLike.agent[name] model.offsetLike model.placeStateLike[model.placeNamePart] idno |
Note | A superset of the naming elements that may appear in datelines, addresses, statements of responsibility, etc. |
model.nameLike.agent groups elements which contain names of individuals or corporate bodies. [3.6. Names, Numbers, Dates, Abbreviations, and Addresses] | |
Module | tei — Formal specification |
Used by | |
Members | name |
Note | This class is used in the content model of elements which reference names of people or organizations. |
model.noteLike groups globally-available note-like elements. [3.9. Notes, Annotation, and Indexing] | |
Module | tei — Formal specification |
Used by | |
Members | note |
model.pLike groups paragraph-like elements. | |
Module | tei — Formal specification |
Used by | |
Members | p |
model.pLike.front groups paragraph-like elements which can occur as direct constituents of front matter. [4.6. Title Pages] | |
Module | tei — Formal specification |
Used by | |
Members | docDate head |
model.pPart.data groups phrase-level elements containing names, dates, numbers, measures, and similar data. [3.6. Names, Numbers, Dates, Abbreviations, and Addresses] | |
Module | tei — Formal specification |
Used by | |
Members | model.addressLike[email] model.dateLike[date time] model.measureLike[measure num unit] model.nameLike[model.nameLike.agent[name] model.offsetLike model.placeStateLike[model.placeNamePart] idno] |
model.pPart.edit groups phrase-level elements for simple editorial correction and transcription. [3.5. Simple Editorial Changes] | |
Module | tei — Formal specification |
Used by | |
Members | model.pPart.editorial[ex subst] model.pPart.transcriptional[damage handShift mod redo restore retrace secl supplied surplus undo] |
model.pPart.editorial groups phrase-level elements for simple editorial interventions that may be useful both in transcribing and in authoring. [3.5. Simple Editorial Changes] | |
Module | tei — Formal specification |
Used by | |
Members | ex subst |
model.phrase groups elements which can occur at the level of individual words or phrases. [1.3. The TEI Class System] | |
Module | tei — Formal specification |
Used by | |
Members | model.graphicLike[graphic media] model.highlighted[model.emphLike[term title] model.hiLike] model.lPart model.pPart.data[model.addressLike[email] model.dateLike[date time] model.measureLike[measure num unit] model.nameLike[model.nameLike.agent[name] model.offsetLike model.placeStateLike[model.placeNamePart] idno]] model.pPart.edit[model.pPart.editorial[ex subst] model.pPart.transcriptional[damage handShift mod redo restore retrace secl supplied surplus undo]] model.pPart.msdesc model.phrase.xml model.ptrLike[ref] model.segLike[pc s w] model.specDescLike |
Note | This class of elements can occur within paragraphs, list items, lines of verse, etc. |
model.placeStateLike groups elements which describe changing states of a place. | |
Module | tei — Formal specification |
Used by | |
Members | model.placeNamePart |
model.profileDescPart groups elements which may be used inside <profileDesc> and appear multiple times. | |
Module | tei — Formal specification |
Used by | |
Members | handNotes langUsage listTranspose particDesc settingDesc textClass |
model.ptrLike groups elements used for purposes of location and reference. [3.7. Simple Links and Cross-References] | |
Module | tei — Formal specification |
Used by | |
Members | ref |
model.publicationStmtPart.agency groups the child elements of a <publicationStmt> element of the TEI header that indicate an authorising agent. [2.2.4. Publication, Distribution, Licensing, etc.] | |
Module | tei — Formal specification |
Used by | |
Members | publisher |
Note | The ‘agency’ child elements, while not required, are required if one of the ‘detail’ child elements is to be used. It is not valid to have a ‘detail’ child element without a preceding ‘agency’ child element. See also model.publicationStmtPart.detail. |
model.publicationStmtPart.detail groups the agency-specific child elements of the <publicationStmt> element of the TEI header. [2.2.4. Publication, Distribution, Licensing, etc.] | |
Module | tei — Formal specification |
Used by | |
Members | model.ptrLike[ref] availability date idno pubPlace |
Note | A ‘detail’ child element may not occur unless an ‘agency’ child element precedes it. See also model.publicationStmtPart.agency. |
model.resource groups separate elements which constitute the content of a digital resource, as opposed to its metadata. [1.3. The TEI Class System] | |
Module | tei — Formal specification |
Used by | |
Members | facsimile sourceDoc text |
model.respLike groups elements which are used to indicate intellectual or other significant responsibility, for example within a bibliographic element. | |
Module | tei — Formal specification |
Used by | |
Members | funder meeting respStmt |
model.segLike groups elements used for arbitrary segmentation. [17.3. Blocks, Segments, and Anchors 18.1. Linguistic Segment Categories] | |
Module | tei — Formal specification |
Used by | |
Members | pc s w |
Note | The principles on which segmentation is carried out, and any special codes or attribute values used, should be defined explicitly in the <segmentation> element of the <encodingDesc> within the associated TEI header. |
model.teiHeaderPart groups high level elements which may appear more than once in a TEI header. | |
Module | tei — Formal specification |
Used by | |
Members | encodingDesc profileDesc |
att.anchoring (anchoring) provides attributes for use on annotations, e.g. notes and groups of notes describing the existence and position of an anchor for annotations. | |||||||||||||||||||
Module | tei — Formal specification | ||||||||||||||||||
Members | note | ||||||||||||||||||
Attributes |
| ||||||||||||||||||
Example | <p>(...) tamen reuerendos dominos archiepiscopum et canonicos Leopolienses
necnon episcopum in duplicibus Quatuortemporibus<anchor xml:id="A55234"/> totaliter expediui...</p>
<!-- elsewhere in the document -->
<noteGrp targetEnd="#A55234">
<note xml:lang="en"> Quatuor Tempora, so called dry fast days.
</note>
<note xml:lang="pl"> Quatuor Tempora, tzw. Suche dni postne.
</note>
</noteGrp> |
att.ascribed provides attributes for elements representing speech or action that can be ascribed to a specific individual. [3.3.3. Quotation 8.3. Elements Unique to Spoken Texts] | |||||||||||
Module | tei — Formal specification | ||||||||||
Members | change setting | ||||||||||
Attributes |
|
att.breaking provides attributes to indicate whether or not the element concerned is considered to mark the end of an orthographic token in the same way as whitespace. [3.11.3. Milestone Elements] | |||||||||||
Module | tei — Formal specification | ||||||||||
Members | pb | ||||||||||
Attributes |
|
att.cReferencing provides attributes that may be used to supply a canonical reference as a means of identifying the target of a pointer. | |||||||||
Module | tei — Formal specification | ||||||||
Members | ref term | ||||||||
Attributes |
|
att.calendarSystem provides attributes for indicating calendar systems to which a date belongs. [3.6.4. Dates and Times 14.4. Dates] | |||||||||||||||
Module | tei — Formal specification | ||||||||||||||
Members | date docDate time | ||||||||||||||
Attributes |
|
att.canonical provides attributes that can be used to associate a representation such as a name or title with canonical information about the object being named or referenced. [14.1.1. Linking Names and Their Referents] | |||||||||||||||||||||
Module | tei — Formal specification | ||||||||||||||||||||
Members | att.naming[att.personal[name] pubPlace] bibl catDesc date funder meeting publisher resp respStmt term time title | ||||||||||||||||||||
Attributes |
| ||||||||||||||||||||
Example | In this contrived example, a canonical reference to the same organisation is provided in four different ways. <author n="1">
<name ref="http://nzetc.victoria.ac.nz/tm/scholarly/name-427308.html"
type="organisation">New Zealand Parliament, Legislative Council</name>
</author>
<author n="2">
<name ref="nzvn:427308"
type="organisation">New Zealand Parliament, Legislative Council</name>
</author>
<author n="3">
<name ref="./named_entities.xml#o427308"
type="organisation">New Zealand Parliament, Legislative Council</name>
</author>
<author n="4">
<name key="name-427308"
type="organisation">New Zealand Parliament, Legislative Council</name>
</author> The first presumes the availability of an internet connection and a processor that can resolve a URI (most can). The second requires, in addition, a <prefixDef> that declares how the nzvm prefix should be interpreted. The third does not require an internet connection, but does require that a file named named_entities.xml be in the same directory as the TEI document. The fourth requires that an entire external system for key resolution be available. | ||||||||||||||||||||
Note | The key attribute is more flexible and general-purpose, but its use in interchange requires that documentation about how the key is to be resolved be sent to the recipient of the TEI document. In contrast values of the ref attribute are resolved using the widely accepted protocols for a URI, and thus less documentation, if any, is likely required by the recipient in data interchange. These guidelines provide no semantic basis or suggested precedence when both key and ref are provided. For this reason simultaneous use of both is not recommended unless documentation explaining the use is provided, probably in an ODD customization, for interchange. |
att.cmc (computer-mediated communication) provides attributes categorizing how the element content was created in a CMC environment. | |||||||||||||||||||||
Module | tei — Formal specification | ||||||||||||||||||||
Members | bibl date desc docDate email gap graphic head idno label measure media meeting name note num p pb pc ref s term time title unit w | ||||||||||||||||||||
Attributes |
|
att.coordinated provides attributes that can be used to position their parent element within a two dimensional coordinate system. | |||||||||||||||||||||||||||||||||||||
Module | transcr — Formal specification | ||||||||||||||||||||||||||||||||||||
Members | line path surface zone | ||||||||||||||||||||||||||||||||||||
Attributes |
|
att.damaged provides attributes describing the nature of any physical damage affecting a reading. [12.3.3.1. Damage, Illegibility, and Supplied Text 1.3.1. Attribute Classes] | |||||||||||||||||||||||
Module | tei — Formal specification | ||||||||||||||||||||||
Members | damage damageSpan | ||||||||||||||||||||||
Attributes |
|
att.datable provides attributes for normalization of elements that contain dates, times, or datable events. [3.6.4. Dates and Times 14.4. Dates] | |||||||
Module | tei — Formal specification | ||||||
Members | att.gaijiProp[localProp unicodeProp unihanProp] application change date docDate funder idno licence mapping meeting name resp time title | ||||||
Attributes |
| ||||||
Note | This ‘superclass’ provides attributes that can be used to provide normalized values of temporal information. By default, the attributes from the att.datable.w3c class are provided. If the module for names & dates is loaded, this class also provides attributes from the att.datable.iso and att.datable.custom classes. In general, the possible values of attributes restricted to the W3C datatypes form a subset of those values available via the ISO 8601 standard. However, the greater expressiveness of the ISO datatypes may not be needed, and there exists much greater software support for the W3C datatypes. |
att.datable.w3c provides attributes for normalization of elements that contain datable events conforming to the W3C XML Schema Part 2: Datatypes Second Edition. [3.6.4. Dates and Times 14.4. Dates] | |||||||||||||||||||||||||||||||||||||
Module | tei — Formal specification | ||||||||||||||||||||||||||||||||||||
Members | att.datable[att.gaijiProp[localProp unicodeProp unihanProp] application change date docDate funder idno licence mapping meeting name resp time title] | ||||||||||||||||||||||||||||||||||||
Attributes |
| ||||||||||||||||||||||||||||||||||||
Schematron |
<sch:rule context="tei:*[@when]">
<sch:report test="@notBefore|@notAfter|@from|@to"
role="nonfatal">The @when attribute cannot be used with any other att.datable.w3c attributes.</sch:report>
</sch:rule> | ||||||||||||||||||||||||||||||||||||
Schematron |
<sch:rule context="tei:*[@from]">
<sch:report test="@notBefore"
role="nonfatal">The @from and @notBefore attributes cannot be used together.</sch:report>
</sch:rule> | ||||||||||||||||||||||||||||||||||||
Schematron |
<sch:rule context="tei:*[@to]">
<sch:report test="@notAfter"
role="nonfatal">The @to and @notAfter attributes cannot be used together.</sch:report>
</sch:rule> | ||||||||||||||||||||||||||||||||||||
Example | <date from="1863-05-28" to="1863-06-01">28 May through 1 June 1863</date> | ||||||||||||||||||||||||||||||||||||
Note | The value of these attributes should be a normalized representation of the date, time, or combined date & time intended, in any of the standard formats specified by XML Schema Part 2: Datatypes Second Edition, using the Gregorian calendar. The most commonly-encountered format for the date portion of a temporal attribute is Note that this format does not currently permit use of the value 0000 to represent the year 1 BCE; instead the value -0001 should be used. |
att.datcat provides attributes that are used to align XML elements or attributes with the appropriate Data Categories (DCs) defined by an external taxonomy, in this way establishing the identity of information containers and values, and providing means of interpreting them. [10.5.2. Lexical View 19.3. Other Atomic Feature Values] | |||||||||||||||||||
Module | tei — Formal specification | ||||||||||||||||||
Members | att.segLike[pc s w] category tagUsage taxonomy | ||||||||||||||||||
Attributes |
| ||||||||||||||||||
Example | The example below presents the TEI encoding of the name-value pair <part of speech, common noun> , where the name (key) ‘part of speech’ is abbreviated as ‘POS’, and the value, ‘common noun’ is symbolized by ‘NN’. The entire name-value pair is encoded by means of the element <f>. In TEI XML, that element acts as the container, labeled with the name attribute. Its contents may be complex or simple. In the case at hand, the content is the symbol ‘NN’.The datcat attribute relates the feature name (i.e., the key) to the data category ‘part of speech’, while the attribute valueDatcat relates the feature value to the data category common noun. Both these data categories should be defined in an external and preferably open reference taxonomy or ontology.<fs>
<f name="POS"
datcat="http://hdl.handle.net/11459/CCR_C-396_5a972b93-2294-ab5c-a541-7c344c5f26c3">
<symbol valueDatcat="http://hdl.handle.net/11459/CCR_C-1256_7ec6083c-23d4-224d-6f94-eecbe6861545"
value="NN"/>
</f>
<!-- ... -->
</fs> ‘NN’ is the symbol for common noun used e.g. in the CLAWS-7 tagset defined by the University Centre for Computer Corpus Research on Language at the University of Lancaster. The very same data category used for tagging an early version of the British National Corpus, and coming from the BNC Basic (C5) tagset, uses the symbol ‘NN0’ (rather than ‘NN’). Making these values semantically interoperable would be extremely difficult without a human expert if they were not anchored in a single point of an established reference taxonomy of morphosyntactic data categories. In the case at hand, the string ‘http://hdl.handle.net/11459/CCR_C-1256_7ec6083c-23d4-224d-6f94-eecbe6861545’ is both a persistent identifier of the data category in question, as well as a pointer to a shared definition of common noun.While the symbols ‘NN’, ‘NN0’, and many others (often coming from languages other than English) are implicitly members of the container category ‘part of speech’, it is sometimes useful not to rely on such an implicit relationship but rather use an explicit identifier for that data category, to distinguish it from other morphosyntactic data categories, such as gender, tense, etc. For that purpose, the above example uses the datcat attribute to reference a definition of part of speech. The reference taxonomy in this example is the CLARIN Concept Registry.If the feature structure markup exemplified above is to be repeated many times in a single document, it is much more efficient to gather the persistent identifiers in a single place and to only reference them, implicitly or directly, from feature structure markup. The following example is much more concise than the one above and relies on the concepts of feature structure declaration and feature value library, discussed in chapter [[undefined FS]]. <fs>
<f name="POS" fVal="#commonNoun"/>
<!-- ... -->
</fs> The assumption here is that the relevant feature values are collected in a place that the annotation document in question has access to — preferably, a single document per linguistic resource, for example an <fsdDecl> that is XIncluded as a sibling of <text> or a child of <encodingDesc>; a <taxonomy> available resource-wide (e.g., in a shared header) is also an option.The example below presents an <fvLib> element that collects the relevant feature values (most of them omitted). At the same time, this example shows one way of encoding a tagset, i.e., an established inventory of values of (in the case at hand) morphosyntactic categories. <fvLib n="POS values">
<symbol xml:id="commonNoun" value="NN"
datcat="http://hdl.handle.net/11459/CCR_C-396_5a972b93-2294-ab5c-a541-7c344c5f26c3"/>
<symbol xml:id="properNoun" value="NP"
datcat="http://hdl.handle.net/11459/CCR_C-1371_fbebd9ec-a7f4-9a36-d6e9-88ee16b944ae"/>
<!-- ... -->
</fvLib> Note that these Guidelines do not prescribe a specific choice between datcat and valueDatcat in such cases. The former is the generic way of referencing a data category, whereas the latter is more specific, in that it references a data category that represents a value. The choice between them comes into play where a single element — or a tight element complex, such as the <f>/<symbol> complex illustrated above — make it necessary or useful to distinguish between the container data category and its value. | ||||||||||||||||||
Example | In the context of dictionaries designed with semantic interoperability in mind, the following example ensures that the <pos> element is interpreted as the same information container as in the case of the example of <f name="POS"> above. <gramGrp>
<pos datcat="http://hdl.handle.net/11459/CCR_C-396_5a972b93-2294-ab5c-a541-7c344c5f26c3"
valueDatcat="http://hdl.handle.net/11459/CCR_C-1256_7ec6083c-23d4-224d-6f94-eecbe6861545">NN</pos>
</gramGrp> Efficiency of this type of interoperable markup demands that the references to the particular data categories should best be provided in a single place within the dictionary (or a single place within the project), rather than being repeated inside every entry. For the container elements, this can be achieved at the level of <tagUsage>, although here, the valueDatcat attribute should be used, because it is not the <tagUsage> element that is associated with the relevant data category, but rather the element <pos> (or <case>, etc.) that is described by <tagUsage>: <tagsDecl partial="true">
<!-- ... -->
<namespace name="http://www.tei-c.org/ns/1.0">
<tagUsage gi="pos"
targetDatcat="http://hdl.handle.net/11459/CCR_C-396_5a972b93-2294-ab5c-a541-7c344c5f26c3">Contains the part of speech.</tagUsage>
<tagUsage gi="case"
targetDatcat="http://hdl.handle.net/11459/CCR_C-1840_9f4e319c-f233-6c90-9117-7270e215f039">Contains information about the grammatical case that the described form is inflected for.</tagUsage>
<!-- ... -->
</namespace>
</tagsDecl> Another possibility is to shorten the URIs by means of the <prefixDef> mechanism, as illustrated below: <listPrefixDef>
<prefixDef ident="ccr" matchPattern="pos"
replacementPattern="http://hdl.handle.net/11459/CCR_C-396_5a972b93-2294-ab5c-a541-7c344c5f26c3"/>
<prefixDef ident="ccr" matchPattern="adj"
replacementPattern="http://hdl.handle.net/11459/CCR_C-1230_23653c21-fca1-edf8-fd7c-3df2d6499157"/>
</listPrefixDef>
<!-- ... -->
<entry>
<!--...-->
<form>
<orth>isotope</orth>
</form>
<gramGrp>
<pos datcat="ccr:pos"
valueDatcat="ccr:adj">adj</pos>
</gramGrp>
<!--...-->
</entry> This mechanism creates implications that are not always wanted, among others, in the case at hand, suggesting that the identifiers ‘pos’ and ‘adj’ belong to a namespace associated with the CLARIN Concept Repository (CCR), whereas that is solely a shorthand mechanism whose scope is the current resource. Documenting this clearly in the header of the dictionary is therefore advised.Yet another possibility is to associate the information about the relationship between a TEI markup element and the data category that it is intended to model already at the level of modeling the dictionary resource, that is, at the level of the ODD, in the <equiv> element that is a child of <elementSpec> or <attDef>. | ||||||||||||||||||
Example | The <taxonomy> element is a handy tool for encoding taxonomies that are later referenced by att.datcat attributes, but it can also act as an intermediary device, for example holding a fragment of an external taxonomy (or ‘flattening’ an external ontology) that is relevant to the project or document at hand. (It is also imaginable that, for the purpose of the project at hand, the local <taxonomy> element combines vocabularies that originate from more than one external taxonomy or ontology.) In such cases, the <taxonomy> creates a local layer of indirection: the att.datcat attributes internal to the resource may reference the <category> elements stored in the header (as well as the <taxonomy> element itself), whereas these same <category> and <taxonomy> elements use att.datcat attributes to reference the original taxonomy or ontology. <encodingDesc>
<!-- ... -->
<classDecl>
<!-- ... -->
<taxonomy xml:id="UD-SYN"
datcat="https://universaldependencies.org/u/dep/index.html">
<desc>
<term>UD syntactic relations</term>
</desc>
<category xml:id="acl"
valueDatcat="https://universaldependencies.org/u/dep/acl.html">
<catDesc>
<term>acl</term>: Clausal modifier of noun (adjectival clause)</catDesc>
</category>
<category xml:id="acl_relcl"
valueDatcat="https://universaldependencies.org/u/dep/acl-relcl.html">
<catDesc>
<term>acl:relcl</term>: relative clause modifier</catDesc>
</category>
<category xml:id="advcl"
valueDatcat="https://universaldependencies.org/u/dep/advcl.html">
<catDesc>
<term>advcl</term>: Adverbial clause modifier</catDesc>
</category>
<!-- ... -->
</taxonomy>
</classDecl>
</encodingDesc> The above fragment was excerpted from the GB subset of the ParlaMint project in April 2023, and enriched with att.datcat attributes for the purpose of illustrating the mechanism described here.Note that, in the ideal case, the values of att.datcat attributes should be persistent identifiers, and that the addressing scheme of Universal Dependencies is treated here as persistent for the sake of illustration. Note also that the contrast between datcat used on <taxonomy> on the one hand, and the valueDatcat used on <category> on the other, is not mandatory: both kinds of relations could be encoded by means of the generic datcat attribute, but using the former for the container and the latter for the content is more user-friendly. | ||||||||||||||||||
Example | The targetDatcat attribute is designed to be used in, e.g., feature structure declarations, and is analogous to the targetLang attribute of the att.pointing class, in that it describes the object that is being referenced, rather than the referencing object. <fDecl name="POS"
targetDatcat="http://hdl.handle.net/11459/CCR_C-396_5a972b93-2294-ab5c-a541-7c344c5f26c3">
<fDescr>part of speech (morphosyntactic category)</fDescr>
<vRange>
<vAlt>
<symbol value="NN"
datcat="http://hdl.handle.net/11459/CCR_C-1256_7ec6083c-23d4-224d-6f94-eecbe6861545"/>
<symbol value="NP"
datcat="http://hdl.handle.net/11459/CCR_C-1371_fbebd9ec-a7f4-9a36-d6e9-88ee16b944ae"/>
<!-- ... -->
</vAlt>
</vRange>
</fDecl> Above, the <fDecl> uses targetDatcat, because if it were to use datcat, it would be asserting that it is an instance of the container data category part of speech, whereas it is not — it models a container (<f>) that encodes a part of speech. Note also that it is the <f> that is modeled above, not its values, which are used as direct references to data categories; hence the use of datcat in the <symbol> element. | ||||||||||||||||||
Example | The att.datcat attributes can be used for any sort of taxonomies. The example below illustrates their usefulness for describing usage domain labels in dictionaries on the example of the Diccionario da Lingua Portugueza by António de Morais Silva, retro-digitised in the MORDigital project.
<!-- in the dictionary header --><encodingDesc>
<classDecl>
<taxonomy xml:id="domains">
<!--...-->
<category xml:id="domain.medical_and_health_sciences">
<catDesc xml:lang="en">Medical and Health Sciences</catDesc>
<catDesc xml:lang="pt">Ciências Médicas e da Saúde</catDesc>
<category xml:id="domain.medical_and_health_sciences.medicine"
valueDatcat="https://vocabs.rossio.fcsh.unl.pt/pub/morais_domains/pt/page/0025">
<catDesc xml:lang="en">
<term>Medicine</term>
<gloss>
<!--...-->
</gloss>
</catDesc>
<catDesc xml:lang="pt">
<term>Medicina</term>
<gloss>
<!--...-->
</gloss>
</catDesc>
</category>
</category>
<!--...-->
</taxonomy>
</classDecl>
</encodingDesc>
<!--
inside an <entry> element: -->
<usg type="domain"
valueDatcat="#domain.medical_and_health_sciences.medicine">Med.</usg> In the Morais dictionary, the relevant domain labels are in the header, getting referenced inside the dictionary, from <usg> elements. The vocabulary used for dictionary-internal labelling is in turn anchored in the MorDigital controlled vocabulary service of the NOVA University of Lisbon – School of Social Sciences and Humanities (NOVA FCSH). | ||||||||||||||||||
Note | The TEI Abstract Model can be expressed as a hierarchy of attribute-value matrices (AVMs) of various types and of various levels of complexity, nested or grouped in various ways. At the most abstract level, an AVM consists of an information container and the value (contents) of that container. A simple example of an XML serialization of such structures is, on the one hand, the opening and closing tags that delimit and name the container, and, on the other, the content enclosed by the two tags that constitues the value. An analogous example is an attribute name and the value of that attribute. In a TEI XML example of two equivalent serializations expressing the name-value pair The att.datcat class provides means of addressing the containers and their values, while at the same time providing a way to interpret them in the context of external taxonomies or ontologies. Aligning e.g. both the <pos> element and the pos attribute with the same value of an external reference point (i.e., an entry in an agreed taxonomy) affirms the identity of the concept serialised by both the element container and the attribute container, and optionally provides a definition of that concept (in the case at hand, the concept part of speech). The value of the att.datcat attributes should be a PID (persistent identifier) that points to a specific — and, ideally, shared — taxonomy or ontology. Among the resources that can, to a lesser or greater extent, be used as inventories of (more or less) standardized linguistic categories are the GOLD ontology, CLARIN CCR, OLiA, or TermWeb's DatCatInfo, and also the Universal Dependencies inventory, on the assumption that its URIs are going to persist. It is imaginable that a project may choose to address a local taxonomy store instead, but this risks losing the advantage of interchangeability with other projects. Historically, datcat and valueDatcat originate from the (now obsolete) ISO 12620:2009 standard, describing the data model and procedures for a Data Category Registry (DCR). The current version of that standard, ISO 12620-1, does not standardize the serialization of pointers, merely mentioning the TEI att.datcat as an example. Note that no constraint prevents the occurrence of a combination of att.datcat attributes: the <fDecl> element, which is a natural bearer of the targetDatcat attribute, is an instance of a specific modeling element, and, in principle, could be semantically fixed by an appropriate reference taxonomy of modeling devices. |
att.declarable provides attributes for those elements in the TEI header which may be independently selected by means of the special purpose decls attribute. [16.3. Associating Contextual Information with a Text] | |||||||||
Module | tei — Formal specification | ||||||||
Members | availability bibl correction editorialDecl hyphenation langUsage normalization particDesc projectDesc quotation segmentation settingDesc sourceDesc textClass | ||||||||
Attributes |
| ||||||||
Schematron |
<sch:pattern id="declarable" abstract="true">
<sch:rule context="$tde[ ancestor::tei:teiHeader and following-sibling::$tde and not(
preceding-sibling::$tde ) ]">
<sch:report test="../child::$tde[ not( @xml:id ) ]"> When there is more than one <sch:name/>, each must have an @xml:id
</sch:report>
<sch:assert test="count( ../child::$tde[ normalize-space( @default ) = ('1','true')
] ) eq 1"> When there is more than one <sch:name/>, one and only one must have a @default of 'true'.
</sch:assert>
</sch:rule>
</sch:pattern> | ||||||||
Note | The rules governing the association of declarable elements with individual parts of a TEI text are fully defined in chapter 16.3. Associating Contextual Information with a Text. Only one element of a particular type may have a default attribute with a value of true. |
att.declaring provides attributes for elements which may be independently associated with a particular declarable element within the header, thus overriding the inherited default for that element. [16.3. Associating Contextual Information with a Text] | |||||||
Module | tei — Formal specification | ||||||
Members | back body div facsimile front graphic media p ref sourceDoc surface surfaceGrp term text | ||||||
Attributes |
| ||||||
Note | The rules governing the association of declarable elements with individual parts of a TEI text are fully defined in chapter 16.3. Associating Contextual Information with a Text. |
att.dimensions provides attributes for describing the size of physical objects. | |||||||||||||||||||||||||||||||||||||||
Module | tei — Formal specification | ||||||||||||||||||||||||||||||||||||||
Members | att.damaged[damage damageSpan] addSpan date delSpan ex gap mod redo restore retrace secl space subst substJoin supplied surplus time undo | ||||||||||||||||||||||||||||||||||||||
Attributes |
|
att.divLike provides attributes common to all elements which behave in the same way as divisions. [4. Default Text Structure] | |||||||||||||||||
Module | tei — Formal specification | ||||||||||||||||
Members | div | ||||||||||||||||
Attributes |
|
att.docStatus provides attributes for use on metadata elements describing the status of a document. | |||||||||
Module | tei — Formal specification | ||||||||
Members | bibl change revisionDesc | ||||||||
Attributes |
| ||||||||
Example | <revisionDesc status="published">
<change when="2010-10-21"
status="published"/>
<change when="2010-10-02" status="cleared"/>
<change when="2010-08-02"
status="embargoed"/>
<change when="2010-05-01" status="frozen"
who="#MSM"/>
<change when="2010-03-01" status="draft"
who="#LB"/>
</revisionDesc> |
att.editLike provides attributes describing the nature of an encoded scholarly intervention or interpretation of any kind. [3.5. Simple Editorial Changes 11.3.1. Origination 14.3.2. The Person Element 12.3.1.1. Core Elements for Transcriptional Work] | |||||||||||||||||
Module | tei — Formal specification | ||||||||||||||||
Members | att.transcriptional[addSpan delSpan mod redo restore retrace subst substJoin undo] date ex gap name secl supplied surplus time | ||||||||||||||||
Attributes |
| ||||||||||||||||
Note | The members of this attribute class are typically used to represent any kind of editorial intervention in a text, for example a correction or interpretation, or to date or localize manuscripts etc. Each pointer on the source (if present) corresponding to a witness or witness group should reference a bibliographic citation such as a <witness>, <msDesc>, or <bibl> element, or another external bibliographic citation, documenting the source concerned. |
att.edition provides attributes identifying the source edition from which some encoded feature derives. | |||||||||||||
Module | tei — Formal specification | ||||||||||||
Members | pb | ||||||||||||
Attributes |
| ||||||||||||
Example | <l>Of Mans First Disobedience,<lb ed="1674"/> and<lb ed="1667"/> the Fruit</l>
<l>Of that Forbidden Tree, whose<lb ed="1667 1674"/> mortal tast</l>
<l>Brought Death into the World,<lb ed="1667"/> and all<lb ed="1674"/> our woe,</l> | ||||||||||||
Example | <listBibl>
<bibl xml:id="stapledon1937">
<author>Olaf Stapledon</author>,
<title>Starmaker</title>, <publisher>Methuen</publisher>, <date>1937</date>
</bibl>
<bibl xml:id="stapledon1968">
<author>Olaf Stapledon</author>,
<title>Starmaker</title>, <publisher>Dover</publisher>, <date>1968</date>
</bibl>
</listBibl>
<!-- ... -->
<p>Looking into the future aeons from the supreme moment of
the cosmos, I saw the populations still with all their
strength maintaining the<pb n="411" edRef="#stapledon1968"/>essentials of their ancient culture,
still living their personal lives in zest and endless
novelty of action, … I saw myself still
preserving, though with increasing difficulty, my lucid
con<pb n="291" break="no"
edRef="#stapledon1937"/>sciousness;</p> In the above example, the soft hyphen in Stapledon 1937 is omitted. Such decisions may be documented in the edition's declaration of editorial principles, e.g. with the <hyphenation> element in the <teiHeader>. | ||||||||||||
Note | These guidelines provide no semantic basis or suggested precedence when both ed and edRef are provided. For this reason simultaneous use of both is not recommended unless documentation explaining the use is provided, probably in an ODD customization, for interchange. |
att.fragmentable provides attributes for representing fragmentation of a structural element, typically as a consequence of some overlapping hierarchy. | |||||||||||
Module | tei — Formal specification | ||||||||||
Members | att.divLike[div] att.segLike[pc s w] p | ||||||||||
Attributes |
|
att.gaijiProp provides attributes for defining the properties of non-standard characters or glyphs. [5. Characters, Glyphs, and Writing Modes] | |||||||||||||||||||||||||||||
Module | gaiji — Formal specification | ||||||||||||||||||||||||||||
Members | localProp unicodeProp unihanProp | ||||||||||||||||||||||||||||
Attributes |
| ||||||||||||||||||||||||||||
Example | In this example a definition for the Unicode property Decomposition Mapping is provided. <unicodeProp name="Decomposition_Mapping"
value="circle"/> | ||||||||||||||||||||||||||||
Note | All name-only attributes need an xs:boolean attribute value inside value. |
att.global provides attributes common to all elements in the TEI encoding scheme. [1.3.1.1. Global Attributes] | |||||||||||||||||||||||||||||||||||||||||||||
Module | tei — Formal specification | ||||||||||||||||||||||||||||||||||||||||||||
Members | TEI addSpan appInfo application availability back bibl body catDesc catRef category change char charDecl classDecl correction damage damageSpan date delSpan desc div docDate edition editionStmt editorialDecl email encodingDesc ex extent facsimile fileDesc front funder fw g gap glyph graphic handNotes handShift head hyphenation idno label langUsage language licence line listPrefixDef listTranspose localProp mapping measure media meeting metamark mod name namespace normalization note num p particDesc path pb pc prefixDef profileDesc projectDesc pubPlace publicationStmt publisher quotation redo ref resp respStmt restore retrace revisionDesc s secl segmentation setting settingDesc sourceDesc sourceDoc space subst substJoin supplied surface surfaceGrp surplus tagUsage tagsDecl taxonomy teiCorpus teiHeader term text textClass time title titleStmt transpose undo unicodeProp unihanProp unit w zone | ||||||||||||||||||||||||||||||||||||||||||||
Attributes |
|
att.global.change provides attributes allowing its member elements to specify one or more states or revision campaigns with which they are associated. | |||||||
Module | transcr — Formal specification | ||||||
Members | att.global[TEI addSpan appInfo application availability back bibl body catDesc catRef category change char charDecl classDecl correction damage damageSpan date delSpan desc div docDate edition editionStmt editorialDecl email encodingDesc ex extent facsimile fileDesc front funder fw g gap glyph graphic handNotes handShift head hyphenation idno label langUsage language licence line listPrefixDef listTranspose localProp mapping measure media meeting metamark mod name namespace normalization note num p particDesc path pb pc prefixDef profileDesc projectDesc pubPlace publicationStmt publisher quotation redo ref resp respStmt restore retrace revisionDesc s secl segmentation setting settingDesc sourceDesc sourceDoc space subst substJoin supplied surface surfaceGrp surplus tagUsage tagsDecl taxonomy teiCorpus teiHeader term text textClass time title titleStmt transpose undo unicodeProp unihanProp unit w zone] | ||||||
Attributes |
|
att.global.facs provides attributes used to express correspondence between an element and all or part of a facsimile image or surface. [12.1. Digital Facsimiles] | |||||||
Module | transcr — Formal specification | ||||||
Members | att.global[TEI addSpan appInfo application availability back bibl body catDesc catRef category change char charDecl classDecl correction damage damageSpan date delSpan desc div docDate edition editionStmt editorialDecl email encodingDesc ex extent facsimile fileDesc front funder fw g gap glyph graphic handNotes handShift head hyphenation idno label langUsage language licence line listPrefixDef listTranspose localProp mapping measure media meeting metamark mod name namespace normalization note num p particDesc path pb pc prefixDef profileDesc projectDesc pubPlace publicationStmt publisher quotation redo ref resp respStmt restore retrace revisionDesc s secl segmentation setting settingDesc sourceDesc sourceDoc space subst substJoin supplied surface surfaceGrp surplus tagUsage tagsDecl taxonomy teiCorpus teiHeader term text textClass time title titleStmt transpose undo unicodeProp unihanProp unit w zone] | ||||||
Attributes |
|
att.global.rendition provides rendering attributes common to all elements in the TEI encoding scheme. [1.3.1.1.3. Rendition Indicators] | |||||||||||||||||||||||||||||||
Module | tei — Formal specification | ||||||||||||||||||||||||||||||
Members | att.global[TEI addSpan appInfo application availability back bibl body catDesc catRef category change char charDecl classDecl correction damage damageSpan date delSpan desc div docDate edition editionStmt editorialDecl email encodingDesc ex extent facsimile fileDesc front funder fw g gap glyph graphic handNotes handShift head hyphenation idno label langUsage language licence line listPrefixDef listTranspose localProp mapping measure media meeting metamark mod name namespace normalization note num p particDesc path pb pc prefixDef profileDesc projectDesc pubPlace publicationStmt publisher quotation redo ref resp respStmt restore retrace revisionDesc s secl segmentation setting settingDesc sourceDesc sourceDoc space subst substJoin supplied surface surfaceGrp surplus tagUsage tagsDecl taxonomy teiCorpus teiHeader term text textClass time title titleStmt transpose undo unicodeProp unihanProp unit w zone] | ||||||||||||||||||||||||||||||
Attributes |
|
att.global.responsibility provides attributes indicating the agent responsible for some aspect of the text, the markup or something asserted by the markup, and the degree of certainty associated with it. [1.3.1.1.4. Sources, certainty, and responsibility 3.5. Simple Editorial Changes 12.3.2.2. Hand, Responsibility, and Certainty Attributes 18.3. Spans and Interpretations 14.1.1. Linking Names and Their Referents] | |||||||||||||||
Module | tei — Formal specification | ||||||||||||||
Members | att.global[TEI addSpan appInfo application availability back bibl body catDesc catRef category change char charDecl classDecl correction damage damageSpan date delSpan desc div docDate edition editionStmt editorialDecl email encodingDesc ex extent facsimile fileDesc front funder fw g gap glyph graphic handNotes handShift head hyphenation idno label langUsage language licence line listPrefixDef listTranspose localProp mapping measure media meeting metamark mod name namespace normalization note num p particDesc path pb pc prefixDef profileDesc projectDesc pubPlace publicationStmt publisher quotation redo ref resp respStmt restore retrace revisionDesc s secl segmentation setting settingDesc sourceDesc sourceDoc space subst substJoin supplied surface surfaceGrp surplus tagUsage tagsDecl taxonomy teiCorpus teiHeader term text textClass time title titleStmt transpose undo unicodeProp unihanProp unit w zone] | ||||||||||||||
Attributes |
| ||||||||||||||
Example | Blessed are the
<choice>
<sic>cheesemakers</sic>
<corr resp="#editor" cert="high">peacemakers</corr>
</choice>: for they shall be called the children of God. | ||||||||||||||
Example |
<!-- in the <text> ... --><lg>
<!-- ... -->
<l>Punkes, Panders, baſe extortionizing
sla<choice>
<sic>n</sic>
<corr resp="#JENS1_transcriber">u</corr>
</choice>es,</l>
<!-- ... -->
</lg>
<!-- in the <teiHeader> ... -->
<!-- ... -->
<respStmt xml:id="JENS1_transcriber">
<resp when="2014">Transcriber</resp>
<name>Janelle Jenstad</name>
</respStmt> |
att.global.source provides attributes used by elements to point to an external source. [1.3.1.1.4. Sources, certainty, and responsibility 3.3.3. Quotation 8.3.4. Writing] | |||||||||||
Module | tei — Formal specification | ||||||||||
Members | att.global[TEI addSpan appInfo application availability back bibl body catDesc catRef category change char charDecl classDecl correction damage damageSpan date delSpan desc div docDate edition editionStmt editorialDecl email encodingDesc ex extent facsimile fileDesc front funder fw g gap glyph graphic handNotes handShift head hyphenation idno label langUsage language licence line listPrefixDef listTranspose localProp mapping measure media meeting metamark mod name namespace normalization note num p particDesc path pb pc prefixDef profileDesc projectDesc pubPlace publicationStmt publisher quotation redo ref resp respStmt restore retrace revisionDesc s secl segmentation setting settingDesc sourceDesc sourceDoc space subst substJoin supplied surface surfaceGrp surplus tagUsage tagsDecl taxonomy teiCorpus teiHeader term text textClass time title titleStmt transpose undo unicodeProp unihanProp unit w zone] | ||||||||||
Attributes |
| ||||||||||
Example | <p>
<!-- ... --> As Willard McCarty (<bibl xml:id="mcc_2012">2012, p.2</bibl>) tells us, <quote source="#mcc_2012">‘Collaboration’ is a problematic and should be a contested
term.</quote>
<!-- ... -->
</p> | ||||||||||
Example | <p>
<!-- ... -->
<quote source="#chicago_15_ed">Grammatical theories are in flux, and the more we learn, the
less we seem to know.</quote>
<!-- ... -->
</p>
<!-- ... -->
<bibl xml:id="chicago_15_ed">
<title level="m">The Chicago Manual of Style</title>,
<edition>15th edition</edition>. <pubPlace>Chicago</pubPlace>: <publisher>University of
Chicago Press</publisher> (<date>2003</date>), <biblScope unit="page">p.147</biblScope>.
</bibl> | ||||||||||
Example | <elementRef key="p" source="tei:2.0.1"/> Include in the schema an element named <p> available from the TEI P5 2.0.1 release. | ||||||||||
Example | <schemaSpec ident="myODD"
source="mycompiledODD.xml">
<!-- further declarations specifying the components required -->
</schemaSpec> Create a schema using components taken from the file mycompiledODD.xml. |
att.handFeatures provides attributes describing aspects of the hand in which a manuscript is written. [12.3.2.1. Document Hands] | |||||||||||||||||||||||||||||||
Module | tei — Formal specification | ||||||||||||||||||||||||||||||
Members | handShift | ||||||||||||||||||||||||||||||
Attributes |
| ||||||||||||||||||||||||||||||
Note | Usually either script or scriptRef, and similarly, either scribe or scribeRef, will be supplied. |
att.internetMedia provides attributes for specifying the type of a computer resource using a standard taxonomy. | |||||||
Module | tei — Formal specification | ||||||
Members | att.media[graphic media] ref | ||||||
Attributes |
| ||||||
Example | In this example mimeType is used to indicate that the URL points to a TEI XML file encoded in UTF-8. <ref mimeType="application/tei+xml; charset=UTF-8"
target="https://raw.githubusercontent.com/TEIC/TEI/dev/P5/Source/guidelines-en.xml"/> | ||||||
Note | This attribute class provides an attribute for describing a computer resource, typically available over the internet, using a value taken from a standard taxonomy. At present only a single taxonomy is supported, the Multipurpose Internet Mail Extensions (MIME) Media Type system. This typology of media types is defined by the Internet Engineering Task Force in RFC 2046. The list of types is maintained by the Internet Assigned Numbers Authority (IANA). The mimeType attribute must have a value taken from this list. |
att.lexicographic.normalized provides attributes for usage within word-level elements in the analysis module and within lexicographic microstructure in the dictionaries module. | |||||||||||||||||||||||||||||||
Module | analysis — Formal specification | ||||||||||||||||||||||||||||||
Members | att.linguistic[pc w] | ||||||||||||||||||||||||||||||
Attributes |
| ||||||||||||||||||||||||||||||
Note | It needs to be stressed that the two attributes in this class are meant for strictly lexicographic and linguistic uses, and not for editorial interventions. For the latter, the mechanism based on <choice>, <orig>, and <reg> needs to be employed. |
att.linguistic provides a set of attributes concerning linguistic features of tokens, for usage within token-level elements, specifically <w> and <pc> in the analysis module. [18.4.2. Lightweight Linguistic Annotation] | |||||||||||||||||||||||||||||||||||||||||||||||||||||
Module | analysis — Formal specification | ||||||||||||||||||||||||||||||||||||||||||||||||||||
Members | pc w | ||||||||||||||||||||||||||||||||||||||||||||||||||||
Attributes |
| ||||||||||||||||||||||||||||||||||||||||||||||||||||
Note | These attributes make it possible to encode simple language corpora and to add a layer of linguistic information to any tokenized resource. See section 18.4.2. Lightweight Linguistic Annotation for discussion. |
att.measurement provides attributes to represent a regularized or normalized measurement. | |||||||||||||||||||||||||||||||
Module | tei — Formal specification | ||||||||||||||||||||||||||||||
Members | measure unit | ||||||||||||||||||||||||||||||
Attributes |
| ||||||||||||||||||||||||||||||
Schematron |
<sch:rule context="tei:*[@unitRef]">
<sch:report test="@unit" role="info">The @unit attribute may be unnecessary when @unitRef is present.</sch:report>
</sch:rule> | ||||||||||||||||||||||||||||||
Note | This attribute class provides a triplet of attributes that may be used either to regularize the values of the measurement being encoded, or to normalize them with respect to a standard measurement system. <l>So weren't you gonna buy <measure quantity="0.5" unit="gal"
commodity="ice cream">half
a gallon</measure>, baby</l>
<l>So won't you go and buy <measure quantity="1.893" unit="L"
commodity="ice cream">half
a gallon</measure>, baby?</l> The unit should normally be named using the standard symbol for an SI unit (see further http://www.bipm.org/en/publications/si-brochure/; http://physics.nist.gov/cuu/Units/). However, encoders may also specify measurements using informally defined units such as lines or characters. |
att.media provides attributes for specifying display and related properties of external media. | |||||||||||||||||||
Module | tei — Formal specification | ||||||||||||||||||
Members | graphic media | ||||||||||||||||||
Attributes |
|
att.naming provides attributes common to elements which refer to named persons, places, organizations etc. [3.6.1. Referring Strings 14.3.7. Names and Nyms] | |||||||||||||||
Module | tei — Formal specification | ||||||||||||||
Members | att.personal[name] pubPlace | ||||||||||||||
Attributes |
|
att.notated provides attributes to indicate any specialised notation used for element content. | |||||||
Module | tei — Formal specification | ||||||
Members | s w | ||||||
Attributes |
|
att.patternReplacement provides attributes for regular-expression matching and replacement. [17.2.3. Using Abbreviated Pointers 2.3.6.3. Milestone Method 2.3.6. The Reference System Declaration 2.3.6.2. Search-and-Replace Method] | |||||||||||||||||
Module | header — Formal specification | ||||||||||||||||
Members | prefixDef | ||||||||||||||||
Attributes |
|
att.personal (attributes for components of names usually, but not necessarily, personal names) common attributes for those elements which form part of a name usually, but not necessarily, a personal name. [14.2.1. Personal Names] | |||||||||||||||
Module | tei — Formal specification | ||||||||||||||
Members | name | ||||||||||||||
Attributes |
|
att.placement provides attributes for describing where on the source page or object a textual element appears. [3.5.3. Additions, Deletions, and Omissions 12.3.1.4. Additions and Deletions] | |||||||||||||
Module | tei — Formal specification | ||||||||||||
Members | att.transcriptional[addSpan delSpan mod redo restore retrace subst substJoin undo] div fw head label metamark note | ||||||||||||
Attributes |
|
att.pointing provides a set of attributes used by all elements which point to other elements by means of one or more URI references. [1.3.1.1.2. Language Indicators 3.7. Simple Links and Cross-References] | |||||||||||||||||||||||||||||||
Module | tei — Formal specification | ||||||||||||||||||||||||||||||
Members | catRef licence note ref substJoin term | ||||||||||||||||||||||||||||||
Attributes |
|
att.ranging provides attributes for describing numerical ranges. | |||||||||||||||||||||||||||||||
Module | tei — Formal specification | ||||||||||||||||||||||||||||||
Members | att.dimensions[att.damaged[damage damageSpan] addSpan date delSpan ex gap mod redo restore retrace secl space subst substJoin supplied surplus time undo] measure num | ||||||||||||||||||||||||||||||
Attributes |
| ||||||||||||||||||||||||||||||
Example | The MS. was lost in transmission by mail from
<del rend="overstrike">
<gap reason="illegible"
extent="one or two letters" atLeast="1" atMost="2" unit="chars"/>
</del>
Philadelphia to the Graphic office, New York.
| ||||||||||||||||||||||||||||||
Example | Americares has been supporting the health sector in Eastern Europe since 1986,
and since 1992 has provided <measure atLeast="120000000" unit="USD"
commodity="currency">more
than $120m</measure> in aid to Ukrainians.
|
att.resourced provides attributes by which a resource (such as an externally held media file) may be located. | |||||||
Module | tei — Formal specification | ||||||
Members | graphic media | ||||||
Attributes |
|
att.scope provides attributes to describe, in general terms, the scope of an element’s application. | |||||||||||||
Module | tei — Formal specification | ||||||||||||
Members | att.handFeatures[handShift] language | ||||||||||||
Attributes |
|
att.segLike provides attributes for elements used for arbitrary segmentation. [17.3. Blocks, Segments, and Anchors 18.1. Linguistic Segment Categories] | |||||||||
Module | tei — Formal specification | ||||||||
Members | pc s w | ||||||||
Attributes |
|
att.sortable provides attributes for elements in lists or groups that are sortable, but whose sorting key cannot be derived mechanically from the element content. [10.1. Dictionary Body and Overall Structure] | |||||||||||
Module | tei — Formal specification | ||||||||||
Members | bibl idno term | ||||||||||
Attributes |
|
att.spanning provides attributes for elements which delimit a span of text by pointing mechanisms rather than by enclosing it. [12.3.1.4. Additions and Deletions 1.3.1. Attribute Classes] | |||||||||
Module | tei — Formal specification | ||||||||
Members | addSpan damageSpan delSpan metamark mod pb redo retrace undo | ||||||||
Attributes |
| ||||||||
Note | The span is defined as running in document order from the start of the content of the pointing element to the end of the content of the element pointed to by the spanTo attribute (if any). If no value is supplied for the attribute, the assumption is that the span is coextensive with the pointing element. If no content is present, the assumption is that the starting point of the span is immediately following the element itself. |
att.timed provides attributes common to those elements which have a duration in time, expressed either absolutely or by reference to an alignment map. [8.3.5. Temporal Information] | |||||||||||||||||
Module | tei — Formal specification | ||||||||||||||||
Members | gap media | ||||||||||||||||
Attributes |
|
att.transcriptional provides attributes specific to elements encoding authorial or scribal intervention in a text when transcribing manuscript or similar sources. [12.3.1.4. Additions and Deletions] | |||||||||||||||||||||||
Module | tei — Formal specification | ||||||||||||||||||||||
Members | addSpan delSpan mod redo restore retrace subst substJoin undo | ||||||||||||||||||||||
Attributes |
|
att.typed provides attributes that can be used to classify or subclassify elements in any way. [1.3.1. Attribute Classes 18.1.1. Words and Above 3.6.1. Referring Strings 3.7. Simple Links and Cross-References 3.6.5. Abbreviations and Their Expansions 3.13.1. Core Tags for Verse 7.2.5. Speech Contents 4.1.1. Un-numbered Divisions 4.1.2. Numbered Divisions 4.2.1. Headings and Trailers 4.4. Virtual Divisions 14.3.2.3. Personal Relationships 12.3.1.1. Core Elements for Transcriptional Work 17.1.1. Pointers and Links 17.3. Blocks, Segments, and Anchors 13.2. Linking the Apparatus to the Text 23.5.1.2. Defining Content Models: RELAX NG 8.3. Elements Unique to Spoken Texts 24.3.1.3. Modification of Attribute and Attribute Value Lists] | |||||||||||||||||||
Module | tei — Formal specification | ||||||||||||||||||
Members | TEI addSpan application bibl change damage damageSpan date delSpan desc div fw g graphic head idno label line mapping measure media mod name note num path pb pc ref restore s space surface surfaceGrp teiCorpus term text time title unit w zone | ||||||||||||||||||
Attributes |
| ||||||||||||||||||
Schematron |
<sch:rule context="tei:*[@subtype]">
<sch:assert test="@type">The <sch:name/> element should not be categorized in detail with @subtype unless also categorized in general with @type</sch:assert>
</sch:rule> | ||||||||||||||||||
Note | When appropriate, values from an established typology should be used. Alternatively a typology may be defined in the associated TEI header. If values are to be taken from a project-specific list, this should be defined using the <valList> element in the project-specific schema description, as described in 24.3.1.3. Modification of Attribute and Attribute Value Lists . |
att.written provides attributes to indicate the hand in which the content of an element was written in the source being transcribed. [1.3.1. Attribute Classes] | |||||||
Module | tei — Formal specification | ||||||
Members | att.damaged[damage damageSpan] att.transcriptional[addSpan delSpan mod redo restore retrace subst substJoin undo] div fw head label line note p path text zone | ||||||
Attributes |
|
macro.limitedContent (paragraph content) defines the content of prose elements that are not used for transcription of extant materials. [1.3. The TEI Class System] | |
Module | tei — Formal specification |
Used by | |
Content model | <content> <alternate minOccurs="0" maxOccurs="unbounded"> <textNode/> <classRef key="model.limitedPhrase"/> <classRef key="model.inter"/> </alternate> </content> ⚓ |
Declaration | tei_macro.limitedContent = ( text | tei_model.limitedPhrase | tei_model.inter )*⚓ |
macro.paraContent (paragraph content) defines the content of paragraphs and similar elements. [1.3. The TEI Class System] | |
Module | tei — Formal specification |
Used by | |
Content model | <content> <alternate minOccurs="0" maxOccurs="unbounded"> <textNode/> <classRef key="model.paraPart"/> </alternate> </content> ⚓ |
Declaration | tei_macro.paraContent = ( text | tei_model.paraPart )*⚓ |
macro.phraseSeq (phrase sequence) defines a sequence of character data and phrase-level elements. [1.4.1. Standard Content Models] | |
Module | tei — Formal specification |
Used by | |
Content model | <content> <alternate minOccurs="0" maxOccurs="unbounded"> <textNode/> <classRef key="model.gLike"/> <classRef key="model.attributable"/> <classRef key="model.phrase"/> <classRef key="model.global"/> </alternate> </content> ⚓ |
Declaration | tei_macro.phraseSeq = ( text | tei_model.gLike | tei_model.attributable | tei_model.phrase | tei_model.global )*⚓ |
macro.phraseSeq.limited (limited phrase sequence) defines a sequence of character data and those phrase-level elements that are not typically used for transcribing extant documents. [1.4.1. Standard Content Models] | |
Module | tei — Formal specification |
Used by | |
Content model | <content> <alternate minOccurs="0" maxOccurs="unbounded"> <textNode/> <classRef key="model.limitedPhrase"/> <classRef key="model.global"/> </alternate> </content> ⚓ |
Declaration | tei_macro.phraseSeq.limited = ( text | tei_model.limitedPhrase | tei_model.global )*⚓ |
macro.specialPara ('special' paragraph content) defines the content model of elements such as notes or list items, which either contain a series of component-level elements or else have the same structure as a paragraph, containing a series of phrase-level and inter-level elements. [1.3. The TEI Class System] | |
Module | tei — Formal specification |
Used by | |
Content model | <content> <alternate minOccurs="0" maxOccurs="unbounded"> <textNode/> <classRef key="model.gLike"/> <classRef key="model.phrase"/> <classRef key="model.inter"/> <classRef key="model.divPart"/> <classRef key="model.global"/> </alternate> </content> ⚓ |
Declaration | tei_macro.specialPara = ( text | tei_model.gLike | tei_model.phrase | tei_model.inter | tei_model.divPart | tei_model.global )*⚓ |
macro.xtext (extended text) defines a sequence of character data and gaiji elements. | |
Module | tei — Formal specification |
Used by | |
Content model | <content> <alternate minOccurs="0" maxOccurs="unbounded"> <textNode/> <classRef key="model.gLike"/> </alternate> </content> ⚓ |
Declaration | tei_macro.xtext = ( text | tei_model.gLike )*⚓ |
teidata.certainty defines the range of attribute values expressing a degree of certainty. | |
Module | tei — Formal specification |
Used by | |
Content model | <content> <valList type="closed"> <valItem ident="high"/> <valItem ident="medium"/> <valItem ident="low"/> <valItem ident="unknown"/> </valList> </content> ⚓ |
Declaration | tei_teidata.certainty = "high" | "medium" | "low" | "unknown"⚓ |
Note | Certainty may be expressed by one of the predefined symbolic values high, medium, or low. The value unknown should be used in cases where the encoder does not wish to assert an opinion about the matter. |
teidata.count defines the range of attribute values used for a non-negative integer value used as a count. | |
Module | tei — Formal specification |
Used by | |
Content model | <content> <dataRef name="nonNegativeInteger"/> </content> ⚓ |
Declaration | tei_teidata.count = xsd:nonNegativeInteger⚓ |
Note | Any positive integer value or zero is permitted |
teidata.duration.iso defines the range of attribute values available for representation of a duration in time using ISO 8601 standard formats. | |
Module | tei — Formal specification |
Used by | |
Content model | <content> <dataRef name="token" restriction="[0-9.,DHMPRSTWYZ/:+\-]+"/> </content> ⚓ |
Declaration | tei_teidata.duration.iso = token { pattern = "[0-9.,DHMPRSTWYZ/:+\-]+" }⚓ |
Example | <time dur-iso="PT0,75H">three-quarters of an hour</time> |
Example | <date dur-iso="P1,5D">a day and a half</date> |
Example | <date dur-iso="P14D">a fortnight</date> |
Example | <time dur-iso="PT0.02S">20 ms</time> |
Note | A duration is expressed as a sequence of number-letter pairs, preceded by the letter P; the letter gives the unit and may be Y (year), M (month), D (day), H (hour), M (minute), or S (second), in that order. The numbers are all unsigned integers, except for the last, which may have a decimal component (using either For complete details, see ISO 8601 Data elements and interchange formats — Information interchange — Representation of dates and times. |
teidata.duration.w3c defines the range of attribute values available for representation of a duration in time using W3C datatypes. | |
Module | tei — Formal specification |
Used by | |
Content model | <content> <dataRef name="duration"/> </content> ⚓ |
Declaration | tei_teidata.duration.w3c = xsd:duration⚓ |
Example | <time dur="PT45M">forty-five minutes</time> |
Example | <date dur="P1DT12H">a day and a half</date> |
Example | <date dur="P7D">a week</date> |
Example | <time dur="PT0.02S">20 ms</time> |
Note | A duration is expressed as a sequence of number-letter pairs, preceded by the letter P; the letter gives the unit and may be Y (year), M (month), D (day), H (hour), M (minute), or S (second), in that order. The numbers are all unsigned integers, except for the For complete details, see the W3C specification. |
teidata.enumerated defines the range of attribute values expressed as a single XML name taken from a list of documented possibilities. | |
Module | tei — Formal specification |
Used by | Element:
|
Content model | <content> <dataRef key="teidata.word"/> </content> ⚓ |
Declaration | tei_teidata.enumerated = teidata.word⚓ |
Note | Attributes using this datatype must contain a single ‘word’ which contains only letters, digits, punctuation characters, or symbols: thus it cannot include whitespace. Typically, the list of documented possibilities will be provided (or exemplified) by a value list in the associated attribute specification, expressed with a <valList> element. |
teidata.language defines the range of attribute values used to identify a particular combination of human language and writing system. [6.1. Language Identification] | |
Module | tei — Formal specification |
Used by | Element:
|
Content model | <content> <alternate> <dataRef name="language"/> <valList> <valItem ident=""/> </valList> </alternate> </content> ⚓ |
Declaration | tei_teidata.language = xsd:language | ( "" )⚓ |
Note | The values for this attribute are language ‘tags’ as defined in BCP 47. Currently BCP 47 comprises RFC 5646 and RFC 4647; over time, other IETF documents may succeed these as the best current practice. A ‘language tag’, per BCP 47, is assembled from a sequence of components or subtags separated by the hyphen character (-, U+002D). The tag is made of the following subtags, in the following order. Every subtag except the first is optional. If present, each occurs only once, except the fourth and fifth components (variant and extension), which are repeatable.
There are two exceptions to the above format. First, there are language tags in the IANA registry that do not match the above syntax, but are present because they have been ‘grandfathered’ from previous specifications. Second, an entire language tag can consist of only a private use subtag. These tags start with Examples include
The W3C Internationalization Activity has published a useful introduction to BCP 47, Language tags in HTML and XML. |
teidata.name defines the range of attribute values expressed as an XML Name. | |
Module | tei — Formal specification |
Used by | Element:
|
Content model | <content> <dataRef name="Name"/> </content> ⚓ |
Declaration | tei_teidata.name = xsd:Name⚓ |
Note | Attributes using this datatype must contain a single word which follows the rules defining a legal XML name (see https://www.w3.org/TR/REC-xml/#dt-name): for example they cannot include whitespace or begin with digits. |
teidata.namespace defines the range of attribute values used to indicate XML namespaces as defined by the W3C Namespaces in XML Technical Recommendation. | |
Module | tei — Formal specification |
Used by | Element:
|
Content model | <content> <dataRef restriction="\S+" name="anyURI"/> </content> ⚓ |
Declaration | tei_teidata.namespace = xsd:anyURI { pattern = "\S+" }⚓ |
Note | The range of syntactically valid values is defined by RFC 3986 Uniform Resource Identifier (URI): Generic Syntax |
teidata.numeric defines the range of attribute values used for numeric values. | |
Module | tei — Formal specification |
Used by | |
Content model | <content> <alternate> <dataRef name="double"/> <dataRef name="token" restriction="(\-?[\d]+/\-?[\d]+)"/> <dataRef name="decimal"/> </alternate> </content> ⚓ |
Declaration | tei_teidata.numeric = xsd:double | token { pattern = "(\-?[\d]+/\-?[\d]+)" } | xsd:decimal⚓ |
Note | Any numeric value, represented as a decimal number, in floating point format, or as a ratio. To represent a floating point number, expressed in scientific notation, ‘E notation’, a variant of ‘exponential notation’, may be used. In this format, the value is expressed as two numbers separated by the letter E. The first number, the significand (sometimes called the mantissa) is given in decimal format, while the second is an integer. The value is obtained by multiplying the mantissa by 10 the number of times indicated by the integer. Thus the value represented in decimal notation as 1000.0 might be represented in scientific notation as 10E3. A value expressed as a ratio is represented by two integer values separated by a solidus (/) character. Thus, the value represented in decimal notation as 0.5 might be represented as a ratio by the string 1/2. |
teidata.outputMeasurement defines a range of values for use in specifying the size of an object that is intended for display. | |
Module | tei — Formal specification |
Used by | |
Content model | <content> <dataRef name="token" restriction="[\-+]?\d+(\.\d+)?(%|cm|mm|in|pt|pc|px|em|ex|ch|rem|vw|vh|vmin|vmax)"/> </content> ⚓ |
Declaration | tei_teidata.outputMeasurement = token { pattern = "[\-+]?\d+(\.\d+)?(%|cm|mm|in|pt|pc|px|em|ex|ch|rem|vw|vh|vmin|vmax)" }⚓ |
Example | <figure>
<head>The TEI Logo</head>
<figDesc>Stylized yellow angle brackets with the letters <mentioned>TEI</mentioned> in
between and <mentioned>text encoding initiative</mentioned> underneath, all on a white
background.</figDesc>
<graphic height="600px" width="600px"
url="http://www.tei-c.org/logos/TEI-600.jpg"/>
</figure> |
Note | These values map directly onto the values used by XSL-FO and CSS. For definitions of the units see those specifications; at the time of this writing the most complete list is in the CSS3 working draft. |
teidata.pattern defines attribute values which are expressed as a regular expression. | |
Module | tei — Formal specification |
Used by | |
Content model | <content> <dataRef name="token"/> </content> ⚓ |
Declaration | tei_teidata.pattern = token⚓ |
Note | A regular expression, often called a pattern, is an expression that describes a set of strings. They are usually used to give a concise description of a set, without having to list all elements. For example, the set containing the three strings Handel, Händel, and Haendel can be described by the pattern WikipediaH(ä|ae?)ndel (or alternatively, it is said that the pattern H(ä|ae?)ndel matches each of the three strings)This TEI datatype is mapped to the XSD token datatype, and may therefore contain any string of characters. However, it is recommended that the value used conform to the particular flavour of regular expression syntax supported by XSD Schema. |
teidata.point defines the data type used to express a point in cartesian space. | |
Module | tei — Formal specification |
Used by | Element:
|
Content model | <content> <dataRef name="token" restriction="(-?[0-9]+(\.[0-9]+)?,-?[0-9]+(\.[0-9]+)?)"/> </content> ⚓ |
Declaration | tei_teidata.point = token { pattern = "(-?[0-9]+(\.[0-9]+)?,-?[0-9]+(\.[0-9]+)?)" }⚓ |
Example | <facsimile>
<surface ulx="0" uly="0" lrx="400" lry="280">
<zone points="220,100 300,210 170,250 123,234">
<graphic url="handwriting.png"/>
</zone>
</surface>
</facsimile> |
Note | A point is defined by two numeric values, which should be expressed as decimal numbers. Neither number can end in a decimal point. E.g., both 0.0,84.2 and 0,84 are allowed, but 0.,84. is not. |
teidata.pointer defines the range of attribute values used to provide a single URI, absolute or relative, pointing to some other resource, either within the current document or elsewhere. | |
Module | tei — Formal specification |
Used by | |
Content model | <content> <dataRef restriction="\S+" name="anyURI"/> </content> ⚓ |
Declaration | tei_teidata.pointer = xsd:anyURI { pattern = "\S+" }⚓ |
Note | The range of syntactically valid values is defined by RFC 3986 Uniform Resource Identifier (URI): Generic Syntax. Note that the values themselves are encoded using RFC 3987 Internationalized Resource Identifiers (IRIs) mapping to URIs. For example, |
teidata.prefix defines a range of values that may function as a URI scheme name. | |
Module | tei — Formal specification |
Used by | Element:
|
Content model | <content> <dataRef name="token" restriction="[a-z][a-z0-9\+\.\-]*"/> </content> ⚓ |
Declaration | tei_teidata.prefix = token { pattern = "[a-z][a-z0-9\+\.\-]*" }⚓ |
Note | This datatype is used to constrain a string of characters to one that can be used as a URI scheme name according to RFC 3986, section 3.1. Thus only the 26 lowercase letters a–z, the 10 digits 0–9, the plus sign, the period, and the hyphen are permitted, and the value must start with a letter. |
teidata.probCert defines a range of attribute values which can be expressed either as a numeric probability or as a coded certainty value. | |
Module | tei — Formal specification |
Used by | |
Content model | <content> <alternate> <dataRef key="teidata.probability"/> <dataRef key="teidata.certainty"/> </alternate> </content> ⚓ |
Declaration | tei_teidata.probCert = teidata.probability | teidata.certainty⚓ |
teidata.probability defines the range of attribute values expressing a probability. | |
Module | tei — Formal specification |
Used by | |
Content model | <content> <dataRef name="double"> <dataFacet name="minInclusive" value="0"/> <dataFacet name="maxInclusive" value="1"/> </dataRef> </content> ⚓ |
Declaration | tei_teidata.probability = xsd:double⚓ |
Note | Probability is expressed as a real number between 0 and 1; 0 representing certainly false and 1 representing certainly true. |
teidata.replacement defines attribute values which contain a replacement template. | |
Module | tei — Formal specification |
Used by | |
Content model | <content> <textNode/> </content> ⚓ |
Declaration | tei_teidata.replacement = text⚓ |
teidata.temporal.w3c defines the range of attribute values expressing a temporal expression such as a date, a time, or a combination of them, that conform to the W3C XML Schema Part 2: Datatypes Second Edition specification. | |
Module | tei — Formal specification |
Used by | |
Content model | <content> <alternate> <dataRef name="date"/> <dataRef name="gYear"/> <dataRef name="gMonth"/> <dataRef name="gDay"/> <dataRef name="gYearMonth"/> <dataRef name="gMonthDay"/> <dataRef name="time"/> <dataRef name="dateTime"/> </alternate> </content> ⚓ |
Declaration | tei_teidata.temporal.w3c = xsd:date | xsd:gYear | xsd:gMonth | xsd:gDay | xsd:gYearMonth | xsd:gMonthDay | xsd:time | xsd:dateTime⚓ |
Note | If it is likely that the value used is to be compared with another, then a time zone indicator should always be included, and only the dateTime representation should be used. |
teidata.text defines the range of attribute values used to express some kind of identifying string as a single sequence of Unicode characters possibly including whitespace. | |
Module | tei — Formal specification |
Used by | Element:
|
Content model | <content> <dataRef name="string"/> </content> ⚓ |
Declaration | tei_teidata.text = string⚓ |
Note | Attributes using this datatype must contain a single ‘token’ in which whitespace and other punctuation characters are permitted. |
teidata.truthValue defines the range of attribute values used to express a truth value. | |
Module | tei — Formal specification |
Used by | |
Content model | <content> <dataRef name="boolean"/> </content> ⚓ |
Declaration | tei_teidata.truthValue = xsd:boolean⚓ |
Note | The possible values of this datatype are 1 or true, or 0 or false. This datatype applies only for cases where uncertainty is inappropriate; if the attribute concerned may have a value other than true or false, e.g. unknown, or inapplicable, it should have the extended version of this datatype: teidata.xTruthValue. |
teidata.version defines the range of attribute values which may be used to specify a TEI or Unicode version number. | |
Module | tei — Formal specification |
Used by | |
Content model | <content> <dataRef name="token" restriction="[\d]+(\.[\d]+){0,2}"/> </content> ⚓ |
Declaration | tei_teidata.version = token { pattern = "[\d]+(\.[\d]+){0,2}" }⚓ |
Note | The value of this attribute follows the pattern specified by the Unicode consortium for its version number (https://unicode.org/versions/). A version number contains digits and fullstop characters only. The first number supplied identifies the major version number. A second and third number, for minor and sub-minor version numbers, may also be supplied. |
teidata.versionNumber defines the range of attribute values used for version numbers. | |
Module | tei — Formal specification |
Used by | Element:
|
Content model | <content> <dataRef name="token" restriction="[\d]+[a-z]*[\d]*(\.[\d]+[a-z]*[\d]*){0,3}"/> </content> ⚓ |
Declaration | tei_teidata.versionNumber = token { pattern = "[\d]+[a-z]*[\d]*(\.[\d]+[a-z]*[\d]*){0,3}" }⚓ |
teidata.word defines the range of attribute values expressed as a single word or token. | |
Module | tei — Formal specification |
Used by | |
Content model | <content> <dataRef name="token" restriction="[^\p{C}\p{Z}]+"/> </content> ⚓ |
Declaration | tei_teidata.word = token { pattern = "[^\p{C}\p{Z}]+" }⚓ |
Note | Attributes using this datatype must contain a single ‘word’ which contains only letters, digits, punctuation characters, or symbols: thus it cannot include whitespace. |
teidata.xTruthValue (extended truth value) defines the range of attribute values used to express a truth value which may be unknown. | |
Module | tei — Formal specification |
Used by | |
Content model | <content> <alternate> <dataRef name="boolean"/> <valList> <valItem ident="unknown"/> <valItem ident="inapplicable"/> </valList> </alternate> </content> ⚓ |
Declaration | tei_teidata.xTruthValue = xsd:boolean | ( "unknown" | "inapplicable" )⚓ |
Note | In cases where where uncertainty is inappropriate, use the datatype teidata.TruthValue. |
teidata.xmlName defines attribute values which contain an XML name. | |
Module | tei — Formal specification |
Used by | Element:
|
Content model | <content> <dataRef name="NCName"/> </content> ⚓ |
Declaration | tei_teidata.xmlName = xsd:NCName⚓ |
Note | The rules defining an XML name form a part of the XML Specification. |
teidata.xpath defines attribute values which contain an XPath expression. | |
Module | tei — Formal specification |
Used by | |
Content model | <content> <textNode/> </content> ⚓ |
Declaration | tei_teidata.xpath = text⚓ |
Note | Any XPath expression using the syntax defined in 6.2.. When writing programs that evaluate XPath expressions, programmers should be mindful of the possibility of malicious code injection attacks. For further information about XPath injection attacks, see the article at OWASP. |