high
medium
low
unknown
(\-?[\d]+/\-?[\d]+)
unknown
inapplicable
0
1
2
9
[\-+]?\d+(\.\d+)?(%|cm|mm|in|pt|pc|px|em|ex|gd|rem|vw|vh|vm)
[\d]+(\.[\d]+){0,2}
(\p{L}|\p{N}|\p{P}|\p{S})+
[0-9.,DHMPRSTWYZ/:+\-]+
[0-9.,DHMPRSTWYZ/:+\-]+
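The patterns listed above can be tried out directly. What follows is a minimal sketch, assuming (as in XML Schema) that a pattern must match the whole value, and assuming Python's `re` module is an acceptable stand-in for the three patterns that use only portable syntax. The `\p{...}` pattern is left out because `re` does not support Unicode property classes. The constant and function names are hypothetical and are not part of the schema.

```python
import re

# Patterns copied verbatim from the list above.
# The constant names are hypothetical labels, not schema names.
RATIO_LIKE = re.compile(r"(\-?[\d]+/\-?[\d]+)")
MEASUREMENT_LIKE = re.compile(
    r"[\-+]?\d+(\.\d+)?(%|cm|mm|in|pt|pc|px|em|ex|gd|rem|vw|vh|vm)"
)
VERSION_LIKE = re.compile(r"[\d]+(\.[\d]+){0,2}")


def is_valid(pattern, value):
    """Return True when the whole value matches the pattern."""
    return pattern.fullmatch(value) is not None
```

Under these assumptions `is_valid(MEASUREMENT_LIKE, "12.5cm")` holds, while a bare number such as `"12"` is rejected because the unit suffix is mandatory.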
egXML
indicates the person, or group of people, to whom the element content is ascribed.
provides an externally-defined means of identifying the entity (or entities) being
named, using a coded value of some kind.
(reference) provides an explicit means of locating a full definition for the entity being named by
means of one or more URIs.
gives a minimum estimated value for the approximate measurement.
gives a maximum estimated value for the approximate measurement.
where the measurement summarizes more than one observation
or a range, supplies the minimum value
observed.
where the measurement summarizes more than one observation
or a range, supplies the maximum value
observed.
names the unit used for the measurement
Suggested values include: 1] cm(centimetres) ; 2] mm(millimetres) ; 3] in(inches) ; 4] lines; 5] chars(characters)
cm
(centimetres)
mm
(millimetres)
in
(inches)
lines
lines of text
chars
(characters) characters of text
specifies the length in the units specified
indicates the size of the object concerned using a project-specific vocabulary combining
quantity and units in a single string of words.
characterizes the precision of the values specified by the other attributes.
where the measurement summarizes more than one observation, specifies the applicability
of this measurement.
Sample values include: 1] all; 2] most; 3] range
indicates whether or not the element
bearing this attribute should be considered to mark the end of
an orthographic token in the same way as whitespace.
supplies a pointer to some location defining a named
period of time within which the datable item is understood to
have occurred.
supplies the value of the date or time in a standard form,
e.g. yyyy-mm-dd.
specifies the earliest possible date for the event in
standard form, e.g. yyyy-mm-dd.
specifies the latest possible date for the event in
standard form, e.g. yyyy-mm-dd.
indicates the starting point of the period in standard form, e.g. yyyy-mm-dd.
indicates the ending point of the period in standard
form, e.g. yyyy-mm-dd.
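The date descriptions above imply a simple consistency check between a stated value and its earliest and latest bounds. This is a rough sketch, assuming every value is a full yyyy-mm-dd date (shorter forms such as a bare year are not handled); the function name and parameter names are hypothetical.

```python
from datetime import date


def is_consistent(when=None, not_before=None, not_after=None):
    """Check that a yyyy-mm-dd value lies within the optional bounds.

    Assumption: all arguments are full ISO dates or None.
    """
    def parse(text):
        return date.fromisoformat(text) if text else None

    value, earliest, latest = parse(when), parse(not_before), parse(not_after)
    if earliest and latest and earliest > latest:
        return False  # the bounds themselves are contradictory
    if value and earliest and value < earliest:
        return False
    if value and latest and value > latest:
        return False
    return True
```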
indicates whether or not this element is selected by default when
its parent is selected.
true
This element is selected if its parent is selected
false
This element can only be selected explicitly, unless it is the
only one of its kind, in which case it is selected if its parent is selected.
identifies one or more declarable elements within the
header, which are understood to apply to the element bearing this
attribute and its content.
(organization) specifies how the content of the division is organized.
composite
composite content: i.e. no claim is made about the
sequence in which the immediate contents of this division
are to be processed, or their inter-relationships.
uniform
uniform content: i.e. the immediate contents of this
element are regarded as forming a logical unit, to be
processed in sequence.
indicates whether this division is a sample of the
original source and if so, from which part.
initial
division lacks material present at end in source.
medial
division lacks material at start and end.
final
division lacks material at start.
unknown
position of sampled material within original unknown.
complete
division is not a sample.
specifies whether or not the division is fragmented by
some other structural element, for example a speech which is
divided between two or more verse stanzas.
Y
(yes) the division is incomplete in some respect
N
(no) either the division is complete, or no claim is made as to its completeness.
I
(initial) the initial part of an incomplete division
M
(medial) a medial part of an incomplete division
F
(final) the final part of an incomplete division
describes the status of a document either currently or, when
associated with a dated element, at the time indicated.
Sample values include: 1] approved; 2] candidate; 3] cleared; 4] deprecated; 5] draft; 6] embargoed; 7] expired; 8] frozen; 9] galley; 10] proposed; 11] published; 12] recommendation; 13] submitted; 14] unfinished; 15] withdrawn
(duration) indicates the length of this element in time.
(certainty) signifies the degree of certainty associated with the intervention or interpretation.
(responsible party) indicates the agency responsible for the intervention or interpretation, for example an
editor or transcriber.
indicates the nature of the evidence supporting the reliability or accuracy of the
intervention or interpretation.
Suggested values include: 1] internal; 2] external; 3] conjecture
internal
there is internal evidence to support the intervention.
external
there is external evidence to support the intervention.
conjecture
the intervention or interpretation has been made by the editor, cataloguer, or
scholar on the basis of their expertise.
contains a list of one or more pointers indicating sources
supporting the given intervention or interpretation.
(identifier) provides a unique identifier for the element bearing the attribute.
(number) gives a number (or other label) for an element, which is not necessarily unique within
the document.
(language) indicates the language of the element content using a tag generated
according to BCP 47
(rendition) indicates how the element in question was rendered or presented in the source text.
points to a description of the rendering or presentation used for this element in the
source text.
provides a base URI reference with which applications can resolve relative URI
references into absolute URI references.
signals an intention about how white space should be
managed by applications.
default
the processor should treat white space according to the
default XML white space handling rules
preserve
the processor should preserve unchanged any and all
white space in the source
gives a name or other identifier for the scribe
believed to be responsible for this hand.
points to a full description of the scribe concerned, typically supplied by a person element
elsewhere in the description.
characterizes the particular script or writing style used by
this hand, for example secretary, copperplate, Chancery, Italian, etc.
points to a full description of the script or writing style used by
this hand, typically supplied by a scriptNote element
elsewhere in the description.
describes the tint or type of ink, e.g. brown, or other
writing medium, e.g. pencil
specifies how widely this hand is used in the manuscript.
sole
only this hand is used throughout the manuscript
major
this hand is used through most of the manuscript
minor
this hand is used occasionally in the manuscript
(MIME media type) specifies the applicable multimedia internet mail extension (MIME) media type
indicates the units used for the measurement, usually
using the standard symbol for the desired units.
Suggested values include: 1] m(metre) ; 2] kg(kilogram) ; 3] s(second) ; 4] Hz(hertz) ; 5] Pa(pascal) ; 6] Ω(ohm) ; 7] L(litre) ; 8] t(tonne) ; 9] ha(hectare) ; 10] Å(ångström) ; 11] mL(millilitre) ; 12] cm(centimetre) ; 13] dB(decibel) ; 14] kbit(kilobit) ; 15] Kibit(kibibit) ; 16] kB(kilobyte) ; 17] KiB(kibibyte) ; 18] MB(megabyte) ; 19] MiB(mebibyte)
m
(metre) SI base unit of length
kg
(kilogram) SI base unit of mass
s
(second) SI base unit of time
Hz
(hertz) SI unit of frequency
Pa
(pascal) SI unit of pressure or stress
Ω
(ohm) SI unit of electric resistance
L
(litre) 1 dm³
t
(tonne) 10³ kg
ha
(hectare) 1 hm²
Å
(ångström) 10⁻¹⁰ m
mL
(millilitre)
cm
(centimetre)
dB
(decibel) see remarks, below
kbit
(kilobit) 10³ or 1000 bits
Kibit
(kibibit) 2¹⁰ or 1024 bits
kB
(kilobyte) 10³ or 1000 bytes
KiB
(kibibyte) 2¹⁰ or 1024 bytes
MB
(megabyte) 10⁶ or 1 000 000 bytes
MiB
(mebibyte) 2²⁰ or 1 048 576 bytes
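The difference between the decimal and binary byte units listed above is easy to get wrong, so a small conversion table may help. This is a sketch only, covering the four byte units and using the multipliers given above; the names `BYTES_PER_UNIT` and `to_bytes` are hypothetical.

```python
# Multipliers taken from the unit descriptions above.
BYTES_PER_UNIT = {
    "kB": 10**3,   # kilobyte
    "KiB": 2**10,  # kibibyte
    "MB": 10**6,   # megabyte
    "MiB": 2**20,  # mebibyte
}


def to_bytes(quantity, unit):
    """Convert a quantity in one of the listed byte units to bytes."""
    try:
        return quantity * BYTES_PER_UNIT[unit]
    except KeyError:
        raise ValueError(f"unsupported unit: {unit!r}") from None
```

For example, `to_bytes(1, "MiB") - to_bytes(1, "MB")` gives 48 576, the number of bytes by which a mebibyte exceeds a megabyte.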
specifies the number of the specified units that
comprise the measurement
indicates the substance that is being measured
may be used to specify further information about the entity referenced by
this name, for example the occupation of a person, or the status of a place.
(reference to the canonical name) provides a means of locating the canonical form
(nym) of the names associated with the object named by the element bearing it.
Suggested values include: 1] below; 2] bottom; 3] margin; 4] top; 5] opposite; 6] overleaf; 7] above; 8] end; 9] inline; 10] inspace
below
below the line
bottom
at the foot of the page
margin
in the margin (left, right, or both)
top
at the top of the page
opposite
on the opposite, i.e. facing, page
overleaf
on the other side of the leaf
above
above the line
end
at the end of e.g. chapter or volume.
inline
within the body of the text.
inspace
in a predefined space, for example left by an earlier scribe.
characterizes the element in some sense, using any convenient
classification scheme or typology.
provides a sub-categorization of the element, if needed
specifies the destination of the reference by supplying one or more URI References
specifies the intended meaning when the target of a
pointer is itself a pointer.
all
if the element pointed to is itself a pointer, then
the target of that pointer will be taken, and so on, until
an element is found which is not a pointer.
one
if the element pointed to is itself a pointer, then
its target (whether a pointer or not) is taken as the target
of this pointer.
none
no further evaluation of targets is carried out
beyond that needed to find the element specified in the
pointer's target.
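The three evaluation modes described above (all, one, none) can be illustrated with a small resolver. This is a minimal sketch, assuming pointers are modelled as a dictionary from the identifier of a pointer element to the identifier of its target; the function name, the dictionary layout, and the cycle check are assumptions made for illustration.

```python
def resolve(target, pointers, evaluate="none"):
    """Return the final target identifier under the given evaluation mode.

    pointers maps the id of each pointer element to the id it points at;
    any id missing from the mapping is treated as a non-pointer element.
    """
    if evaluate == "none":
        # No evaluation beyond finding the element named in the target.
        return target
    if evaluate == "one":
        # Follow a single further step, whatever it leads to.
        return pointers.get(target, target)
    if evaluate == "all":
        # Keep following until a non-pointer element is reached.
        seen = set()
        while target in pointers:
            if target in seen:
                raise ValueError("pointer cycle detected")
            seen.add(target)
            target = pointers[target]
        return target
    raise ValueError(f"unknown evaluation mode: {evaluate!r}")
```

With `pointers = {"p1": "p2", "p2": "div1"}`, resolving `"p1"` yields `"p1"` for none, `"p2"` for one, and `"div1"` for all.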
specifies the source from which declarations and definitions for
the components of the object being defined may be obtained.
characterizes the function of the segment.
specifies whether or not the segment is fragmented by some other
structural element, for example a clause which is divided between two
or more sentences.
Y
(yes) the segment is incomplete in some respect
N
(no) either the segment is complete, or no claim is made as to
its completeness
I
(initial) the initial part of an incomplete segment
M
(medial) a medial part of an incomplete segment
F
(final) the final part of an incomplete segment
(edition) supplies an arbitrary identifier for the source edition in which
the associated feature (for example, a page, column, or line
break) occurs at this point in the text.
indicates the end of a span initiated by the element
bearing this attribute.
indicates the location within a temporal alignment at
which this element begins.
indicates the location within a temporal alignment at
which this element ends.
signifies the hand of the agent which made the intervention.
indicates the effect of the intervention, for example in
the case of a deletion, strikeouts
which include too much or too little text, or in the case of an
addition, an insertion which duplicates some of the text
already present.
Sample values include: 1] duplicate; 2] duplicate-partial; 3] excessStart; 4] excessEnd; 5] shortStart; 6] shortEnd; 7] partial; 8] unremarkable
(sequence) assigns a sequence number related to the order in which
the encoded features carrying this attribute are believed to have occurred.
specifies the version name or number of the source from
which the translated version was derived
indicates whether the name component is given in full, as an
abbreviation or simply as an initial.
yes
the name component is spelled out in full.
abb
(abbreviated) the name component is given in an abbreviated form.
init
(initial letter) the name component is indicated only by
one initial.
specifies the sort order of the name component in relation
to others within the personal name.
(paragraph) marks paragraphs in prose.
(foreign) identifies a word or phrase as belonging to some language other than that of the
surrounding text.
(emphasized) marks words or phrases which are stressed or emphasized for
linguistic or rhetorical effect.
(highlighted) marks a word or phrase as graphically distinct from the
surrounding text, for reasons concerning which no claim is
made.
identifies any word or phrase which is regarded as linguistically distinct, for example as
archaic, technical, dialectal, non-preferred, etc., or as forming part of a sublanguage.
specifies the sublanguage or register to which the word or phrase is being
assigned
specifies how the phrase is distinct diachronically
specifies how the phrase is distinct diatopically
specifies how the phrase is distinct diastatically
(speech or thought) indicates passages thought or spoken aloud, whether explicitly indicated in the source or
not, whether directly or indirectly reported, whether by real people or fictional characters.
may be used to indicate whether the quoted matter is regarded as having been vocalized
or signed.
may be used to indicate whether the quoted matter is regarded as direct or indirect
speech.
(quotation) contains a phrase or passage attributed by the narrator or author to some agency external
to the text.
(separated from the surrounding text with quotation marks) contains material which is marked as (ostensibly) being somehow different than the
surrounding text, for any one of a variety of reasons including, but not limited to: direct
speech or thought, technical terms or jargon, authorial distance, quotations from elsewhere, and
passages that are mentioned but not used.
may be used to indicate whether the offset passage is spoken or thought, or to
characterize it more finely.
Suggested values include: 1] spoken; 2] thought; 3] written; 4] soCalled; 5] foreign(foreign words) ; 6] distinct(linguistically distinct) ; 7] term(technical term) ; 8] emph(rhetorically emphasized) ; 9] mentioned
spoken
representation of speech
thought
representation of thought, e.g. internal monologue
written
quotation from a written source
soCalled
authorial distance
foreign
(foreign words)
distinct
(linguistically distinct)
term
(technical term)
emph
(rhetorically emphasized)
mentioned
referring to itself, not its normal referent
(cited quotation) contains a quotation from some other document, together with a bibliographic reference to
its source. In a dictionary it may contain an example text with at least one occurrence of the
word form, used in the sense being described, or a translation of the headword, or an example.
marks words or phrases mentioned, not used.
contains a word or phrase for which the author or narrator indicates a disclaiming of
responsibility, for example by the use of scare quotes or italics.
(description) contains a brief description of the object documented by its parent element, including its
intended usage, purpose, or application where this is appropriate.
identifies a phrase or word used to provide a gloss or definition for some other word or
phrase.
(canonical reference) identifies the associated term element using a canonical reference from a
scheme defined in a refsDecl element in the TEI header
contains a single-word, multi-word, or symbolic designation which is regarded as a technical
term.
identifies the associated gloss element using a canonical reference from a
scheme defined in a refsDecl element in the TEI header
supplies the sort key for this term in an index.
(Latin for thus or so) contains text reproduced although apparently incorrect or inaccurate.
(correction) contains the correct form of a passage apparently erroneous in the copy text.
groups a number of alternative encodings for the same point in
a text.
(regularization) contains a reading which has been regularized or normalized in some sense.
(original form) contains a reading which is marked as following the original, rather than being normalized
or corrected.
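The grouping of alternative encodings described above can be illustrated by flattening a passage to plain text while preferring one alternative. This is a rough sketch, assuming the element names choice, sic and corr, an un-namespaced sample fragment (real TEI files normally declare a namespace), and a hypothetical `flatten` helper.

```python
import xml.etree.ElementTree as ET

# Hypothetical sample: one apparent error paired with its correction.
SAMPLE = "<p>I <choice><sic>recieved</sic><corr>received</corr></choice> it.</p>"


def flatten(elem, prefer):
    """Return the text of elem, taking the preferred child of each choice."""
    parts = [elem.text or ""]
    for child in elem:
        if child.tag == "choice":
            chosen = child.find(prefer)
            if chosen is None and len(child):
                chosen = child[0]  # fall back to the first alternative
            if chosen is not None:
                parts.append(flatten(chosen, prefer))
        else:
            parts.append(flatten(child, prefer))
        parts.append(child.tail or "")
    return "".join(parts)
```

Calling `flatten(ET.fromstring(SAMPLE), "corr")` returns the corrected reading, and passing `"sic"` instead returns the text as it stands in the source.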
(gap) indicates a point where material has been omitted in a transcription, whether for editorial
reasons described in the TEI header, as part of sampling practice, or because the material is
illegible, invisible, or inaudible.
gives the reason for omission. Sample values include sampling,
inaudible, irrelevant, cancelled.
in the case of text omitted from the transcription because of deliberate deletion by an
identifiable hand, signifies the hand which made the deletion.
In the case of text omitted because of damage, categorizes the cause of the damage, if
it can be identified.
Sample values include: 1] rubbing; 2] mildew; 3] smoke
(addition) contains letters, words, or phrases inserted in the text by an
author, scribe, annotator, or corrector.
(deletion) contains a letter, word, or passage deleted, marked as deleted, or otherwise indicated as
superfluous or spurious in the copy text by an author, scribe, annotator, or corrector.
contains a word, phrase, or passage which cannot be transcribed with certainty because it
is illegible or inaudible in the source.
indicates why the material is hard to transcribe.
Where the difficulty in transcription arises from action (partial deletion, etc.)
assignable to an identifiable hand, signifies the hand responsible for the action.
Where the difficulty in transcription arises from damage, categorizes the cause of
the damage, if it can be identified.
Sample values include: 1] rubbing; 2] mildew; 3] smoke
(name, proper noun) contains a proper noun or noun phrase.
(referencing string) contains a general purpose name or referring string.
indicates more specifically the object referred to by the referencing string.
Values might include person, place, ship,
element etc.
(electronic mail address) contains an e-mail address identifying a location to which
e-mail messages can be delivered.
contains a postal address, for example of a
publisher, an organization, or an individual.
(address line) contains one line of a postal address.
a full street address including any name or number identifying a
building as well as the name of the street or route on which it is
located.
(postal code) contains a numerical or alphanumeric code used as part of a postal address to simplify
sorting or delivery of mail.
(postal box or post office box) contains a number or other identifier for some postal delivery point other than a street
address.
(number) contains a number, written in any form.
indicates the type of numeric value.
Suggested values include: 1] cardinal; 2] ordinal; 3] fraction; 4] percentage
cardinal
absolute number, e.g. 21, 21.5
ordinal
ordinal number, e.g. 21st
fraction
fraction, e.g. one half or three-quarters
percentage
a percentage
supplies the value of the number in standard form.
contains a word or phrase referring to some quantity of an object or commodity, usually
comprising a number, a unit, and a commodity name.
specifies the type of measurement in any convenient typology.
(measure group) contains a group of dimensional specifications which relate to the same object, for example
the height and width of a manuscript page.
contains a date in any format.
indicates the system or calendar to which the date represented by the content of this
element belongs.
Suggested values include: 1] Gregorian; 2] Julian; 3] Islamic; 4] Hebrew; 5] Revolutionary; 6] Iranian; 7] Coptic; 8] Chinese
Gregorian
Gregorian calendar
Julian
Julian calendar
Islamic
Islamic or Muslim (hijri) lunar calendar
Hebrew
Hebrew or Jewish lunisolar calendar
Revolutionary
French Revolutionary calendar
Iranian
Iranian or Persian (Jalaali) solar calendar
Coptic
Coptic or Alexandrian calendar
Chinese
Chinese lunisolar calendar
contains a phrase defining a time of day in any format.
(abbreviation) contains an abbreviation of any sort.
allows the encoder to classify the abbreviation according to some convenient
typology.
Sample values include: 1] suspension; 2] contraction; 3] brevigraph; 4] superscription; 5] acronym; 6] title; 7] organization; 8] geographic
(expansion) contains the expansion of an abbreviation.
(pointer) defines a pointer to another location.
Only one of the
attributes 'target' and 'cRef' may be supplied.
(canonical reference) specifies the destination of the pointer by supplying a canonical reference from a
scheme defined in a refsDecl element in the TEI header
(reference) defines a reference to another location, possibly modified by additional text or comment.
Only one of the
attributes 'target' and 'cRef' may be supplied.
(canonical reference) specifies the destination of the reference by supplying a canonical reference from a
scheme defined in a refsDecl element in the TEI header
(list) contains any sequence of items organized as a list.
describes the form of the list.
Suggested values include: 1] ordered; 2] bulleted; 3] simple; 4] gloss
ordered
list items are numbered or lettered.
bulleted
list items are marked with a bullet or other typographic device.
simple
list items are not numbered or bulleted.
gloss
each list item glosses some term or concept, which is given by a label element
preceding the list item.
contains one component of a list.
contains the label associated with an item in a list; in glossaries, marks the term being
defined.
(heading) contains any type of heading, for example the title of a section, or the heading of a list,
glossary, manuscript description, etc.
(heading for list labels) contains the heading for the label or term column in a glossary list or similar structured
list.
(heading for list items) contains the heading for the item or gloss column in a glossary list or similar structured
list.
contains a note or annotation.
indicates whether the copy text shows the exact place of reference for the note.
points to the end of the span to which the note is attached, if the note is not embedded
in the text at that point.
(index entry) marks a location to be indexed for whatever purpose.
supplies a name to specify which index (of several) the index entry belongs to.
indicates the location of an inline graphic, illustration, or figure.
The display width of the image
The display height of the image
A scale factor to be applied to the image to make it the desired display size
(uniform resource locator) A URL which refers to the image itself.
provides encoded binary data representing an inline graphic or other object.
The display width of the object
The display height of the object
A scale factor to be applied to the object to make it the desired display size
The encoding used to encode the binary data. If not specified, this is assumed to be
Base64.
marks a boundary point separating any kind of section of a text, typically but not
necessarily indicating a point at which some part of a standard reference system changes, where
the change is not represented by a structural element.
provides a conventional name for the kind of section changing at this milestone.
Suggested values include: 1] page; 2] column; 3] line; 4] book; 5] poem; 6] canto; 7] speaker; 8] stanza; 9] act; 10] scene; 11] section; 12] absent; 13] unnumbered
page
physical page breaks (synonymous with the pb element).
column
column breaks.
line
line breaks (synonymous with the lb element).
book
any units termed book, liber, etc.
poem
individual poems in a collection.
canto
cantos or other major sections of a poem.
speaker
changes of speaker or narrator.
stanza
stanzas within a poem, book, or canto.
act
acts within a play.
scene
scenes within a play or act.
section
sections of any kind.
absent
passages not present in the reference edition.
unnumbered
passages present in the text, but not to be included as part of the reference.
(page break) marks the boundary between one page of a text and the next in a standard reference system.
(line break) marks the start of a new (typographic) line in some edition or version of a text.
(column break) marks the boundary between one column of a text and the next
in a standard reference system.
(analytic level) contains bibliographic elements describing an item (e.g. an article or poem) published
within a monograph or journal and not as an independent publication.
(monographic level) contains bibliographic elements describing an item (e.g. a book or journal) published as an
independent item (i.e. as a separate physical object).
(series information) contains information about the series in which a book or other bibliographic item has
appeared.
in a bibliographic reference, contains the name(s) of the
author(s), personal or corporate, of a work; for example in the same
form as that provided by a recognized bibliographic name authority.
secondary statement of responsibility for a bibliographic item, for example the name of an
individual, institution or organization, (or of several such) acting as editor, compiler,
translator, etc.
(statement of responsibility) supplies a statement of responsibility for the intellectual content of a text, edition,
recording, or series, where the specialized elements for authors, editors, etc. do not suffice
or do not apply.
(responsibility) contains a phrase describing the nature of a person's intellectual responsibility.
contains a title for any kind of work.
indicates the bibliographic level for a title, that is, whether
it identifies an article, book, journal, series, or
unpublished material.
a
(analytic) analytic title (article, poem, or other item
published as part of a larger item)
m
(monographic) monographic title (book, collection, or
other item published as a distinct item,
including single volumes of multi-volume
works)
j
(journal) journal title
s
(series) series title
u
(unpublished) title of unpublished material (including
theses and dissertations unless
published by a commercial press)
classifies the title according to some convenient typology.
Sample values include: 1] main; 2] sub(subordinate) ; 3] alt(alternate) ; 4] short; 5] desc(descriptive)
contains the formalized descriptive title for a meeting or conference, for use in a
bibliographic description for an item derived from such a meeting, or as a heading or preamble
to publications emanating from it.
groups information relating to the publication or distribution
of a bibliographic item.
provides the name of the organization responsible for the publication or distribution of a
bibliographic item.
(scope of citation) defines the scope of a bibliographic reference, for example as a
list of page numbers, or a named subdivision of a larger work.
identifies the type of information conveyed by the element, e.g.
columns, pages, volume.
Suggested values include: 1] vol(volume) ; 2] issue; 3] pp(pages) ; 4] ll (lines) ; 5] chap(chapter) ; 6] part
vol
(volume) the element contains a volume number.
issue
the element contains an issue number, or volume and
issue numbers.
pp
(pages) the element contains a page number or page range.
ll
(lines) the element contains a line number or line range.
chap
(chapter) the element contains a chapter indication (number
and/or title)
part
the element identifies a part of a book or collection.
specifies the starting point of the range of units indicated by the type attribute.
specifies the end-point of the range of units indicated by the type attribute.
(publication place) contains the name of the place where a bibliographic item was published.
(bibliographic citation) contains a loosely-structured bibliographic citation of which the sub-components may or may
not be explicitly tagged.
(structured bibliographic citation) contains a structured bibliographic citation, in which only bibliographic sub-elements
appear and in a specified order.
(citation list) contains a list of bibliographic citations of any kind.
contains or references some other bibliographic item which is related to the present one in
some specified manner, for example as a constituent or alternative version of it.
If the 'target' attribute is used, the
relatedItem element must be empty
A relatedItem element should have either a 'target' attribute
or a child element to indicate the related bibliographic item
points to the related bibliographic element by means of an
absolute or relative URI reference
(verse line) contains a single, possibly incomplete, line of verse.
specifies whether or not the line is metrically complete.
Y
(yes) the line is metrically incomplete
N
(no) either the line is complete, or no claim is made as to its completeness
I
(initial) the initial part of an incomplete line
M
(medial) a medial part of an incomplete line
F
(final) the final part of an incomplete line
(line group) contains a group of verse lines functioning as a formal unit, e.g. a stanza, refrain,
verse paragraph, etc.
(speech) An individual speech in a performance text, or a passage presented as such in a prose or
verse text.
A specialized form of heading or label, giving the name of one or more speakers in a
dramatic text or fragment.
(stage direction) contains any kind of stage direction within a dramatic text or fragment.
indicates the kind of stage direction.
Suggested values include: 1] setting; 2] entrance; 3] exit; 4] business; 5] novelistic; 6] delivery; 7] modifier; 8] location; 9] mixed
setting
describes a setting.
entrance
describes an entrance.
exit
describes an exit.
business
describes stage business.
novelistic
is a narrative, motivating stage direction.
delivery
describes how a character speaks.
modifier
gives some detail about a character.
location
describes a location.
mixed
more than one of the above
contains the whole of a TEI encoded corpus, comprising a single corpus header and one or
more TEI elements, each containing a single text header and a text.
The version of the TEI scheme
(automatically generated text division) indicates the location at which a textual division generated
automatically by a text-processing application is to appear.
specifies what type of generated text division (e.g. index,
table of contents, etc.) is to appear.
Sample values include: 1] index; 2] toc; 3] figlist; 4] tablist
(TEI Header) supplies the descriptive and declarative information making up an electronic title page
prefixed to every TEI-conformant text.
specifies the kind of document to which the header is attached, for example whether it
is a corpus or individual text.
Sample values include: 1] text; 2] corpus
(file description) contains a full bibliographic description of an electronic file.
(title statement) groups information about the title of a work and those responsible for its intellectual
content.
specifies the name of a sponsoring organization or institution.
(funding body) specifies the name of an individual, institution, or organization responsible for the
funding of a project or text.
(principal researcher) supplies the name of the principal researcher responsible for the
creation of an electronic text.
(edition statement) groups information relating to one edition of a text.
(edition) describes the particularities of one edition of a text.
describes the approximate size of a text as stored on some carrier medium, whether digital
or non-digital, specified in any convenient units.
Which file (level) does the extent refer to? (Possible
values are text.xml, ann_segmentation.xml, etc.)
(publication statement) groups information concerning the publication or distribution of an electronic or other
text.
Possible values are: "balanced" (for text in the 300-million-word balanced subcorpus), "unbalanced" (for other texts which may be distributed by NKJP), "restricted" (for texts available only for the internal NKJP purposes, not to be distributed), "one_million" (for texts in the 1-million-word manually annotated sample).
balanced
unbalanced
restricted
one_million
supplies the name of a person or other agency responsible for the
distribution of a text.
(release authority) supplies the name of a person or other agency responsible for
making an electronic file available, other than a publisher or
distributor.
(identifier) supplies any form of identifier used to identify some object,
such as a bibliographic item, a person, a title, an organization,
etc. in a standardized way.
categorizes the identifier, for example as an ISBN, Social
Security number, etc.
supplies information about the availability of a text, for example any restrictions on its
use or distribution, its copyright status, etc.
supplies a code identifying the current availability of the text.
free
the text is freely available.
unknown
the status of the text is unknown.
restricted
the text is not freely available.
(series statement) groups information about the series, if any, to which a publication belongs.
(notes statement) collects together any notes providing information about a text additional to that recorded
in other parts of the bibliographic description.
(source description) describes the source from which an electronic text was derived or generated, typically a
bibliographic description in the case of a digitized text, or a phrase such as "born digital"
for a text which has no previous existence.
(fully-structured bibliographic citation) contains a fully-structured bibliographic citation, in which all components of the TEI file
description are present.
(encoding description) documents the relationship between an electronic text and the
source or sources from which it was derived.
(project description) describes in detail the aim or purpose for which an electronic file was encoded, together
with any other relevant information concerning the process by which it was assembled or
collected.
(sampling declaration) contains a prose description of the rationale and methods used in sampling texts in the
creation of a corpus or collection.
(editorial practice declaration) provides details of editorial principles and practices applied
during the encoding of a text.
(correction principles) states how and under what circumstances corrections have been made in the text.
indicates the degree of correction applied to the text.
high
the text has been thoroughly checked and proofread.
medium
the text has been checked at least once.
low
the text has not been checked.
unknown
the correction status of the text is unknown.
indicates the method adopted to indicate corrections within the text.
silent
corrections have been made silently
markup
corrections have been represented using markup
indicates the extent of normalization or regularization of the original source carried out
in converting it to electronic form.
indicates the authority for any normalization carried out.
indicates the method adopted to indicate normalizations within the text.
silent
normalization made silently
markup
normalization represented using markup
specifies editorial practice adopted with respect to quotation marks in the original.
(quotation marks) indicates whether or not quotation marks have been retained as content within the text.
none
no quotation marks have been retained
some
some quotation marks have been retained
all
all quotation marks have been retained
specifies how quotation marks are indicated within the text.
summarizes the way in which hyphenation in a source text has been treated in an encoded
version of it.
(end-of-line) indicates whether or not end-of-line hyphenation has been retained in a text.
all
all end-of-line hyphenation has been retained, even though the lineation of the
original may not have been.
some
end-of-line hyphenation has been retained in some cases.
hard
all soft end-of-line hyphenation has been removed: any remaining end-of-line
hyphenation should be retained.
none
all end-of-line hyphenation has been removed: any remaining hyphenation occurred
within the line.
describes the principles according to which the text has been segmented, for example into
sentences, tone-units, graphemic strata, etc.
(standard values) specifies the format used when standardized date or number values are supplied.
describes the scope of any analytic or interpretive information added to the text in
addition to the transcription.
(tagging declaration) provides detailed information about the tagging applied to a document.
supplies information about the usage of a specific element within a text.
(element name) the name (generic identifier) of the element indicated by the tag.
specifies the number of occurrences of this element within the text.
(with unique identifier) specifies the number of occurrences of this element within the text which bear a
distinct value for the global xml:id attribute.
specifies the identifier of a rendition element which defines how this element
is to be rendered.
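As an illustration of the two counts described above (occurrences of an element, and occurrences bearing an xml:id), the following Python sketch tallies both for every element name in a document. The function name tag_usage and the sample input are hypothetical. It counts elements that carry an xml:id at all, which equals the number of distinct values only if the identifiers are unique, as they must be in a valid document.

```python
import xml.etree.ElementTree as ET
from collections import Counter

# ElementTree exposes xml:id under the fixed XML namespace URI.
XML_ID = "{http://www.w3.org/XML/1998/namespace}id"

def tag_usage(xml_text):
    """Map each element name to (occurrences, occurrences with xml:id)."""
    root = ET.fromstring(xml_text)
    occurs = Counter()
    with_id = Counter()
    for el in root.iter():
        occurs[el.tag] += 1
        if XML_ID in el.attrib:
            with_id[el.tag] += 1
    return {tag: (occurs[tag], with_id[tag]) for tag in occurs}

# Hypothetical sample document.
sample = '<text><p xml:id="p1">a</p><p>b</p><p xml:id="p3">c</p></text>'
print(tag_usage(sample))
```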
supplies the formal name of the namespace to which the elements documented by its children
belong.
the full formal name of the namespace concerned.
supplies information about the rendition or appearance of one or more elements in the source
text.
identifies the language used to describe the rendition.
css
Cascading Stylesheet Language
xslfo
Extensible Stylesheet Language Formatting Objects
free
Informal free text description
other
A user-defined rendition description language
where CSS is used, provides a way of defining
pseudo-elements, that is, styling rules
applicable to specific sub-portions of an element.
(references declaration) specifies how canonical references are constructed for this
text.
(canonical reference pattern) specifies an expression and replacement pattern for transforming a canonical reference into
a URI.
specifies a regular expression against which the values of cRef attributes
can be matched.
specifies a replacement pattern which, once subpattern substitution
has been performed, provides a URI.
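As a rough illustration of how a match pattern and a replacement pattern might work together, the Python sketch below turns a canonical reference into a URI. The pattern, the replacement string, the file-naming scheme, and the function name resolve_cref are all hypothetical. The replacement uses Python's \1-style group references, which may differ from the notation a given TEI processor expects.

```python
import re

# Hypothetical pattern pair: "Matt 5:7" -> "Matt.xml#c5v7".
MATCH_PATTERN = r"(\w+) (\d+):(\d+)"
REPLACEMENT_PATTERN = r"\1.xml#c\2v\3"

def resolve_cref(cref, match_pattern=MATCH_PATTERN,
                 replacement_pattern=REPLACEMENT_PATTERN):
    """Return the URI for cref, or None if cref does not match."""
    m = re.fullmatch(match_pattern, cref)
    if m is None:
        return None
    # Substitute the captured subpatterns into the replacement pattern.
    return m.expand(replacement_pattern)

print(resolve_cref("Matt 5:7"))
```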
(reference state) specifies one component of a canonical reference defined by the milestone method.
indicates what kind of state is changing at this milestone.
Suggested values include: 1] page; 2] column; 3] line; 4] book; 5] poem; 6] canto; 7] stanza; 8] act; 9] scene; 10] section; 11] absent
page
page breaks in the reference edition.
column
column breaks.
line
line breaks.
book
any units termed book, liber, etc.
poem
individual poems in a collection.
canto
cantos or other major sections of a poem.
stanza
stanzas within a poem, book, or canto.
act
acts within a play.
scene
scenes within a play or act.
section
sections of any kind.
absent
passages not present in the reference edition.
specifies the fixed length of the reference component.
(delimiter) supplies a delimiting string following the reference component.
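To show how fixed lengths and delimiters could be used to take a milestone-style reference apart, here is a minimal Python sketch. The function name split_reference, the tuple layout (unit, length, delim), and the sample references are assumptions made for illustration. It also assumes that a component with neither a length nor a delimiter consumes the rest of the string.

```python
def split_reference(ref, states):
    """Split ref into named components.

    states is a list of (unit, length, delim) tuples, one per reference
    state, in order. length and delim may each be None.
    """
    parts = {}
    rest = ref
    for unit, length, delim in states:
        if length is not None:
            # Fixed-length component: take exactly that many characters.
            value, rest = rest[:length], rest[length:]
        elif delim is not None:
            # Delimited component: take everything up to the delimiter.
            value, _, rest = rest.partition(delim)
        else:
            # Assumed: an unconstrained component takes the remainder.
            value, rest = rest, ""
        parts[unit] = value
    return parts

# Hypothetical reference "2.3.15" read as book 2, poem 3, line 15.
states = [("book", None, "."), ("poem", None, "."), ("line", None, None)]
print(split_reference("2.3.15", states))
```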
(classification declarations) contains one or more taxonomies defining any classificatory
codes used elsewhere in the text.
defines a typology used to classify texts either implicitly, by means of a bibliographic
citation, or explicitly by a structured taxonomy.
contains an individual descriptive category, possibly nested within a superordinate
category, within a user-defined taxonomy.
(category description) describes some category within a taxonomy or text typology, either in the form of a brief
prose description or in terms of the situational parameters used by the TEI formal textDesc.
(application information) records information about an application which has
edited the TEI file.
provides information about an application which has acted upon the document.
Supplies an identifier for the application, independent of its version number or display
name.
Supplies a version number for the application, independent of its identifier or display
name.
[\d]+[a-z]*[\d]*(\.[\d]+[a-z]*[\d]*){0,3}
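The version-number pattern above can be tried out directly. The Python sketch below compiles it unchanged and tests a few strings. The function name is_valid_version and the sample values are hypothetical. The whole string is required to match, on the assumption that the pattern is anchored in the way schema patterns normally are.

```python
import re

# Pattern copied verbatim from the entry above.
VERSION_RE = re.compile(r"[\d]+[a-z]*[\d]*(\.[\d]+[a-z]*[\d]*){0,3}")

def is_valid_version(value):
    """True if the entire string matches the version-number pattern."""
    return VERSION_RE.fullmatch(value) is not None

for candidate in ["1.5", "2.0rc1", "1.2.3.4", "1.2.3.4.5", "v1.0"]:
    print(candidate, is_valid_version(candidate))
```

At most four dot-separated components are accepted, and each must begin with a digit, so a leading "v" is rejected.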
(text-profile description) provides a detailed description of non-bibliographic aspects of a text, specifically the
languages and sublanguages used, the situation in which it was produced, the participants and
their setting.
contains information about the creation of a text.
(language usage) describes the languages, sublanguages, registers, dialects, etc.
represented within a text.
characterizes a single language or sublanguage used within a text.
(identifier) Supplies a language code constructed as defined in BCP 47 which is used to identify the
language documented by this element, and which is referenced by the global
xml:lang attribute.
specifies the approximate percentage (by volume) of the text which uses this language.
100
(text classification) groups information which describes the nature or topic of a text in terms of a standard
classification scheme, thesaurus, etc.
contains a list of keywords or phrases identifying the topic or nature of a text.
identifies the controlled vocabulary within which the set of keywords concerned is
defined.
(classification code) contains the classification code used for this text in some standard classification system.
identifies the classification system or taxonomy in use.
(category reference) specifies one or more defined categories within some taxonomy or text typology.
identifies the classification scheme within which the set of categories concerned is
defined
(revision description) summarizes the revision history for a file.
summarizes a particular change or correction made to a particular version of an electronic
text which is shared between several researchers.
(geographic coordinates declaration) documents the notation and the datum used for geographic coordinates expressed as content of
the geo element elsewhere within the document.
supplies a commonly used code name for the datum employed.
Suggested values include: 1] WGS84(World Geodetic System) ; 2] MGRS(Military Grid Reference System) ; 3] OSGB36(ordnance survey great britain) ; 4] ED50(European Datum coordinate system)
WGS84
(World Geodetic System) a pair of numbers to be interpreted as latitude followed by longitude according to
the World Geodetic System.
MGRS
(Military Grid Reference System) the values supplied are geospatial entity object codes, based on
OSGB36
(ordnance survey great britain) the value supplied is to be interpreted as a British National Grid Reference.
ED50
(European Datum coordinate system) the value supplied is to be interpreted as latitude followed by longitude according
to the European Datum coordinate system.
(text description) provides a description of a text in terms of its
situational parameters.
(participation description) describes the identifiable speakers, voices, or other participants
in any kind of text.
(setting description) describes the setting or settings within which a language
interaction takes place, either as a prose description or as a
series of setting elements.
(primary channel) describes the medium or channel by which a text is delivered or
experienced. For a written text, this might be print, manuscript, e-mail, etc.;
for a spoken one, radio, telephone, face-to-face, etc.
specifies the mode of this channel with respect to speech and
writing.
s
(spoken)
w
(written)
sw
(spoken to be written) e.g. dictation
ws
(written to be spoken) e.g. a script
m
(mixed)
x
(unknown or inapplicable)
describes the internal composition of a text or text sample, for example
as fragmentary, complete, etc.
specifies how the text was constituted.
single
a single complete text
composite
a text made by combining several smaller
items, each individually complete
frags
(fragments) a text made by combining several smaller, not
necessarily complete, items
unknown
composition unknown or unspecified
describes the nature and extent of originality of this text.
categorizes the derivation of the text.
Sample values include: 1] original; 2] revision; 3] translation; 4] abridgment; 5] plagiarism; 6] traditional
(domain of use) describes the most important social context in which the text was
realized or for which it is intended, for example private vs. public,
education, religion, etc.
categorizes the domain of use.
Sample values include: 1] art; 2] domestic; 3] religious; 4] business; 5] education; 6] govt(government) ; 7] public
describes the extent to which the text may be regarded as
imaginative or non-imaginative, that is, as describing a fictional
or a non-fictional world.
categorizes the factuality of the text.
fiction
the text is to be regarded as entirely imaginative
fact
the text is to be regarded as entirely informative or factual
mixed
the text contains a mixture of fact and fiction
inapplicable
the fiction/fact distinction is not regarded
as helpful or appropriate to this text
describes the extent, cardinality and nature of any interaction
among those producing and experiencing the text, for example in the
form of response or interjection, commentary, etc.
specifies the degree of interaction between
active and passive participants in the text.
none
no interaction of any kind, e.g. a monologue
partial
some degree of interaction, e.g. a monologue with set responses
complete
complete interaction, e.g. a face to face conversation
inapplicable
this parameter is inappropriate or inapplicable in this case
specifies the number of active participants
(or addressors) producing parts of the text.
Suggested values include: 1] singular; 2] plural; 3] corporate; 4] unknown
singular
a single addressor
plural
many addressors
corporate
a corporate addressor
unknown
number of addressors unknown or unspecifiable
specifies the number of passive participants
(or addressees) to whom a text is directed
or in whose presence it is created or performed.
Suggested values include: 1] self; 2] single; 3] many; 4] group; 5] world
self
text is addressed to the originator e.g. a diary
single
text is addressed to one other person e.g. a personal letter
many
text is addressed to a countable number of others
e.g. a conversation in which all participants are identified
group
text is addressed to an undefined but fixed
number of participants e.g. a lecture
world
text is addressed to an undefined and indeterminately
large number e.g. a published book
describes the extent to which a text may be regarded as
prepared or spontaneous.
a keyword characterizing the type of preparedness.
Sample values include: 1] none; 2] scripted; 3] formulaic; 4] revised
characterizes a single purpose or communicative function of the
text.
specifies a particular kind of purpose.
Suggested values include: 1] persuade; 2] express; 3] inform; 4] entertain
persuade
didactic, advertising, propaganda, etc.
express
self expression, confessional, etc.
inform
convey information, educate, etc.
entertain
amuse, entertain, etc.
specifies the extent to which this purpose predominates.
describes one particular setting in which a language
interaction takes place.
contains a brief informal description of the kind of
place concerned, for example: a room, a restaurant, a park bench, etc.
contains a brief informal description of what a participant in a
language interaction is doing other than speaking, if anything.
(script statement) contains a citation giving details of the script used for
a spoken text.
(recording statement) describes a set of recordings used as the basis for transcription of a
spoken text.
(recording event) details of an audio or video recording event
used as the source of a spoken text, either directly or from
a public broadcast.
the kind of recording.
audio
audio recording
video
audio and video recording
provides technical details of the equipment and media used for
an audio or video recording used as the source for a spoken text.
describes a broadcast used as the source of a spoken text.
(utterance) Clean restriction of TEI's u(tterance).
(transition) indicates the nature of the transition between this utterance
and the previous one.
smooth
this utterance begins without unusual pause or rapidity.
latching
this utterance begins with a markedly shorter pause than normal.
overlap
this utterance begins before the previous one has finished.
pause
this utterance begins after a noticeable pause.
a pause either between or within utterances.
any vocalized but not necessarily lexical phenomenon, for
example voiced pauses, non-lexical backchannels, etc.
indicates whether or not the phenomenon is repeated.
any communicative phenomenon, not necessarily vocalized, for
example a gesture, frown, etc.
indicates whether or not the phenomenon is
repeated.
any phenomenon or occurrence, not necessarily vocalized or
communicative, for example incidental noises or other events affecting
communication.
a passage of written text revealed to participants in the
course of a spoken text.
points to a bibliographic citation in the header giving
a full description of the source or script of the
writing.
indicates whether the writing is revealed all at once or
gradually.
marks the point at which some paralinguistic feature of a series of
utterances by any one speaker changes.
a paralinguistic feature.
tempo
speed of utterance.
loud
loudness.
pitch
pitch range.
tension
tension or stress pattern.
rhythm
rhythmic qualities.
voice
voice quality.
specifies the new state of the paralinguistic feature specified.
(organization name) contains an organizational name.
(personal name) contains a proper noun or proper-noun phrase referring to a person, possibly including any
or all of the person's forenames, surnames, honorifics, added names, etc.
contains a family (inherited) name, as opposed to a given, baptismal, or nick name.
contains a forename, given or baptismal name.
(generational name component) contains a name component used to distinguish otherwise similar names on the basis of the relative ages or generations of the persons
named.
(name link) contains a connecting phrase or link used within a name but not regarded as part of it, such as van der or
of.
(additional name) contains an additional name component, such as a nickname, epithet, or alias, or any other descriptive phrase used within a personal
name.
contains a name component which indicates that the referent has a particular role or position in society, such as an official title or
rank.
contains an absolute or relative place name.
(bloc) contains the name of a geo-political unit consisting of two or more nation states or
countries.
(country) contains the name of a geo-political unit, such as a nation, country, colony, or
commonwealth, larger than or administratively superior to a region and smaller than a bloc.
contains the name of an administrative unit such as a state, province, or county, larger
than a settlement, but smaller than a country.
contains the name of any kind of subdivision of a settlement, such as a parish, ward, or other administrative or geographic unit.
contains the name of a settlement such as a city, town, or village identified as a single geo-political or administrative unit.
that part of a relative temporal or spatial expression which indicates the direction of the offset between the two place names, dates, or
times involved in the expression.
(geographical name) a name associated with some geographical feature such as Windrush Valley or Mount Sinai.
provides more culture-, linguistic-, or application-specific information used to categorize this name component.
(geographical feature name) contains a common noun identifying some geographical feature contained within a geographic
name, such as valley, mount, etc.
(affiliation) contains an informal description of a person's present or past affiliation with some
organization, for example an employer or sponsor.
(age) specifies the age of a person.
supplies a numeric code representing the age or age group
(birth) contains information about a person's birth, such as its date and place.
(climate) contains information about the physical climate of a place.
(death) contains information about a person's death, such as its date and place.
contains a description of the educational experience of a person.
(event) contains data relating to any kind of significant event associated with a person, place, or organization.
indicates the location of an event by pointing to a place element
specifies the faith, religion, or belief set of a person.
contains information about a person's period of activity.
(geographical coordinates) contains any expression of a set of geographic coordinates, representing a point, line, or area on the surface of the earth in some
notation.
(language knowledge) summarizes the state of a person's linguistic knowledge, either as prose or by a list of langKnown elements.
supplies one or more valid language tags for the languages specified
(language known) summarizes the state of a person's linguistic competence, i.e., knowledge of a single language.
supplies a valid language tag for the language concerned.
a code indicating the person's level of knowledge for this language
(list of organizations) contains a list of elements, each of which provides information about an identifiable
organization.
(list of events) contains a list of descriptions, each of which provides information
about an identifiable event.
(list of persons) contains a list of descriptions, each of which provides information about an identifiable
person or a group of people, for example the participants in a language interaction, or the
people referred to in a historical source.
(list of places) contains a list of places, optionally followed by a list of relationships (other than
containment) defined amongst them.
defines the location of a place as a set of geographical coordinates, in terms of other named geo-political entities, or as an
address.
contains an informal description of a person's present or past nationality or citizenship.
contains an informal description of a person's trade, profession or occupation.
identifies the classification system or taxonomy in use by supplying the identifier of a taxonomy element elsewhere in the
header.
identifies an occupation code defined within the classification system or taxonomy defined by the scheme
attribute.
(organization) provides information about an identifiable organization such as a business, a tribe, or
any other grouping of people.
specifies a primary role or classification for the organization.
(relation group) provides information about relationships identified amongst people, places, and
organizations, either informally as prose or as formally expressed relation links.
provides information about an identifiable individual, for example a participant in a language interaction, or a person referred to in a
historical source.
specifies a primary role or classification for the person.
specifies the sex of the person.
specifies an age group for the person.
(personal group) describes a group of individuals treated as a single person for analytic purposes.
specifies the role of this group of participants in the interaction.
specifies the sex of the participant group.
mixed
specifies the age group of the participants.
specifies the size or approximate size of the group.
contains data about a geographic location
contains information about the population of a place.
(relationship) describes any kind of relationship or linkage amongst a specified group of participants.
Only one of the attributes 'active' and 'mutual' may be supplied.
The attribute 'passive' may be supplied only if the attribute 'active' is supplied.
categorizes the relationship in some respect, e.g. as social, personal or other.
Suggested values include: 1] social; 2] personal; 3] other
social
relationship concerned with social roles
personal
relationship concerned with personal roles, e.g. kinship, marriage, etc.
other
other kinds of relationship
supplies a name for the kind of relationship of which this is an instance.
identifies the passive participants in a non-mutual relationship.
identifies the active participants in a non-mutual relationship, or all the participants in a mutual
one.
supplies a list of participants amongst all of whom the relationship holds equally.
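The two constraints on the active, mutual, and passive attributes stated above can be expressed as a small check. The Python sketch below is only an illustration. The function name check_relation_attributes, the dictionary representation of an element's attributes, and the sample values are hypothetical.

```python
def check_relation_attributes(attrs):
    """Return the list of constraint violations for one relationship."""
    problems = []
    if "active" in attrs and "mutual" in attrs:
        problems.append(
            "Only one of the attributes 'active' and 'mutual' may be supplied")
    if "passive" in attrs and "active" not in attrs:
        problems.append(
            "the attribute 'passive' may be supplied only if "
            "the attribute 'active' is supplied")
    return problems

# Hypothetical attribute sets.
print(check_relation_attributes(
    {"name": "parent", "active": "#p1", "passive": "#p2"}))
print(check_relation_attributes({"name": "spouse", "mutual": "#p1 #p2"}))
print(check_relation_attributes({"name": "parent", "passive": "#p2"}))
```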
(residence) describes a person's present or past places of residence.
specifies the sex of a person.
(socio-economic status) contains an informal description of a person's perceived social or economic status.
identifies the classification system or taxonomy in use.
identifies a status code defined within the classification system or taxonomy defined by the scheme attribute.
contains a description of some status or quality attributed to a person, place, or organization at some specific time.
contains information about the physical terrain of a place.
contains a description of some culturally-determined and in principle unchanging characteristic attributed to a person or place.
(canonical name) contains the definition for a canonical name or namepart of any kind.
points to constituent nyms
(list of canonical names) contains a list of nyms, that is, standardized names for any thing.
supplies the value of a date or time in a standard form.
specifies the earliest possible date for the event in standard form, e.g. yyyy-mm-dd.
specifies the latest possible date for the event in standard form, e.g. yyyy-mm-dd.
indicates the starting point of the period in standard form.
indicates the ending point of the period in standard form.
(duration) indicates the length of this element in time.
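As a small illustration of how the earliest and latest possible dates described above might be used, the following Python sketch tests whether a candidate date falls inside the bounds. The function name within_bounds and the sample dates are hypothetical. Only full yyyy-mm-dd values are handled, so reduced forms such as a bare year would need extra parsing.

```python
from datetime import date

def within_bounds(candidate, not_before=None, not_after=None):
    """True if candidate (yyyy-mm-dd) lies within the optional bounds."""
    d = date.fromisoformat(candidate)
    if not_before is not None and d < date.fromisoformat(not_before):
        return False
    if not_after is not None and d > date.fromisoformat(not_after):
        return False
    return True

# Hypothetical event known to fall within the first half of 1850.
print(within_bounds("1850-03-15", "1850-01-01", "1850-06-30"))
print(within_bounds("1850-07-01", "1850-01-01", "1850-06-30"))
```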
(TEI document) contains a single TEI-conformant document,
comprising a TEI header and a text, either in isolation or as part of a
teiCorpus element.
specifies the version number of the TEI Guidelines against
which this document is valid.
contains a single text of any kind, whether unitary or composite, for example a poem or
drama, a collection of essays, a novel, a dictionary, or a corpus sample.
(text body) contains the whole body of a single unitary text, excluding any front or back matter.
contains the body of a composite text, grouping together a sequence of distinct texts (or
groups of such texts) which are regarded as a unit for some purpose, for example the collected
works of an author, a sequence of prose essays, etc.
contains a single text of any kind, whether unitary or composite, which interrupts the text
containing it at any point and after which the surrounding text resumes.
(text division) contains a subdivision of the front, body, or back of a text.
(level-1 text division) contains a first-level subdivision of the front, body, or back of a text.
(level-2 text division) contains a second-level subdivision of the front, body, or back of a
text.
(level-3 text division) contains a third-level subdivision of the front, body, or back of a text.
(level-4 text division) contains a fourth-level subdivision of the front, body, or back of a text.
(level-5 text division) contains a fifth-level subdivision of the front, body, or back of a text.
(level-6 text division) contains a sixth-level subdivision of the front, body, or back of a text.
(level-7 text division) contains the smallest possible subdivision of the front, body or back of a text, larger than
a paragraph.
contains a closing title or footer appearing at the end of a division of a text.
contains the primary statement of responsibility given for a work
on its title page or at the head or end of the work.
contains a brief description of the place, date, time, etc. of production of a letter,
newspaper story, or other work, prefixed or suffixed to it as a kind of heading or trailer.
A formal list or prose description of the topics addressed by
a subdivision of a text.
contains a quotation, anonymous or attributed, appearing at the start of a section or
chapter, or on a title page.
groups together dateline, byline, salutation, and similar phrases appearing as a preliminary
group at the start of a division, especially of a letter.
groups together salutations, datelines, and similar phrases appearing as a final group at
the end of a division, especially of a letter.
(salutation) contains a salutation or greeting prefixed to a foreword, dedicatory epistle, or other
division of a text, or the salutation in the closing of a letter, preface, etc.
(signature) contains the closing salutation, etc., appended to a foreword,
dedicatory epistle, or other division of a text.
contains a postscript, e.g. to a letter.
(title page) contains the title page of a text, appearing within the front or back matter.
classifies the title page according to any convenient typology.
(document title) contains the title of a document, including all its
constituents, as given on a title page.
contains a subsection or division of the title of a work, as
indicated on a title page.
specifies the role of this subdivision of the title.
Suggested values include: 1] main; 2] sub(subordinate) ; 3] alt(alternate) ; 4] short; 5] desc(descriptive)
main
main title of the work
sub
(subordinate) subtitle of the work
alt
(alternate) alternative title of the work
short
abbreviated form of title
desc
(descriptive) descriptive paraphrase of the work
(document author) contains the name of the author of the document, as given on the
title page (often but not always contained in a byline).
contains a formal statement authorizing the publication of a work, sometimes required to
appear on a title page or its verso.
(document edition) contains an edition statement as presented on a title page of a
document.
(document imprint) contains the imprint statement (place and date of publication,
publisher name), as given (usually) at the foot of a title page.
(document date) contains the date of a document, as given (usually) on a title page.
gives the value of the date in standard form, i.e. YYYY-MM-DD.
(front matter) contains any prefatory matter (headers,
title page, prefaces, dedications, etc.)
found at the start of a document, before the main body.
(back matter) contains any appendixes, etc. following the main part of a text.
(attribute) contains the name of an attribute appearing within running text.
supplies an identifier for the scheme in which this name is defined.
Sample values include: 1] TEI(text encoding initiative) ; 2] DBK(docbook) ; 3] XX(unknown)
contains literal code from some formal language such as a
programming language.
(formal language) a name identifying the formal language in which the
code is expressed
(example) contains any kind of illustrative example.
(example of XML) contains a single well-formed XML fragment demonstrating the use of some XML element or
attribute, in which the egXML element itself functions as the root element.
indicates the intended validity of the example with respect to
a schema.
true
the example is intended to be fully valid,
assuming that its root element can be used as the root element in the
schema concerned.
feasible
the example could be transformed into
a valid document by inserting any number of valid attributes and child
elements anywhere within it; it is valid against a version of the
schema concerned in which every data, list, element, or attribute
element has been made optional.
false
the example is not intended to be valid,
and contains deliberate errors.
(element name) contains the name (generic identifier) of an element.
supplies the name of the scheme in which this name is defined.
Sample values include: 1] TEI(text encoding initiative) ; 2] DBK(docbook) ; 3] XX(unknown) ; 4] Schematron; 5] HTML
(identifier) contains an identifier or name for an object of some kind in a formal language.
contains text of a complete start- or end-tag, possibly including attribute specifications,
but excluding the opening and closing markup delimiter characters.
indicates the type of XML tag intended
start
a start-tag, with delimiters < and > is intended
end
an end-tag, with delimiters </ and > is intended
empty
an empty tag, with delimiters < and /> is intended
pi
a pi (processing instruction), with delimiters <? and ?> is intended
comment
a comment, with delimiters <!-- and --> is intended
ms
a marked-section, with delimiters <![CDATA[ and ]]> is intended
supplies the name of the schema in which this tag is defined.
TEI
(text encoding initiative) This tag is defined as part of the TEI scheme.
DBK
(docbook) this tag is part of the Docbook scheme.
XX
(unknown) this tag is part of an unknown scheme.
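Because the tag content excludes the markup delimiters, a renderer has to restore them from the tag type. The Python sketch below shows one way this might be done. The names TAG_DELIMITERS and render_tag and the sample inputs are hypothetical. The marked-section opener is written as <![CDATA[, the form XML defines for CDATA sections.

```python
# Opening and closing delimiters for each tag type listed above.
TAG_DELIMITERS = {
    "start": ("<", ">"),
    "end": ("</", ">"),
    "empty": ("<", "/>"),
    "pi": ("<?", "?>"),
    "comment": ("<!--", "-->"),
    "ms": ("<![CDATA[", "]]>"),
}

def render_tag(content, tag_type):
    """Wrap delimiter-free tag content in the delimiters for tag_type."""
    opener, closer = TAG_DELIMITERS[tag_type]
    return f"{opener}{content}{closer}"

print(render_tag('div type="chapter"', "start"))
print(render_tag("div", "end"))
print(render_tag("pb", "empty"))
```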
(value) contains a single attribute value.
(specification list) marks where a list of descriptions is to be inserted into the prose documentation.
(specification description) indicates that a description of the specified element or class should be included at this
point within a document.
(identifier) supplies the identifier of the documentary element or class for which a description is
to be obtained.
(attributes) supplies attribute names for which descriptions should additionally be obtained.
points to the specification for an attribute or model class which is to be included in a schema
the identifier used for the required class within the
source indicated.
points to the specification for some element which is to be included in a schema
the identifier used for the required element within the
source indicated.
points to the specification for some pattern which is to be included in a schema
the identifier used for the required pattern within the
source indicated.
(module reference) references a module which is to be incorporated into a schema.
child elements of moduleRef are only allowed when an external module
is being loaded
supplies a list of the elements which are to be copied from the
specified module into the schema being defined.
supplies a list of the elements which are not to be copied from the
specified module into the schema being defined.
the name of a TEI module
(uniform resource locator) refers to a non-TEI module of RELAX NG code by external location
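The two element lists described for a module reference act as a filter on the module's contents. Below is a minimal sketch of that filtering. The function name select_elements is hypothetical, the parameter is called exclude because except is a Python keyword, and two behaviours are assumptions rather than statements from the text above: supplying neither list copies every element, and supplying both is rejected.

```python
from typing import Iterable, List, Optional


def select_elements(
    module_elements: Iterable[str],
    include: Optional[Iterable[str]] = None,
    exclude: Optional[Iterable[str]] = None,
) -> List[str]:
    """Return the element names to copy from a module into the schema."""
    if include is not None and exclude is not None:
        # Assumption: the two lists are treated as mutually exclusive.
        raise ValueError("supply either include or exclude, not both")
    if include is not None:
        wanted = set(include)
        return [name for name in module_elements if name in wanted]
    if exclude is not None:
        unwanted = set(exclude)
        return [name for name in module_elements if name not in unwanted]
    # Assumption: with no list at all, every element is copied.
    return list(module_elements)
```

The result keeps the module's own ordering, so the order in which names appear in either list does not matter.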
(module specification) documents the structure, content, and purpose of a single
module, i.e. a named and externally visible group of declarations.
type of module to be generated
(schema specification) generates a TEI-conformant schema and documentation for it.
specifies entry points to the schema, i.e. which elements
may be used as the root of documents conforming to
it.
(namespace) specifies the default namespace (if any) applicable to
components of the schema.
specifies a default prefix which will be prepended to all patterns
relating to TEI elements, unless otherwise stated. This allows for external schemas to be mixed in
which have elements of the same names as the TEI.
(target language) specifies which language to use when creating
the objects in a schema if names for elements or attributes are available in more
than one language.
(documentation language) specifies which languages to
use when creating documentation if the description for an element, attribute, class or macro
is available in more than one language.
(specification group) contains any convenient grouping of specifications for use within
the current module.
(reference to a specification group) indicates that the declarations contained by the specGrp referenced should be
inserted at this point.
points at the specification group which logically belongs here.
contains the intended expansion for the entity documented by a macroSpec element,
enclosed by quotation marks.
(element specification) documents the structure, content, and purpose of a single element type.
(namespace) specifies the namespace to which this element belongs
specifies a default prefix which will be prepended to all patterns
relating to the element, unless otherwise stated.
(class specification) contains reference information for a TEI element class;
that is, a group of elements which appear together in content models, or
which share some common attribute, or both.
indicates whether this is a model class or an attribute class
model
(content model) members of this class appear in the same content models
atts
(attributes) members of this class share common attributes
indicates which alternation and sequence instantiations
of a model class may be referenced. By default, all variations
are permitted.
alternation
members of the class are alternatives
sequence
members of the class are to be provided in sequence
sequenceOptional
members of the class may be provided, in sequence,
but are optional
sequenceOptionalRepeatable
members of the class may be provided one or more
times, in sequence, but are optional.
sequenceRepeatable
members of the class may be provided one or more times, in sequence
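The five variations differ only in how the class members are joined and which occurrence indicator each member carries. The sketch below spells this out using RELAX NG compact syntax operators (| for choice, a comma for sequence, and ?, *, + for occurrence). The names VARIANTS and class_pattern are hypothetical, and the way a real ODD processor names or emits these patterns is not shown here.

```python
# Hypothetical table: variation name -> (occurrence suffix, joiner).
VARIANTS = {
    "alternation": ("", " | "),
    "sequence": ("", ", "),
    "sequenceOptional": ("?", ", "),
    "sequenceOptionalRepeatable": ("*", ", "),
    "sequenceRepeatable": ("+", ", "),
}


def class_pattern(members, variant):
    """Build a RELAX NG compact-syntax pattern body for a model class."""
    if variant not in VARIANTS:
        raise ValueError(f"unknown variation: {variant!r}")
    suffix, joiner = VARIANTS[variant]
    return joiner.join(f"{member}{suffix}" for member in members)
```

With members a, b and c, the sequenceOptional variation yields "a?, b?, c?", while alternation yields "a | b | c".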
(macro specification) documents the function and implementation of a pattern.
indicates which type of entity should be generated, when an ODD
processor is generating a module using XML DTD syntax.
pe
(parameter entity)
dt
(datatype entity)
contains any commentary or discussion about the usage of an element, attribute, class, or
entity not otherwise documented within the containing element.
(list of references) supplies a list of significant references to places where this element is discussed, in the
current document or elsewhere.
groups an example demonstrating the use of an element along with optional paragraphs of
commentary.
specifies all the classes of which the documented element or
class is a member or subclass.
specifies the effect of this declaration on its parent
module.
change
this declaration changes the declaration of the same
name in the current definition
replace
this declaration replaces the declaration of the same
name in the current definition
specifies class membership of the parent element or class.
specifies the identifier for a class of which the documented element or class is a
member or subclass
specifies the effect of this declaration on its parent module.
add
this declaration is added to the current definitions
delete
this declaration and all of its children are removed from the current setup
(equivalent) specifies a component which is considered equivalent to the parent element, either by
co-reference, or by external link.
names the underlying concept of which the parent is a representation
(uniform resource identifier) references the underlying concept of which the parent is a representation by means of
some external identifier
references an external script which contains a method to transform instances of this
element to canonical TEI
(alternate identifier) supplies the recommended XML name for an element, class,
attribute, etc. in some language.
(content model) contains the text of a declaration for the schema
documented.
controls whether or not pattern names generated in the
corresponding RELAX NG schema source are automatically prefixed to
avoid potential name clashes.
true
Each name referenced in e.g. a rng:ref
element within a content model is automatically prefixed by
the value of the prefix attribute on the current
schemaSpec
false
No prefixes are added:
any prefix required by the value of the prefix attribute on the current
schemaSpec must therefore be supplied explicitly, as appropriate.
(constraint rules) the formal rules of a constraint
(constraint on schema) contains a constraint, expressed in some formal syntax,
which cannot be expressed in the structural content model
Rules in the Schematron 1.* language must be inside
a constraint with a value of 'schematron' on the scheme attribute
Rules in the ISO Schematron language must be inside
a constraint with a value of 'isoschematron' on the scheme attribute
supplies the name of the language in which the constraints
are defined
schematron
(Schematron)
isoschematron
(ISO Schematron)
xsl
(XSLT)
private
(private constraint language)
contains documentation for all the attributes associated with this element, as a series of
attDef elements.
(organization) specifies whether all the attributes in the list are available (org="group") or only one
of them (org="choice")
group
grouped
choice
alternated
(attribute definition) contains the definition of a single attribute.
specifies the optionality of an attribute or element.
req
(required)
mwa
(mandatory when applicable)
rec
(recommended)
rwa
(recommended when applicable)
opt
(optional)
(namespace) specifies the namespace to which this attribute belongs
(attribute pointer) points to the definition of an attribute or group of attributes.
the name of the pattern defining the attribute(s)
specifies the declared value for an attribute, by referring to
any datatype defined by the chosen schema language.
(minimum number of occurrences) indicates the minimum number of times this datatype may
occur in the specification of the attribute being defined
(maximum number of occurrences) indicates the maximum number of times this datatype may
occur in the specification of the attribute being defined
unbounded
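The minimum and maximum occurrence counts can be checked against an attribute value once it is split into tokens. The sketch below makes several assumptions that the text above does not state: values are whitespace-separated lists of tokens, both counts default to 1, and the function name occurrence_ok is hypothetical.

```python
def occurrence_ok(value, min_occurs=1, max_occurs=1):
    """Check the number of tokens in value against the occurrence bounds.

    max_occurs is an integer or the string "unbounded".
    """
    count = len(value.split())  # assumption: whitespace-separated tokens
    if count < min_occurs:
        return False
    if max_occurs == "unbounded":
        return True
    return count <= int(max_occurs)
```

A call such as occurrence_ok("a b c", 1, "unbounded") accepts any non-empty list, whereas the default bounds accept exactly one token.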
(default value) specifies the default declared value for an attribute.
(value description) specifies any semantic or syntactic constraint on the value that
an attribute may take, additional to the information carried by the
datatype element.
documents a single attribute-value within a list of possible
or mandatory items.
specifies the attribute value concerned.
(value list) contains one or more valItem elements defining possible values for an attribute.
specifies the extensibility of the list of attribute values specified.
closed
only the values specified are permitted.
semi
(semi-open) all the values specified should be supported, but other values are legal and
software should have appropriate fallback processing for them.
open
the values specified are sample values only.
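The three kinds of value list imply different handling for a value that is not listed. A minimal sketch follows; the function name check_value and the returned labels ("listed", "invalid", "fallback", "accepted") are hypothetical and chosen only for illustration.

```python
def check_value(value, listed, list_type):
    """Classify an attribute value against a closed, semi-open or open list."""
    if list_type not in ("closed", "semi", "open"):
        raise ValueError(f"unknown list type: {list_type!r}")
    if value in listed:
        return "listed"
    if list_type == "closed":
        # Only the values specified are permitted.
        return "invalid"
    if list_type == "semi":
        # Legal, but software should apply its fallback processing.
        return "fallback"
    # Open list: the listed values are samples only.
    return "accepted"
```

The only case that rejects a value outright is an unlisted value checked against a closed list.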
specifies the effect of this declaration on its parent
object.
add
this declaration is added to the current definitions
delete
if present already, the whole of the declaration
for this object is removed from the current setup
change
this declaration changes the declaration of the same
name in the current definition
replace
this declaration replaces the declaration of the same
name in the current definition
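The four modes can be pictured as operations on a table of current definitions keyed by name. This is a simplified sketch under stated assumptions: declarations are modelled as flat dictionaries, "change" overrides only the fields supplied, and "change" or "replace" on a name that does not exist raises an error. The function name apply_declaration is hypothetical, and a real ODD processor merges declarations in more detail than this.

```python
def apply_declaration(definitions, name, declaration, mode):
    """Return a new definitions table with one declaration applied."""
    result = dict(definitions)
    if mode == "add":
        result[name] = dict(declaration)
    elif mode == "delete":
        # "if present already": deleting an absent name is a no-op.
        result.pop(name, None)
    elif mode in ("change", "replace"):
        if name not in result:
            # Assumption: there must be a declaration of the same name.
            raise KeyError(name)
        if mode == "replace":
            result[name] = dict(declaration)
        else:
            result[name] = {**result[name], **declaration}
    else:
        raise ValueError(f"unknown mode: {mode!r}")
    return result
```

The difference between the last two modes shows up when a declaration supplies only some fields: "change" keeps the remaining fields of the existing definition, while "replace" discards them.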
Supplies the identifier by which this element may be referenced.
Says whether this object should be predeclared in the
tei infrastructure module.
Supplies a name for the module in which this object is to
be declared.
indicates the current status of the object identified with
respect to the current version of the TEI Guidelines.
deprecated
the item is not recommended for use,
and may be withdrawn at a future release.
unstable
the item is new and still under
review.
changed
the item has changed significantly since the
preceding version.
stable
the item has not recently changed and is
not expected to do so except for correction of any errors.
(feature system declaration) provides a feature system declaration comprising one or more
feature structure declarations or feature structure declaration links.
(feature structure declaration) declares one type of feature structure.
gives a name for the type of feature structure being declared.
gives the name of one or more typed feature structures
from which this type inherits feature specifications and
constraints;
if this type includes a feature specification
with the same name as that of any of those specified by this
attribute, or if more than one specification of the same name
is inherited, then the set of possible values is defined by
unification. Similarly, the set of constraints applicable is
derived by combining those specified explicitly within this
element with those implied by the baseTypes
attribute. When no baseTypes attribute is specified, no
feature specification or constraint is inherited.
(feature system description (in FSD)) describes in prose what is represented by the type of feature
structure declared in the enclosing fsDecl.
(feature structure declaration link) associates the name of a typed feature structure with a feature
structure declaration for it.
identifies the type of feature structure to be documented;
this will be the value of the type attribute on at least one
feature structure.
supplies a pointer to a feature structure declaration
(fsDecl) element within the current document or elsewhere.
(feature declaration) declares a single feature, specifying its name, organization,
range of allowed values, and optionally its default value.
indicates the name of the feature being declared; matches the
name attribute of f elements in the text.
indicates whether or not the value of this feature may
be present.
(feature description (in FSD)) describes in prose what is represented by the feature being
declared and its values.
(value range) defines the range of allowed values for a feature, in the form of
an fs, vAlt, or primitive value;
for the value of an f to be valid, it must be
subsumed by the specified range; if the f
contains multiple values (as sanctioned by the org attribute),
then each value must be subsumed by the vRange.
(value default) declares the default value to be supplied when a feature structure
does not contain an instance of f for this name; if
unconditional, it is specified as one or (depending on the value of
the org attribute of the enclosing fDecl) more
fs elements or primitive values;
if conditional, it is specified as
one or more if elements; if no default is specified, or no
condition matches, the value none is assumed.
defines a conditional default value for a feature; the condition
is specified as a feature structure, and is met if it
subsumes the feature structure in the text for which a
default value is sought.
separates the condition from the default in an if, or
the antecedent and the consequent in a cond element.
(feature-structure constraints) specifies constraints on the content of valid feature
structures.
(conditional feature-structure constraint) defines a conditional feature-structure constraint; the consequent
and the antecedent are specified as feature structures or
feature-structure collections; the constraint is satisfied if both the
antecedent and the consequent subsume a given feature
structure, or if the antecedent does not.
(bi-conditional feature-structure constraint) defines a biconditional feature-structure constraint; both
consequent and antecedent are specified as feature structures or groups
of feature structures; the constraint is satisfied if both
subsume a given feature structure, or if both do not.
(if and only if) separates the condition from the consequence in a bicond
element.
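The satisfaction rules for conditional and biconditional constraints can be stated compactly once subsumption is defined. The sketch below models feature structures as nested Python dictionaries and treats one structure as subsuming another when every feature it specifies is present in the other with a subsumed value. That modelling, and the function names subsumes, cond_satisfied and bicond_satisfied, are assumptions made for illustration.

```python
def subsumes(general, specific):
    """True if every feature in general appears in specific with a subsumed value."""
    if isinstance(general, dict):
        if not isinstance(specific, dict):
            return False
        return all(
            name in specific and subsumes(value, specific[name])
            for name, value in general.items()
        )
    # Primitive values subsume only equal values in this simplified model.
    return general == specific


def cond_satisfied(antecedent, consequent, fs):
    """Satisfied if both subsume fs, or if the antecedent does not."""
    if not subsumes(antecedent, fs):
        return True
    return subsumes(consequent, fs)


def bicond_satisfied(antecedent, consequent, fs):
    """Satisfied if both subsume fs, or if both do not."""
    return subsumes(antecedent, fs) == subsumes(consequent, fs)
```

For a structure {"cat": "noun", "num": "sg"}, a conditional with antecedent {"cat": "verb"} is satisfied whatever its consequent, because the antecedent does not apply.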
(feature structure) represents a feature structure, that is, a
collection of feature-value pairs organized as a
structural unit.
specifies the type of the feature structure.
(features) references the feature-value specifications making up this feature structure.
(feature) represents a feature value specification, that
is, the association of a name with a value of any of several different types.
A feature value cannot
contain both text and element content
A feature value can contain
only one child element
provides a name for the feature.
(feature value) references any element which can be used to represent the
value of a feature.
(binary value) represents the value part of a feature-value specification which can contain either
of exactly two possible values.
supplies a binary value.
(symbolic value) represents the value part of a feature-value specification
which contains one of a finite list of symbols.
supplies the symbolic value for the feature, one of a finite list that
may be specified in a feature declaration.
(numeric value) represents the value part of a feature-value specification
which contains a numeric value or range.
supplies a lower bound for the numeric value represented,
and also (if max is not supplied) its upper bound.
supplies an upper bound for the numeric value represented.
specifies whether the value represented should be
truncated to give an integer value.
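The interplay of the lower bound, the optional upper bound, and truncation can be sketched as follows. The function name numeric_bounds and its parameter names are illustrative, and two details are assumptions: truncation is taken to mean rounding toward zero, and an upper bound below the lower bound is rejected.

```python
import math


def numeric_bounds(value, max_value=None, trunc=False):
    """Return the (lower, upper) bounds represented by a numeric value."""
    lower = value
    # Without an explicit maximum, the lower bound is also the upper bound.
    upper = value if max_value is None else max_value
    if upper < lower:
        raise ValueError("upper bound is below lower bound")
    if trunc:
        # Assumption: truncation discards the fractional part (toward zero).
        lower, upper = math.trunc(lower), math.trunc(upper)
    return lower, upper
```

A single value therefore denotes a degenerate range, and a range is only produced when an upper bound is supplied.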
(string value) represents the value part of a feature-value specification
which contains a string.
(value label) represents the value part of a feature-value specification
which appears at more than one point in a feature structure.
supplies a name for the sharing point.
(collection of values) represents the value part of a feature-value specification
which contains multiple values organized as a set, bag, or list.
(organization) indicates organization of given value or values as set, bag or list.
set
indicates that the given values are organized as a set.
bag
indicates that the given values are organized as a
bag (multiset).
list
indicates that the given values are organized as a
list.
(default feature value) represents the value part of a feature-value specification
which contains a defaulted value.
(value alternation) represents the value part of a feature-value specification
which contains a set of values, only one of which can be valid.
(value negation) represents a feature value which is the negation of its content.
(merged collection of values) represents a feature value which is the result of merging
together the feature values contained by its children, using the organization
specified by the org attribute.
indicates the organization of the resulting merged values as set, bag or list.
set
indicates that the resulting values are organized as a set.
bag
indicates that the resulting values are organized as a bag (multiset).
list
indicates that the resulting values are organized as a list.
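The three organizations differ in whether order and repetition survive the merge. A minimal sketch, assuming each child contributes a sequence of hashable values: a list keeps order and duplicates, a bag keeps duplicates but not order, and a set keeps neither. The function name merge_values is hypothetical.

```python
from collections import Counter


def merge_values(children, org):
    """Merge the values of several collections under the given organization."""
    flat = [value for child in children for value in child]
    if org == "list":
        return flat            # order and duplicates preserved
    if org == "bag":
        return Counter(flat)   # multiset: counts preserved, order ignored
    if org == "set":
        return set(flat)       # duplicates and order ignored
    raise ValueError(f"unknown organization: {org!r}")
```

Merging [1, 2] with [2, 3] thus gives four items as a list or bag, but only three as a set.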
(feature library) assembles a library of feature elements.
(feature-value library) assembles a library of reusable feature value elements
(including complete feature structures).
(anonymous block) contains any arbitrary component-level unit of text, acting as an anonymous container for phrase or inter-level elements analogous to, but without the semantic baggage of, a paragraph.
specifies whether or not the block is complete.
Y
(yes) the block is incomplete
N
(no) either the block is complete, or no claim is made as to its completeness
I
(initial) the initial part of an incomplete block
M
(medial) a medial part of an incomplete block
F
(final) the final part of an incomplete block
(arbitrary segment) represents any segmentation of text below the chunk level.
(corresponds) points to elements that correspond to the current
element in some way.
(synchronous) points to elements that are synchronous with the current
element.
points to an element that is the same as the current
element.
points to an element of which the current element is a
copy.
points to the next element of a virtual aggregate of which
the current element is part.
(previous) points to the previous element of a virtual aggregate of
which the current element is part.
points to elements that are in exclusive alternation
with the current element.
selects one or more alternants; if one alternant is
selected, the ambiguity or uncertainty is marked as resolved. If
more than one alternant is selected, the degree of ambiguity or
uncertainty is marked as reduced by the number of alternants not
selected.
Contains feature and feature-value libraries.
The topic of spoken conversation. It might seem that
textDesc/domain could be used for this purpose, but, alas!,
inclusion of textDesc makes it necessary to use all of its 8
subelements (channel, constitution, derivation, etc.), while we
only need something like domain...
The usual xml:lang attribute.