DTA-Basisformat – Overview (elements within text)

Language: deu, eng · document date: Mon May 14 15:38:41 2012

The following table provides an overview of all elements contained by the DTA-Guidelines. For each element a list of all possible attributes and their corresponding values (if invariable) is given. Each element belongs to a level, which specifies the necessity of its usage in accordance with the DTA-Guidelines. Furthermore, elements are assigned to functional categories, which specify the contexts of their usage. Attributes of the DTA-Guidelines, which can be used with most of the DTA-elements (universal attributes), are listed below the following table. For further documentation of the necessity levels and the functional categories see the end of this page.



Element Description Attributes/Notes Functional Category Level
ab  [doc]anonymous block/containeruse div, floatingText, or p insteadtextStructure
4 – proscribed
abbr  [doc]abbreviationeditorial
phraseStructure
3 – optional
actor  [doc]actor name (cast list)drama
phraseStructure
1 – required
argument  [doc]content summarytextStructure
2 – recommended
back  [doc]back matterdocumentStructure
1 – required
bibl  [doc]bibliographic citationcitations
textStructure
2 – recommended
body  [doc]text bodydocumentStructure
1 – required
byline  [doc]primary statement of responsibilitytextStructure
2 – recommended
c  [doc]a characterwill be annotated stand-off within the DTA corpusphraseStructure
3 – optional
castGroup  [doc]grouping of roles (drama)drama
textStructure
1 – required
castItem  [doc]description of a role (drama)drama
textStructure
1 – required
castList  [doc]dramatis personae (drama)drama
textStructure
1 – required
cb  [doc]column break
@ncolumn number
textStructure
1 – required
cell  [doc]table cell
@colsnumber of colums occupied
@rowsnumber of rows occupied
tables
1 – required
choice  [doc]choice of transcription/encodingeditorial
2 – recommended
cit  [doc]citationcitations
textstructure
2 – recommended
closer  [doc]last block of text (letter)letter
textStructure
2 – recommended
corr  [doc](in case of correction:) corrected formeditorial
2 – recommended
date  [doc]a date in any format
@whendate in ISO 8601 format
phraseStructure
3 – optional
dateline  [doc]place and date of formulation (letter)letter
textStructure
2 – recommended
div  [doc]a subdivision of a text
@ndepth of text structure
@typetype of text division ["act", "advertisement", "appendix", "bibliography", "chapter", "content", "copyright", "corrigenda", "dedication", "diaryEntry", "dramatisPersonae", "epigraph", "figures", "frontispiece", "halftitle", "imprint", "index", "letter", "listOfFigures", "poem", "preface", "scene"]
documentStructure
textStructure
1 – required
div1  [doc]level-1 text divisionuse <div n="1"> insteaddocumentStructure
textStructure
4 – proscribed
div2  [doc]level-2 text divisionuse <div n="2"> insteaddocumentStructure
textStructure
4 – proscribed
div3  [doc]level-3 text divisionuse <div n="3"> insteaddocumentStructure
textStructure
4 – proscribed
div4  [doc]level-4 text divisionuse <div n="4"> insteaddocumentStructure
textStructure
4 – proscribed
div5  [doc]level-5 text divisionuse <div n="5"> insteaddocumentStructure
textStructure
4 – proscribed
div6  [doc]level-6 text divisionuse <div n="6"> insteaddocumentStructure
textStructure
4 – proscribed
div7  [doc]level-7 text divisionuse <div n="7"> insteaddocumentStructure
textStructure
4 – proscribed
docAuthor  [doc]document authortitlepage
2 – recommended
docDate  [doc]document publication datetitlepage
2 – recommended
docEdition  [doc]document editiontitlepage
2 – recommended
docImprint  [doc]document imprinttitlepage
2 – recommended
docTitle  [doc]document titletitlepage
2 – recommended
epigraph  [doc]epigraphcitations
textStructure
2 – recommended
expan  [doc]expansion of an abbreviationeditorial
3 – optional
figure  [doc]illustration/figurefloats
1 – required
floatingText  [doc]interrupting text after which the surrounding text resumesfloats
2 – recommended
foreign  [doc]foreign-language phrase
@xml:langlanguage code (ISO 639-3)
phraseStructure
3 – optional
formula  [doc]formula
@notationtype of notation ["TeX"]
floats
phraseStructure
1 – required
front  [doc]front matterdocumentStructure
1 – required
fw  [doc]forme work
@placeposition ["bottom", "top"]
@typetype of element ["catch", "header", "sig"]
floats
2 – recommended
g  [doc]non-standard character or glyphuse U+FFFC insteadphraseStructure
4 – proscribed
gap  [doc]material omitted in the text
@quantityamount of text missing
@reasonreason for the omission ["fm", "illegible", "insignificant"]
@unitunit used for measuring the amount of text missing ["chars", "lines", "pages", "words"]
editorial
1 – required
head  [doc]headingtextStructure
1 – required
hi  [doc]highlighting
@renditionthe medium for highlighting
phraseStructure
2 – recommended
imprimatur  [doc]imprimaturtextStructure
2 – recommended
item  [doc]list itemtextStructure
1 – required
l  [doc]line or verse
@nverse number (in the textsource)
textStructure
verse
1 – required
label  [doc]introduction of a postscript; should be transcribed within postscript insteadletter
phraseStructure
4 – proscribed
lb  [doc]line break
@nprinted line number
textStructure
2 – recommended
lg  [doc]group of verse lines
@n(for stanzas:) number of stanza
@typetype of verse group ["poem"]
textStructure
verse
1 – required
list  [doc]listfloats
textStructure
1 – required
listBibl  [doc]citation listtextStructure
3 – optional
milestone  [doc]text/division separator
@unitinterruption, only used for vertical lines ["section"]
textStructure
2 – recommended
name  [doc]proper noun (except persons, places, and organizations)
@typetype of proper noun
phraseStructure
3 – optional
note  [doc]note (foot, end, marginal)
@nnote sign/number
@placeposition ["bottom", "end", "left", "right"]
floats
1 – required
opener  [doc]introduction of a letterletter
textStructure
2 – recommended
orgName  [doc]name (organization)phraseStructure
3 – optional
orig  [doc](in case of normalization:) original spellingeditorial
phraseStructure
2 – recommended
p  [doc]paragraphtextStructure
1 – required
pb  [doc]page break
@facslink to facsimile
@npage number
documentStructure
1 – required
persName  [doc]name (person)phraseStructure
3 – optional
placeName  [doc]name (place)phraseStructure
3 – optional
postscript  [doc]postscript (letter)
@n(in case of more than one postscripts:) number
letter
textStructure
2 – recommended
publisher  [doc]publishertitlepage
2 – recommended
q  [doc]direct speechphraseStructure
3 – optional
quote  [doc](in citation:) quoted textcitations
phraseStructure
2 – recommended
ref  [doc]referencephraseStructure
3 – optional
reg  [doc](in case of normalization:) regularized formeditorial
2 – recommended
role  [doc]role name
@xml:idid for a particular role (drama)
drama
phraseStructure
2 – recommended
roleDesc  [doc]role description (drama)drama
phraseStructure
2 – recommended
row  [doc]table rowtables
1 – required
s  [doc]a sentencewill be annotated stand-off within the DTA corpusphraseStructure
3 – optional
salute  [doc]salutationletter
textStructure
2 – recommended
seg  [doc]arbitrary segmentuse @corresp, @next, and @prev insteaddocumentStructure
textStructure
4 – proscribed
sic  [doc](in case of correction:) reproduced, incorrect formeditorial
phraseStructure
2 – recommended
signed  [doc]signature (letter)letter
textStructure
2 – recommended
sp  [doc]speech act
@whospeaker's id (as assigned to the role)
drama
textStructure
1 – required
space  [doc]significant space
@dimdimension ["horizontal", "vertical"]
@quantityamount of space concerned
@unitunit measuring the amount of space concerned ["chars", "lines", "words"]
textStructure
2 – recommended
speaker  [doc]speaker of a speech (drama)drama
1 – required
stage  [doc]stage directiondrama
1 – required
supplied  [doc]text supplied by the editoreditorial
1 – required
table  [doc]tablefloats
1 – required
text  [doc]textdocumentStructure
1 – required
titlePage  [doc]a title page within the front matter
@typetype of title page ["halftitle", "main", "series", "volume"]
documentStructure
1 – required
titlePart  [doc]part of a title
@typetype of title part ["copyright", "dedication", "desc", "description", "halftitle", "imprimatur", "main", "price", "privilegium", "series", "sub", "volume"]
titlepage
1 – required
trailer  [doc]closing title/footer at the end of a division of a texttextStructure
2 – recommended
w  [doc]a wordwill be annotated stand-off within the DTA corpusphraseStructure
3 – optional


Restriction levels

Level 1Elements, that have to be considered to fullfill the DTA-guidelines. These elements are used consequently in the DTA-corpus.
Level 2Elements included in the DTA-guidelines, that may be ignored in text annotation. They are used in all texts of the DTA-corpus.
Level 3Elements, which are not part of the DTA-guidelines, but that are compatible with the DTA-schema. They are not consequently used in the texts of the DTA-corpus.
Level 4Elements, which were explicitly excluded from the DTA-guidelines. They should therefore be avoided in favor of the solutions offered in the DTA-guidelines.

Functional categories

citations

Elements describing citations.

show elements within this category

documentStructure

Elements describung the structure of the document.

show elements within this category

drama

Elements describing text units within dramas.

show elements within this category

editorial

Elements describing editorial interventions.

show elements within this category

floats

Elements describing blocks or structures, which interrupt the running text at document or element level.

show elements within this category

letter

Elements describing text units within letters.

show elements within this category

phraseStructure

Elements describing the semantics or appearance of single words or phrases.

show elements within this category

tables

Elements describing the structure of tables.

show elements within this category

textStructure

Elements describing the semantics or appearance of text units within the document.

show elements within this category

titlepage

Elements describing text units occuring at a work's title page.

show elements within this category

verse

Elements describing verse specific text units.

show elements within this category

Generic attributes

@corresppoints to elements that correspond to the current element in some way ["xml:id"]
@nextpoints to the next element of a virtual aggregate of which the current element is part ["xml:id"]
@prevpoints to the previous element of a virtual aggregate of which the current element is part ["xml:id"]
@renddescribes the way of highlighting of a string
@renditionthe way of highlighting of a block ["#above-cap", "#b", "#blue", "#g", "#i", "#in", "#inline", "#k", "#red", "#right", "#sub", "#u", "#up", "#uu"]
@xml:idID of the element
@xml:langlanguage code (ISO 639-3)