Document Classifications (//profileDesc)

The Profile Description contains information on the languages in which the text is mainly written (<langUsage>/<language ident="[code according to ISO 639-3]">, → ISO 639-3), as well as classifications of the texts according to text type, subject and origin of the DTA full text (<textClass>).

Multiple classifications are possible for classifying texts and topics. The descriptors used in each case are displayed in the <classCode> element. The attribute @scheme in <classCode> contains a reference to the underlying classification scheme, e. g. the Classification Scheme of the DTA Core Corpus Texts.

In order to group the digitized full-text editions published in the DTA into sub-corpora according to their origin, the metadata contains information on the context in which the full-text was created and, as a result of this, on the recording method (e. g. china, ocr, mts, aedit, gutenberg, wikisource). The corresponding specification is made in a <classCode> element, again with a reference to the underlying classification scheme (here: the Classification Scheme of the DTA Core Corpus Texts) in the @scheme attribute.

<profileDesc>
  <langUsage>
  <language ident="[code according to ISO 639-3, e. g. deu]">[language, e. g. German]</language>
  </langUsage>
  <textClass>
  <classCode scheme="[URL DTA main classification]">[text type acc. to DTA main classification]</classCode>
  <classCode scheme="[URL DTA subclassification]">[text type acc. to DTA subclassification]</classCode>
  <classCode scheme="[URL DWDS main classification01]">[text type acc. to DWDS main classification 1]</classCode>
  <classCode scheme="[URL DWDS subclassification01]">[text type acc. to DWDS subclassification 1]</classCode>
  <classCode scheme="[URL DWDS main classification02]">[text type acc. to DWDS main classification 2]</classCode>
  <classCode scheme="[URL DWDS-subclassification02]">[text type acc. to DWDS subclassification 2]</classCode>
  <classCode scheme="[URL classification text origin]">[origin of the DTA full text]</classCode>
  </textClass>
</profileDesc>