Publications on the DTA Basic Format

  • On the DTABf:

    Susanne Haaf, Alexander Geyken, Frank Wiegand: The DTA “Base Format”: A TEI Subset for the Compilation of a Large Reference Corpus of Printed Text from Multiple Sources. In: Journal of the Text Encoding Initiative 8, 2014/15. Online version, DOI: 10.4000/jtei.1114.

    Alexander Geyken, Susanne Haaf, Frank Wiegand: The DTA ‘base format’: A TEI-Subset for the Compilation of Interoperable Corpora. In: 11th Conference on Natural Language Processing (KONVENS) – Empirical Methods in Natural Language Processing, Proceedings of the Conference. Edited by Jeremy Jancsary. Wien, 2012 (= Schriftenreihe der Österreichischen Gesellschaft für Artificial Intelligence 5). Online version.
  • On the DTABf for newspapers:

    Susanne Haaf, Matthias Schulz: Historical Newspapers & Journals for the DTA. In: Language Resources and Technologies for Processing and Linking Historical Documents and Archives – Deploying Linked Open Data in Cultural Heritage – LRT4HDA. Proceedings of the workshop, held at the Ninth International Conference on Language Resources and Evaluation (LREC'14), May 26–31, 2014, Reykjavik (Iceland), p. 50–54. Online version.
  • On the DTABf for manuscripts:

    Susanne Haaf, Christian Thomas: Introducing the DTABf-M: A Manuscript-specific Extension to the DTA ›Base Format‹ (DTABf). [In review; submitted for jTEI 10]
  • On the analysis options in corpora based on TEI(DTABf) tagging:

    Susanne Haaf: Corpus Analysis based on Structural Phenomena in Texts: Exploiting TEI Encoding for Linguistic Research. LREC 2016. [Forthcoming, intended for the "Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC' 16)".]