Public Corpora
Historical Corpora
- dta: Deutsches Textarchiv (1600-1900)
- dingler: Polytechnisches Journal (1820-1931)
- DSDK: Digitale Sammlung Deutscher Kolonialismus (1884-1919)
- grenzboten: Die Grenzboten (1841-1922)
- rem: Referenzkorpus Mittelhochdeutsch (1050–1350)
Newspaper Corpora
- bz: Berliner Zeitung (1994-2005)
- tagesspiegel: Tagesspiegel (1996-2004)
- zeit: ZEIT (1946-2018)
Synchronic Corpora
- blogs: Blogs (2003-2014)
- ddr: DWDS DDR-Korpus (1949-1990)
- kern: DWDS Kernkorpus (1900-1999)
- korpus21: DWDS Kernkorpus-21 (2000-2010)
- politische_reden: Political Speeches (1984-2017)
- untertitel: Film Subtitles (1916-2014)
Non-German Corpora
- apwcf: APWCF (fr, 1644-1647; compiled by A. Gerstenberg)
- bnc: British National Corpus, XML edition (en, 1947-1994; courtesy of the Oxford Text Archive)
- nhess: NHESS (en, 2001-2016; compiled by S. Blau)
- rsc: Royal Society Corpus (en, 1665-1869; based on sources provided by the Universität des Saarlandes)
Restricted Corpora
CLARIN Corpora (*)
* non-public: authentication via CLARIN credentials required- bz_pp: Berliner Zeitung (1945-1993; DDR-Presseportal)
- nd: Neues Deutschland (1946-1990; DDR-Presseportal)
- nz: Neue Zeit (1945-1994; DDR-Presseportal)
DWDS Corpora (**)
** non-public: authentication via www.dwds.de credentials required- ibk_dchat: Dortmund Chat Corpus (1998-2006)
- ibk_web_2016c: Webcorpus 2016c (2001-2016)
- textberg: Jahrbuch des Schweizer-Alpenclubs (1864–2015; Academic use only)
- ... see https://www.dwds.de/r/ for an up-to-date list of all DiaCollo instances currently hosted by the DWDS project at the BBAW. Click on the DiaCollo icon () in the "Tools" column to access the DiaCollo GUI for a particular corpus.