Bryan Jurish / Publications
2019
-
Jurish, B. and M. Nieländer.
"Using DiaCollo for historical research."
In
Proceedings of the
CLARIN Annual Conference 2019,
Leipzig, Germany, 30th September - 2nd October,
2019.
(pdf:paper,
pdf:slides,
bib)
-
Wynne, M. and B. Jurish.
"Natural Language Processing for Historical Documents."
Poster presentation of the eponymous CLARIN Workshop (9-11 September, Berlin, Germany),
presented at the
CLARIN Annual Conference 2019,
Leipzig, Germany, 30th September - 2nd October,
2019.
(pdf:poster,
CLARIN Blog,
Zentrum Sprache Blog)
-
Jurish, B.
"Exploring diachronic collocations with DiaCollo."
Talk presented at
the Julius-Maximilians-Universität Würzburg,
6th July, 2019.
(online resources,
pdf:slides,
workshop notes,
"Steckbriefe" notes)
-
Jurish, B.
"DTA::CAB - a Field Spotter's Guide."
Talk and afternoon session
for the workshop Information Extraction aus frühneuhochdeutschen Texten
at the
Universität Graz, Zentrum für Informationsmodellierung,
19th March, 2019.
(pdf:slides,
workshop materials)
2018
-
Jurish, B.
"Diachronic Collocations, Genre, and DiaCollo."
In R. J. Whitt (editor),
Diachronic Corpora, Genre, and Language Change.
Amsterdam, John Benjamins, 2018, pages 42–64.
(online,
pdf:draft,
bib)
-
Geyken, A., M. Boenig, S. Haaf, B. Jurish, C. Thomas, and F. Wiegand.
"Das Deutsche Textarchiv als Forschungsplattform für historische Daten in CLARIN."
In: H. Lobin, R. Schneider and A. Witt (editors),
Digitale Infrastrukturen für die germanistische Forschung,
volume 6 of Germanistische Sprachwissenschaft um 2020.
Berlin/Boston, 2018, pages 219–248.
DOI: 10.1515/9783110538663-011
(online,
epub,
pdf,
pdf:local)
2017
-
Jurish, B.,
M. Nieländer,
and T. Werneke.
"DiaCollo and die Grenzboten."
Talk presented at the conference
Genealogies of Knowledge I:
Translating Political and Scientific Thought across Time and Space,
University of Manchester, 7th-9th December, 2017.
(pdf:abstract, pdf:slides)
-
Jurish, B.
and T. Werneke.
"Visualizing Semantic Change with DiaCollo."
Talk presented at the
20th International Conference on Conceptual History: Concepts in the World: Politics, Knowledge, and Time,
University of Oslo, 23rd September, 2017.
(pdf:slides)
-
Geyken, A.,
A. Barbaresi,
J. Didakowski,
B. Jurish,
F. Wiegand,
and L. Lemnitzer.
"Die Korpusplattform des 'Digitalen Wörterbuchs der deutschen Sprache' (DWDS)."
Zeitschrift für germanistische Linguistik, 45(2):327-344, 2017.
(epub,
bib)
-
Jurish, B.
"Some remarks on text data visualization and codec
transparency."
Talk presented at
Visualisierungsprozesse in den Humanities
Linguistische Perspektiven auf Prägungen, Praktiken, Positionen
(VisuHu 2017),
Zürich, 17th-19th July, 2017.
(pdf:abstract,
pdf:slides)
-
Jurish, B.
"Exploring diachronic collocations with DiaCollo."
Talk presented at
the Göttingen Centre for Digital Humanities (19th June);
the Universität Potsdam, Institut für Linguistik (23rd June);
and the Freie Universität Berlin, Institut für Romanische Philologie (28th June);
2017.
(online resources,
pdf:slides:Göttingen,
pdf:slides:Potsdam,
pdf:slides:FU-Berlin)
2016
-
Jurish, B.
"Tools, Toys, and Filters."
In: Rechtsgeschichte - Legal History Rg 24,
pages 347-348, 2016.
(online,
pdf:epub,
pdf:local,
pdf:draft,
bib)
-
Jurish, B., A. Maletti, U. Springmann, and K.-M. Würzner (eds).
Proceedings of the SIGFSM Workshop on Statistical NLP and Weighted Automata.
Workshop held at the 54th Annual Meeting of the Association for Computational Linguistics,
Berlin, Germany, 12th August, 2016.
(epub)
-
Hinrichs, E., B. Jurish, A. Geyken, and L. Lemnitzer.
"Searching Linguistic Patterns in Large Text Corpora for Digital Humanities Research."
Workshop taught at the
7th European Summer University in Digital Humanities, "Culture & Technology",
Leipzig, 19th-23rd July, 2016.
(pdf:slides:bio,
pdf:slides:cab,
pdf:slides:diacollo)
-
Jurish, B. "DiaCollo."
Talk presented at the
Centre for Corpus Research,
University of Birmingham,
28th June, 2016.
(pdf:slides)
-
Jurish, B. "DWDS Wortverläufe."
Talk presented at the
Berlin-Brandenburgische Akademie der Wissenschaften,
31st May, 2016.
(pdf:slides)
-
Jurish, B.
"Diachronic Collocations and Genre: a case for DiaCollo?"
In R. J. Whitt (editor),
Diachronic Corpora, Genre, and Language Change: Book of Abstracts,
Nottingham, UK, 8th-9th April,
pages 22-24,
2016.
(pdf:epub,
pdf:draft,
pdf:slides,
bib)
-
Jurish, B., A. Geyken, and T. Werneke.
"DiaCollo: diachronen Kollokationen auf der Spur."
In DHd 2016: Modellierung - Vernetzung - Visualisierung,
Leipzig, 7th-12th March,
pages 172-175,
2016.
(pdf:revised-draft,
pdf:epub [contains typesetting errors],
slides,
bib)
-
Jurish, B.
"Visualisierung diachroner Kollokationen mit DiaCollo."
Talk presented at the workshop
Die geisteswissenschaftliche Perspektive: Welche Forschungsergebnisse lassen Digital Humanities erwarten?,
Akademie der Wissenschaften und der Literatur, Mainz,
19th February, 2016.
(slides)
-
Jurish, B. and T. Werneke.
"Computergestützte Analyse von Kollokationen im diachronen Verlauf."
Talk presented at the workshop
Digitale Geschichtswissenschaft - neue Tools für neue Fragen?
of the CLARIN-D Working Groups "Neuere Geschichte" and "Zeitgeschichte",
Berlin-Brandenburgische Akademie der Wissenschaften,
8th February, 2016.
(slides)
2015
- Geyken, A. and B. Jurish.
"Neue Entwicklungen und Wege bei der Erstellung, Erweiterung und Nutzung von Korpora
am Zentrum Sprache."
Talk presented at the
KobRA workshop
Neue Wege in der Nutzung von Korpora - Data-Mining für die textorientierten Geisteswissenschaften,
Berlin-Brandenburgische Akademie der Wissenschaften,
30th October, 2015.
(slides)
- Jurish, B.
"DiaCollo: On the trail of diachronic collocations."
In K. De Smedt (editor),
Proceedings of the
CLARIN Annual Conference 2015,
Wrocław, Poland, 15th-17th October,
pages 28-31,
2015.
(pdf:paper,
pdf:poster,
bib)
- Würzner, K.-M. & B. Jurish.
"Dsolve - Morphological segmentation for German using conditional random fields."
In C. Mahlow and M. Piotrowski (editors),
Systems and Frameworks for Computational Morphology
(Proceedings of the Fourth International Workshop SFCM 2015, Stuttgart, Germany, 17-18 September, 2015),
volume 537 of Communications in Computer and Information Science (CCIS),
Springer,
pages 94-103, 2015.
(epub,
pdf:draft,
bib,
slides)
- Jurish, B.
"DiaCollo: ein interaktives Werkzeug zur Extraktion und Exploration diachroner Kollokationen."
Talk presented at the workshop
Historische Semantik und Semantic Web
of the AG "Elektronisches Publizieren",
Union der deutschen Akademien der Wissenschaften,
Heidelberg, Germany, 14th - 16th September, 2015.
(slides)
- Würzner, K.-M. & B. Jurish.
"A hybrid approach to grapheme-phoneme conversion."
In Proceedings of the
12th International Conference on Finite State Methods and Natural Language Processing
(Düsseldorf, Germany, 22nd - 24th June, 2015),
2015.
(pdf,
pdf:local,
bib:local,
slides)
- Jurish, B. & H. Ast.
"Using an alignment-based lexicon for canonicalization of historical text."
In J. Gippert & R. Gehrke (editors),
Historical Corpora: Challenges and Perspectives,
volume 5 of
Corpus Linguistics and Interdisciplinary Perspectives on Language (CLIP),
pages 197-208. Narr, Tübingen, 2015.
(pdf:draft,
bib,
slides,
corpus)
2014
-
Jurish, B., C. Thomas, & F. Wiegand.
"Querying the Deutsches Textarchiv."
In U. Kruschwitz, F. Hopfgartner, & C. Gurrin (editors),
Proceedings of the Workshop
MindTheGap 2014: Beyond Single-Shot Text Queries: Bridging the Gap(s) between Research Communities
(co-located with iConference 2014, Berlin, Germany, 4th March, 2014),
pages 25-30, 2014.
(pdf,
bib,
pdf:local,
slides)
-
Jurish, B.
"Semantics, similarity, and corpus search in the Deutsches Textarchiv."
Talk presented at the
2nd DTA- & CLARIN-D Conference und Workshop
Textkorpora in Infrastrukturen für die Geistes- und Sozialwissenschaften,
Berlin-Brandenburgische Akademie der Wissenschaften,
17th - 18th November, 2014.
(slides)
-
Haaf, S. & B. Jurish.
"Die Vielfalt vereinen: Die CLARIN-Eingangsformate CMDI und TCF."
Talk presented with S. Haaf at the
2nd DTA- & CLARIN-D Conference und Workshop
Textkorpora in Infrastrukturen für die Geistes- und Sozialwissenschaften,
Berlin-Brandenburgische Akademie der Wissenschaften,
17th - 18th November, 2014.
2013
-
Jurish, B., K.-M. Würzner, M. Ermakova, & S. Arana.
"Canonicalization techniques for computer-mediated communication."
Talk presented by K.-M. Würzner at the workshop
Verarbeitung und Annotation von Sprachdaten aus Genres internetbasierter Kommunikation
at the GSCL Conference 2013 in Darmstadt, 23rd September 2013.
(slides)
-
Jurish, B., & K.-M. Würzner.
"Word and Sentence Tokenization with Hidden Markov Models."
Journal for Language Technology and Computational Linguistics,
28(2):61-83,
2013.
(pdf,
pdf:local,
pdf:draft,
bib)
- Jurish, B., & K.-M. Würzner.
"Multi-threaded composition of finite-state automata."
In Proceedings of the
11th International Conference on Finite State Methods and Natural Language Processing
(St Andrews, Scotland, 15th - 17th July, 2013),
pages 157-161, 2013.
(pdf,
bib,
slides,
pdf:local,
bib:local)
-
Jurish, B.
" 'Elchen, Elektroskapiefken, vn Andrés Kopfweh'
or: When Canonicalization Algorithms Attack."
Talk presented at the Ruhr-Universität Bochum, 28th May 2013.
(slides)
- Jurish, B., M. Drotschmann, & H. Ast.
"Constructing a canonicalized corpus of historical German by text alignment."
In P. Bennett, M. Durrell, S. Scheible, and R. J. Whitt (editors),
New Methods in Historical Corpora,
volume 3 of
Corpus Linguistics and Interdisciplinary Perspectives on Language (CLIP),
pages 221-234. Narr, Tübingen, 2013.
(pdf:draft,
bib,
slides,
corpus)
- Jurish, B.
"Canonicalizing the Deutsches Textarchiv."
In I. Hafemann (ed.),
Perspektiven einer corpusbasierten historischen Linguistik und Philologie
(Berlin, Germany, 12th - 13th December 2011),
volume 4 of Thesaurus Linguae Aegyptiae,
Berlin-Brandenburgische Akademie der Wissenschaften,
2013.
(pdf,
bib,
pdf:local,
pdf:draft)
- Thomas, C., & B. Jurish.
"Named Entity Recognition (NER) im Deutschen Textarchiv – Computerlinguistisch gestützte Identifikation von Personen- und Ortsnamen in den Korpora des DTA."
Talk presented by C. Thomas at the Workshop Mehr Personen – Mehr Daten – Mehr Repositorien,
Berlin-Brandenburgische Akademie der Wissenschaften,
4th – 6th March 2013.
(abstract,
slides,
slides:local,
data)
- Würzner, K.-M., L. Lemnitzer, A. Geyken, & B. Jurish.
"Linguistic Annotation of Computer-Mediated Communication, (not only) an Explorative Analysis."
Talk presented by K.-M. Würzner at 35. Jahrestagung der DGfS,
12th - 15th March 2013.
(slides)
2012
- Jurish, B.
Finite-state Canonicalization Techniques for Historical German.
PhD thesis, Universität Potsdam, 2012 (completed 2011, published 2012).
URN urn:nbn:de:kobv:517-opus-55789,
URL http://opus.kobv.de/ubp/volltexte/2012/5578/.
(pdf:local,
bib)
- Jurish, B. and K.-M. Würzner.
"Multi-threaded composition of finite-state transducers."
Talk presented at the
6th International Workshop on Weighted Automata Theory and Applications (WATA 2012)
29th May – 2nd June, 2012, Dresden, Germany.
(slides)
- Würzner, K.-M., B. Jurish, A. Geyken, & L. Lemnitzer.
"Kollaborative Erstellung eines annotierten Korpus als Grundlage für die Anwendung statistischer Ansätze der automatischen Sprachverarbeitung auf internetbasierte Kommunikation."
Talk presented by K.-M. Würzner at the Workshop Webkorpora in Linguistik und Sprachforschung,
Mannheim, Germany, 27th - 28th September 2012.
(abstract,
slides)
- Geyken, A., S. Haaf, B. Jurish, M. Schulz, C. Thomas, & F. Wiegand.
"TEI und Textkorpora: Fehlerklassifikation und Qualitätskontrolle vor, während und nach der Texterfassung im Deutschen Textarchiv."
In: Jahrbuch für Computerphilologie - online, 2012.
(html)
2011
-
Gärtner, H.-M. & B. Jurish.
"Postmodern linguistics and the prospects of neural syntax: Some polemical remarks."
Theoretical Linguistics 37(1/2):37-44.
(pdf:draft)
- Geyken, A., S. Haaf, B. Jurish, M. Schulz, J. Steinmann, C. Thomas & F. Wiegand.
"Das Deutsche Textarchiv: Vom historischen Korpus zum aktiven Archiv."
In
S. Schomburg, C. Leggewie, H. Lobin & C. Puschmann (editors),
Proceedings of Digitale Wissenschaft: Stand und Entwicklung digital vernetzter Forschung in Deutschland,
20th-21st September 2010: 2nd, expanded edition, pages 157-161, 2011.
(pdf)
2010
- Jurish, B.
"More than words: using token context to improve canonicalization of historical German."
Journal for Language Technology and Computational Linguistics,
25(1):23-40,
2010.
(pdf,
pdf:local,
pdf:draft,
bib)
- Jurish, B.
"Comparing canonicalizations of historical German text."
In
Proceedings of the 11th Meeting of the ACL Special Interest Group on
Computational Morphology and Phonology (SIGMORPHON), pages 72-77,
Uppsala, Sweden, 15 July 2010.
(pdf,
bib,
pdf:local,
bib:local,
slides)
- Jurish, B.
"Efficient online k-best lookup in weighted finite-state cascades."
In T. Hanneforth and G. Fanselow, editors,
Language and Logos: Studies in Theoretical and Computational Linguistics,
volume 72 of Studia grammatica. Akademie Verlag, Berlin, 2010. ISBN 978-3-05-004931-1.
(pdf:draft,
bib)
2009
- Jurish, B.
"Canonicalization Strategies for Historical German Text."
Talk presented at the
Berlin-Brandenburg Academy of Sciences
(BBAW), Berlin, Germany, 19 November, 2009.
(slides)
- Jurish, B., A. Siebert, and K.-M. Würzner.
"Real-Time
Error-Tolerant Linguistic Analysis of User-Generated Content."
Talk presented by A. Siebert at the GSCL Workshop, 29 September 2009, Potsdam.
2008
- Jurish, B.
"Finding canonical forms for historical German text."
In A. Storrer, A. Geyken, A. Siebert and K.-M. Würzner (editors),
Text Resources and Lexical Knowledge:
selected papers from the 9th Conference on Natural Language Processing (KONVENS 2008),
pages 27-37.
Berlin, de Gruyter, September, 2008.
ISBN 978-3-11-020735-4.
(pdf:draft,
slides,
bib)
2006
- Jurish, B.
"Deterministic Letter-to-Sound Transduction in the Taxi/Grimm Corpus Indexing System."
Talk presented at the
Berlin-Brandenburg Academy of Sciences
(BBAW), Berlin, Germany, 22 December, 2006.
(slides,
slides:revised)
- Jurish, B.
"Experiments in Unsupervised Morphology Induction."
Talk presented at the
Berlin-Brandenburg Academy of Sciences
(BBAW), Berlin, Germany, June, 2006.
(slides)
2005
- Jurish, B.
"Hybrid syntactic category induction."
Paper presented at the
Workshop on Computational Modelling of Language Acquisition
(CPALA), Split, Croatia, 25-27 July, 2005.
(pdf:color,
pdf:monochrome,
slides,
bib)
2004
- beim Graben, P., B. Jurish, D. Saddy, & S. Frisch.
"Language processing by dynamical systems."
International Journal of Bifurcation and Chaos, 14(2): 599 - 622 (2004).
(pdf)
- Jurish, B. "Music as a formal language."
Paper presented at the first international
pd~convention, Graz, Austria, September, 2004.
(pdf:draft,
slides)
2003
-
Jurish, B.
"A hybrid approach to part-of-speech tagging."
Final report, Project Kollokationen im Wörterbuch,
Berlin-Brandenburgische Akademie der Wissenschaften, 2003.
(pdf,
Software Page)
-
Jurish, B.
"Part-of-Speech Tagging with Finite State Morphology."
Poster presented at the conference
Collocations and Idioms:
Linguistic, Computational, and Psycholinguistic Perspectives,
Berlin, 18th-20th September, 2003.
(pdf)
2001
-
Jurish, B. Relational Query Feature Structures,
Diplom thesis, Universität Potsdam, October, 2001.
(HTML,
PostScript)