- moot user tutorial
- moot file formats
- mootutils summary
- libmoot C++ API
"A hybrid approach to part-of-speech tagging";
Final report, Project Kollokationen im Wörterbuch,
Berlin-Brandenburgische Akademie der Wissenschaften, 2003.
(Somewhat outdated but still largely applicable details on the moot system architecture, capabilities, and theoretical background)
Bryan Jurish and Kay-Michael Würzner,
"Word and Sentence Tokenization with Hidden Markov Models";
to appear in Journal for Language Technology and Computational Linguistics,
(technical details on and emprical evaluation of the moot/WASTE tokenization system)
- Bryan Jurish, "A hybrid approach to part-of-speech tagging"; Final report, Project Kollokationen im Wörterbuch, Berlin-Brandenburgische Akademie der Wissenschaften, 2003.
BinariesPlease note that the packages in this section are hopelessly outdated. Building from source is recommended.
Word- and sentence tokenizer using libmoot for disambiguation, described in Jurish and Würzner (2013). The WASTE library and runtime utilities are included with moot source distributions >= v2.0.10.
Finite-state morphological analyzer for use with moot. Users must supply their own morphological transducer. Note that this packge replaces the integrated morphological component described in Jurish (2003).
My (old, unmaintained) port of Alain Coetmeur's C++ ports of the famous lexer/parser generator pair. If you're having trouble compiling from CVS, you might try installing this package. Current versions should be available from: ftp://iecc.com/pub/file/bison++flex++
A perl module for generating command-line parsers and .pod documentation from specification files. You might need this to compile from CVS.