NAME

DTA::TokWrap::Processor::tcfdecode - DTA tokenizer wrappers: TCF->TEI+ws decoding via proxy document

SYNOPSIS

 use DTA::TokWrap::Processor::tcfdecode;
 
 $dec = DTA::TokWrap::Processor::tcfdecode->new(%opts);
 $doc_or_undef = $dec->tcfdecode($doc);

DESCRIPTION

DTA::TokWrap::Processor::tcfdecode provides an object-oriented DTA::TokWrap::Processor wrapper for decoding the tokens and sentences layers as extracted from a TCF document by the DTA::TokWrap::Processor::tcfdecode0 processor into the decoded TEI textSource layer as w and s elements.

Constants

@ISA

DTA::TokWrap::Processor::tcfdecode inherits from DTA::TokWrap::Processor.

Constructors etc.

new
 $obj = $CLASS_OR_OBJECT->new(%args);

Constructor. Options in %args:

 'att.linguistic' => $bool,  ##-- if true, use TEI att.linguistic features (default=1)
defaults
 %defaults = $CLASS->defaults();

Static class-dependent defaults.

Methods

tcfdecode
 $doc_or_undef = $CLASS_OR_OBJECT->tcfdecode($doc);

Decodes the token- and sentence-structure extracted from a TCF document and merges the results into the original TEI, assuming that the original encoding was done by TokWrap. Uses a proxy DTA::TokWrap::Document object to perform the decoding.

Relevant %$doc keys:

 tcfxfile => $tcfxfile,     ##-- (input) TEI-XML decoded from TCF
 tcftfile => $tcftfile,     ##-- (input) text data decoded from TCF
 tcfwfile => $tcfwfile,     ##-- (input) tokenized data decoded from TCF, without byte-offsets, with "SID/WID" attributes
 ##
 tcfcwsfile => $tcfcwsfile, ##-- (output) tcf-decoded+aligned+ws-spliced output file
 tcfdecode_stamp0 => $f,    ##-- (output) timestamp of operation begin
 tcfdecode_stamp  => $f,    ##-- (output) timestamp of operation end

SEE ALSO

DTA::TokWrap::Intro(3pm), dta-tokwrap.perl(1), ...

SEE ALSO

DTA::TokWrap::Intro(3pm), dta-tokwrap.perl(1), ...

AUTHOR

Bryan Jurish <jurish@bbaw.de>

COPYRIGHT AND LICENSE

Copyright (C) 2014-2018 by Bryan Jurish

This package is free software; you can redistribute it and/or modify it under the same terms as Perl itself, either Perl version 5.14.2 or, at your option, any later version of Perl 5 you may have available.