NAME

DTA::TokWrap::Processor::standoff - DTA tokenizer wrappers: t.xml -> (s.xml, w.xml, a.xml) via external filter programs

SYNOPSIS

 use DTA::TokWrap::Processor::standoff;
 
 $so = DTA::TokWrap::Processor::standoff->new(%opts);
 $doc_or_undef = $CLASS_OR_OBJECT->sosxml($doc);
 $doc_or_undef = $CLASS_OR_OBJECT->sowxml($doc);
 $doc_or_undef = $CLASS_OR_OBJECT->soaxml($doc);
 $doc_or_undef = $CLASS_OR_OBJECT->standoff($doc);
 
 ##-- backwards-compatibility
 undef = $so->dump_t2s_stylesheet($filename_or_fh);
 undef = $so->dump_t2w_stylesheet($filename_or_fh);
 undef = $so->dump_t2a_stylesheet($filename_or_fh);

DESCRIPTION

Constants

Variable: @ISA

DTA::TokWrap::Processor::standoff inherits from DTA::TokWrap::Processor.

Constructors etc.

new
 $so = $CLASS_OR_OBJECT->new(%args);

Constructor.

%args, %$so:

 t2w => $path_to_dtatw_txml2wxml, ##-- default: search
 t2s => $path_to_dtatw_txml2sxml, ##-- default: search
 t2a => $path_to_dtatw_txml2axml, ##-- default: search
 inplace => $bool,                ##-- prefer in-place programs for search?
defaults
 %defaults = CLASS->defaults();

Static class-dependent defaults.

init
 $so = $so->init();

Dynamic object-dependent defaults.

Methods: Backwards-compatibility

_xsl
 $so_xsl = $so->_xsl();

Return a DTA::TokWrap::Processor::standoff::xsl object which may or may not be logically equivalent to $so.

dump_t2s_stylesheet
 undef = $so->dump_t2s_stylesheet($filename_or_fh);

See DTA::TokWrap::Processor::standoff::xsl::dump_t2s_stylesheet().

dump_t2w_stylesheet
 undef = $so->dump_t2w_stylesheet($filename_or_fh);

See DTA::TokWrap::Processor::standoff::xsl::dump_t2w_stylesheet().

dump_t2a_stylesheet
 undef = $so->dump_t2a_stylesheet($filename_or_fh);

See DTA::TokWrap::Processor::standoff::xsl::dump_t2a_stylesheet().

Methods: document processing

soxml
 $doc_or_undef = $CLASS_OR_OBJECT->soxml($doc,$X,$xmlbase);

Low-level generic standoff formatting method. Generate $X-level standoff for the DTA::TokWrap::Document object $doc.

Relevant %$doc keys:

 xtokdata    => $xtokdata, ##-- (input) XML-ified tokenizer output data (string)
 so${X}file  => $sosfile,  ##-- (output) standoff file, refers to $xml_base
 ##
 so${X}xml_stamp0 => $f,   ##-- (output) timestamp of operation begin
 so${X}xml_stamp  => $f,   ##-- (output) timestamp of operation end
 so${X}file_stamp => $f,   ##-- (output) timestamp of operation end
sosxml
 $doc_or_undef = $CLASS_OR_OBJECT->sosxml($doc);

Just a wrapper for:

 $so->soxml($doc,'s',basename($doc->{sowfile}));
sowxml
 $doc_or_undef = $CLASS_OR_OBJECT->sowxml($doc);

Just a wrapper for:

 $so->soxml($doc,'w',$doc->{xmlbase});
soaxml
 $doc_or_undef = $CLASS_OR_OBJECT->soaxml($doc);

Just a wrapper for:

 $so->soxml($doc,'a',basename($doc->{sowfile}));

SEE ALSO

DTA::TokWrap::Intro(3pm), dta-tokwrap.perl(1), ...

SEE ALSO

DTA::TokWrap::Intro(3pm), dta-tokwrap.perl(1), ...

AUTHOR

Bryan Jurish <jurish@bbaw.de>

COPYRIGHT AND LICENSE

Copyright (C) 2009-2018 by Bryan Jurish

This package is free software; you can redistribute it and/or modify it under the same terms as Perl itself, either Perl version 5.14.2 or, at your option, any later version of Perl 5 you may have available.