NAME

DTA::CAB::Format::XmlTokWrapFast - DTA::TokWrap XML, fast quick & dirty I/O for (.ddc).t.xml

SYNOPSIS

 ##========================================================================
 ## PRELIMINARIES
 
 use DTA::CAB::Format::XmlTokWrapFast;
 
 ##========================================================================
 ## Constructors etc.
 
 $fmt = CLASS_OR_OBJ->new(%args);
 $xmlparser = $fmt->xmlparser();
 
 ##========================================================================
 ## Methods: I/O: generic
 
 $fmt = $fmt->close($savetmp=0);
 @layers = $fmt->iolayers();
 
 ##========================================================================
 ## Methods: I/O: Block-wise: Generic
 
 %blockOpts = $CLASS_OR_OBJECT->blockDefaults();
 
 ##========================================================================
 ## Methods: Input: Input selection
 
 $fmt = $fmt->fromString(\$string);
 $fmt = $fmt->fromFile($filename);
 $fmt = $fmt->fromFh($handle);
 
 ##========================================================================
 ## Methods: Input: Generic API
 
 $doc = $fmt->parseDocument();
 
 ##========================================================================
 ## Methods: Output: MIME & HTTP stuff
 
 $short = $fmt->shortName();
 $ext = $fmt->defaultExtension();
 
 ##========================================================================
 ## Methods: Output: output selection
 
 $fmt = $fmt->flush();
 $str = $fmt->toString();
 $fmt_or_undef = $fmt->toFile($filename_or_handle, $formatLevel);
 $fmt_or_undef = $fmt->toFh($fh,$formatLevel);
 
 ##========================================================================
 ## Methods: Output: quick and dirty
 
 $fmt = $fmt->putDocument($doc);
 

DESCRIPTION

Globals

Variable: @ISA

DTA::CAB::Format::XmlTokWrapFast inherits from the more generic but slower DTA::CAB::Format::XmlTokWrap.

Constructors etc.

new
 $fmt = CLASS_OR_OBJ->new(%args);

object structure: HASH ref

    {
     ##-- input: new
     doc   => $doc,         ##-- cached parsed DTA::CAB::Document
     ##-- input: inherited (but unused)
     #xdoc => $xdoc,                          ##-- XML::LibXML::Document
     #xprs => $xprs,                          ##-- override: XML::Parser parser
     ##-- output: inherited from DTA::CAB::Format
     utf8  => $bool,                         ##-- always true
     level => $level,                        ##-- output formatting level (default=0)
     output_moot => $bool,                     ##-- include <moot> output element? (default=1)
     output_ner  => $bool,                     ##-- include <ner> output element? (default=0)
    }
xmlparser
 $xmlparser = $fmt->xmlparser();

returns cached $fmt->{xprs} if available, otherwise caches & returns new XML::Parser

Methods: I/O: generic

close
 $fmt = $fmt->close($savetmp=0);

override calls $fmt->flush() and deletes @$fmt{qw(xdoc output)}

iolayers
 @layers = $fmt->iolayers();

returns PerlIO layers to use for I/O handles; override returns ':raw'

Methods: I/O: Block-wise: Generic

blockDefaults
 %blockOpts = $CLASS_OR_OBJECT->blockDefaults();

returns default block options as for blockOptions(); override returns as for $CLASS_OR_OBJECT->blockOptions('2m@s')

Methods: Input: Input selection

fromString
 $fmt = $fmt->fromString(\$string);

input from string

fromFile
 $fmt = $fmt->fromFile($filename);

input from named file: override buffers XML document in $fmt->{xdoc}

fromFh
 $fmt = $fmt->fromFh($handle);

input from filehandle: override buffers XML document in $fmt->{xdoc}

Methods: Input: Generic API

parseDocument
 $doc = $fmt->parseDocument();

parse document from currently selected input source; override returns buffered $fmt->{doc}.

Methods: Output: MIME & HTTP stuff

shortName
 $short = $fmt->shortName();

returns "official" short name for this format; override returns "ftxml".

defaultExtension
 $ext = $fmt->defaultExtension();

returns default filename extension for this format; override returns ".ft.xml".

Methods: Output: output selection

flush
 $fmt = $fmt->flush();

flush accumulated output

toString
 $str = $fmt->toString();
 $str = $fmt->toString($formatLevel);

flush buffered output document to byte-string

toFile
 $fmt_or_undef = $fmt->toFile($filename_or_handle, $formatLevel);

flush buffered output document to $filename_or_handle; default implementation calls $fmt->toFh().

toFh
 $fmt_or_undef = $fmt->toFh($fh,$formatLevel);

flush buffered output document to filehandle $fh

Methods: Output: quick and dirty

putDocument
 $fmt = $fmt->putDocument($doc);

quick and dirty output using .ddc.t.xml attributes only.

EXAMPLE

An example file in the format accepted/generated by this module is:

 <?xml version="1.0" encoding="UTF-8"?>
 <doc>
  <s>
        <w t="wie" exlex="wie" errid="ec" msafe="1"><moot word="wie" tag="PWAV" lemma="wie"/></w>
        <w t="oede" msafe="0"><moot word="öde" tag="ADJD" lemma="öde"/></w>
        <w t="!" exlex="!" errid="ec" msafe="1"><moot word="!" tag="$." lemma="!"/></w>
  </s>
 </doc>

AUTHOR

Bryan Jurish <moocow@cpan.org>

COPYRIGHT AND LICENSE

Copyright (C) 2011-2019 by Bryan Jurish

This package is free software; you can redistribute it and/or modify it under the same terms as Perl itself, either Perl version 5.24.1 or, at your option, any later version of Perl 5 you may have available.

SEE ALSO

dta-cab-analyze.perl(1), dta-cab-convert.perl(1), dta-cab-http-server.perl(1), dta-cab-http-client.perl(1), dta-cab-xmlrpc-server.perl(1), dta-cab-xmlrpc-client.perl(1), DTA::CAB::Server(3pm), DTA::CAB::Client(3pm), DTA::CAB::Format(3pm), DTA::CAB(3pm), perl(1), ...