Ubuntu Manpage: frog - Dutch morpho-syntactic analyzer, IOB chunker and dependency parser

Provided by: frog_0.12.17-7.1build1_amd64

NAME

       frog - Dutch morpho-syntactic analyzer, IOB chunker and dependency parser

SYNOPSYS

       frog [options]

       frog -t test-file

DESCRIPTION

       frog is an integration of memory-based natural language processing (NLP) modules developed
       for Dutch.  frog's current version will  tokenize,  tag,  lemmatize,  and  morphologically
       segment word tokens in Dutch text files, add IOB chunks and will assign a dependency graph
       to each sentence.

OPTIONS

       -c <configfile>
              set the configuration using 'file'

       -d <level>
              set debug level.

       -e <encoding>
              set input encoding. (default UTF8)

       -h
              give some help

       --keep-parser-files=[yes|no]
              keep the intermediate files from the parser. Last sentence only!

       -n
              assume inputfile to hold one sentence per line

       -o <file>
              send output to 'file' instead of stdout. Defaults to the name of the inputfile with
              '.out' appended.

       --outputdir <dir>
              send   all   output  to  'dir'  instead  of  stdout.  Creates  filenames  from  the
              inputfilename(s) with '.out' appended.

       --skip=[aclmnpt]
              skip parts of the proces: Tokenizer (t), Chunker (c), Lemmatizer (l), Morphological
              Analyzer (a), Multi-Word unit (m), Named-Entity recognizer (n) or Parser (p)

       -Q
              Enable quotedetection in the tokenizer. May run havock!

       -S <port>
              Run a server on 'port'

       -t <file>
              process 'file'

       -x <xmlfile>
              process  'xmlfile', which is supposed to be in FoLiA format! If 'xmlfile' is empty,
              and --testdir=<dir> is provided, all '.xml' files in 'dir'  will  be  processed  as
              FoLia XML.

       --textclass=<cls>
              When -x is given, use 'cls' to find text in the FoLiA document(s).

       --testdir=<dir>
              process all files in 'dir'. When the input mode is XML, only '.xml' files are teken
              from 'dir'. see also --outputdir

       --tmpdir=<dir>
              location to store intermediate files. Default /tmp.

       --threads=<n>
              use a maximum of 'n' threads. The default  is  to  take  whatever  is  needed.   In
              servermode we always run on 1 thread.

       -V or --version
              show version info

       --xmldir=<dir>
              generate  FoLiA  XML  output  and  send  it  to  'dir'.  Creates filenames from the
              inputfilename with '.xml' appended. (Except when it already ends with '.xml')

       -X <file>
              generate FoLiA XML output and send it to  'file'.  Defaults  to  the  name  of  the
              inputfile(s) with '.xml' appended. (Except when it already ends with '.xml')

       --id=<id>
              When -X for FoLia is given, use 'id' to give the doc an ID.

BUGS

       likely

AUTHORS

       Maarten van Gompel proycon@anaproy.nl

       Ko van der Sloot Timbl@uvt.nl

       Antal van den Bosch Timbl@uvt.nl

NAME

SYNOPSYS

DESCRIPTION

OPTIONS

BUGS

AUTHORS

SEE ALSO