Provided by: genomethreader_1.7.3+dfsg-10build2_amd64 bug

NAME

       gthbssmtrain - train splice site model

SYNOPSIS

       gthbssmtrain [option ...] GFF3_file

DESCRIPTION

       Create BSSM training data from annotation given in GFF3_file.

OPTIONS

       -outdir
           set name of output directory to which the training files are written default:
           training_data

       -gcdonor
           extract training data for GC donor sites default: yes

       -filtertype
           set type of features to used for filtering (usually 'exon' or 'CDS') default: exon

       -goodexoncount
           set the minimum number of good exons a feature must have to be included into the
           training data default: 1

       -cutoff
           set the minimum score an exon must have to count towards the ``good exon count''
           (exons without a score count as good) default: 1.00

       -extracttype
           set type of features to be extracted as exons (usually 'exon' or 'CDS') default: CDS

       -seqfile
           set the sequence file from which to take the sequences default: undefined

       -encseq
           set the encoded sequence indexname from which to take the sequences default: undefined

       -seqfiles
           set the sequence files from which to extract the features use '--' to terminate the
           list of sequence files

       -matchdesc
           search the sequence descriptions from the input files for the desired sequence IDs (in
           GFF3), reporting the first match default: no

       -matchdescstart
           exactly match the sequence descriptions from the input files for the desired sequence
           IDs (in GFF3) from the beginning to the first whitespace default: no

       -usedesc
           use sequence descriptions to map the sequence IDs (in GFF3) to actual sequence
           entries. If a description contains a sequence range (e.g., III:1000001..2000000), the
           first  part is used as sequence ID ('III') and the first range position as offset
           ('1000001') default: no

       -regionmapping
           set file containing sequence-region to sequence file mapping default: undefined

       -seed
           set seed for random number generator manually 0 generates a seed from the current time
           and the process id default: 0

       -v
           be verbose default: no

       -gzip
           write gzip compressed output files default: no

       -bzip2
           write bzip2 compressed output files default: no

       -force
           force writing to output files default: no

       -help
           display help and exit

       -version
           display version information and exit

                                                                                  GTHBSSMTRAIN(1)