lunar (1) tigr-glimmer3.1.gz

Provided by: tigr-glimmer_3.02b-6_amd64 bug

NAME

       tigr-glimmer  —  Find/Score  potential genes in genome-file using the probability model in
       icm-file

SYNOPSIS

       tigr-glimmer3 [genome-file]  [icm-file]  [[options]]

DESCRIPTION

       tigr-glimmer is a system for finding genes in microbial DNA,  especially  the  genomes  of
       bacteria  and  archaea.  tigr-glimmer  (Gene Locator and Interpolated Markov Modeler) uses
       interpolated Markov models (IMMs) to identify the coding regions and distinguish them from
       noncoding  DNA.  The  IMM approach, described in our Nucleic Acids Research paper on tigr-
       glimmer 1.0 and in our subsequent paper on tigr-glimmer 2.0, uses a combination of  Markov
       models from 1st through 8th-order, weighting each model according to its predictive power.
       tigr-glimmer 1.0 and 2.0 use 3-periodic nonhomogenous Markov models in their IMMs.

       tigr-glimmer is the primary microbial gene finder at TIGR, and has been used  to  annotate
       the  complete  genomes  of  B. burgdorferi (Fraser et al., Nature, Dec. 1997), T. pallidum
       (Fraser et al., Science, July 1998), T. maritima, D.  radiodurans,  M.  tuberculosis,  and
       non-TIGR  projects  including  C.  trachomatis, C. pneumoniae, and others. Its analyses of
       some of these genomes and others is available at the TIGR microbial database site.

       A special version of tigr-glimmer designed for small eukaryotes,  GlimmerM,  was  used  to
       find  the  genes  in  chromosome  2  of  the malaria parasite, P. falciparum.. GlimmerM is
       described in S.L. Salzberg, M. Pertea,  A.L.  Delcher,  M.J.  Gardner,  and  H.  Tettelin,
       "Interpolated  Markov  models  for  eukaryotic  gene  finding," Genomics 59 (1999), 24-31.
       Click here (http://www.tigr.org/software/glimmerm/) to  visit  the  GlimmerM  site,  which
       includes information on how to download the GlimmerM system.

       The  tigr-glimmer system consists of two main programs. The first of these is the training
       program, build-imm. This program takes an input set of sequences and  builds  and  outputs
       the  IMM  for  them. These sequences can be complete genes or just partial orfs. For a new
       genome, this training data can consist of those genes with strong database hits as well as
       very  long  open  reading  frames  that  are statistically almost certain to be genes. The
       second program is glimmer, which uses this IMM to identify putative  genes  in  an  entire
       genome.  tigr-glimmer  automatically  resolves conflicts between most overlapping genes by
       choosing one of them. It also identifies genes that are suspected to  truly  overlap,  and
       flags these for closer inspection by the user. These ``suspect'' gene candidates have been
       a very small percentage of the total for all the genomes analyzed thus far.   tigr-glimmer
       is a program that...

OPTIONS

       -C n      Use n as GC percentage of independent model

                 Note:  n should be a percentage, e.g., -C 45.2

       -f        Use ribosome-binding energy to choose start codon

       +f        Use first codon in orf as start codon

       -g n      Set minimum gene length to n

       -i filename
                 Use  filename   to select regions of bases that are off limits, so that no bases
                 within that area will be examined

       -l        Assume linear rather than circular genome, i.e., no wraparound

       -L filename
                 Use filename to specify a list of orfs that should be scored separately, with no
                 overlap rules

       -M        Input  is  a  multifasta file of separate genes to be scored separately, with no
                 overlap rules

       -o n      Set minimum overlap length to n.  Overlaps shorter than this are ignored.

       -p n      Set minimum overlap percentage to n%.  Overlaps shorter than this percentage  of
                 *both* strings are ignored.

       -q n      Set  the  maximum  length  orf  that  can be rejected because of the independent
                 probability score column to (n - 1)

       -r        Don't use independent probability score column

       +r        Use independent probability score column

       -r        Don't use independent probability score column

       -s s      Use string s as the ribosome binding pattern to find start codons.

       +S        Do use stricter independent intergenic model that doesn't give probabilities  to
                 in-frame stop codons.  (Option is obsolete since this is now the only behaviour

       -t n      Set  threshold score for calling as gene to n.  If the in-frame score >= n, then
                 the region is given a number and considered a potential gene.

       -w n      Use "weak" scores on tentative genes  n  or  longer.   Weak  scores  ignore  the
                 independent probability score.

SEE ALSO

       tigr-adjust  (1), tigr-anomaly   (1), tigr-build-icm (1), tigr-check (1), tigr-codon-usage
       (1), tigr-compare-lists (1), tigr-extract (1), tigr-generate (1), tigr-get-len (1),  tigr-
       get-putative (1), tigr-glimmer3 (1), tigr-long-orfs (1)

       http://www.tigr.org/software/glimmer/

       Please see the readme in /usr/share/doc/glimmer for a description on how to use Glimmer.

AUTHOR

       This  manual  page  was  quickly  copied  from  the  glimmer  web  site by Steffen Moeller
       moeller@debian.org for the Debian system.

                                                                                  TIGR-GLIMMER(1)