Provided by: tigr-glimmer_3.02b-2build1_amd64 bug

NAME

       tigr-glimmer — Find/Score potential genes in genome-file using the probability model in icm-file

SYNOPSIS

       tigr-glimmer3 [genome-file]  [icm-file]  [[options]]

DESCRIPTION

       tigr-glimmer  is  a  system  for  finding  genes in microbial DNA, especially the genomes of bacteria and
       archaea. tigr-glimmer (Gene Locator and Interpolated Markov  Modeler)  uses  interpolated  Markov  models
       (IMMs)  to  identify  the  coding  regions  and  distinguish  them  from noncoding DNA. The IMM approach,
       described in our Nucleic Acids Research paper on tigr-glimmer 1.0 and in our subsequent  paper  on  tigr-
       glimmer  2.0,  uses  a  combination  of  Markov  models  from 1st through 8th-order, weighting each model
       according to its predictive power. tigr-glimmer 1.0 and 2.0 use 3-periodic nonhomogenous Markov models in
       their IMMs.

       tigr-glimmer  is  the  primary  microbial gene finder at TIGR, and has been used to annotate the complete
       genomes of B. burgdorferi (Fraser et al., Nature, Dec. 1997), T. pallidum (Fraser et al.,  Science,  July
       1998),  T.  maritima, D. radiodurans, M. tuberculosis, and non-TIGR projects including C. trachomatis, C.
       pneumoniae, and others. Its analyses of some of these  genomes  and  others  is  available  at  the  TIGR
       microbial database site.

       A  special version of tigr-glimmer designed for small eukaryotes, GlimmerM, was used to find the genes in
       chromosome 2 of the malaria parasite, P. falciparum.. GlimmerM is described in S.L. Salzberg, M.  Pertea,
       A.L.  Delcher,  M.J.  Gardner, and H. Tettelin, "Interpolated Markov models for eukaryotic gene finding,"
       Genomics 59 (1999), 24-31.  Click here (http://www.tigr.org/software/glimmerm/)  to  visit  the  GlimmerM
       site, which includes information on how to download the GlimmerM system.

       The tigr-glimmer system consists of two main programs. The first of these is the training program, build-
       imm. This program takes an input set of sequences  and  builds  and  outputs  the  IMM  for  them.  These
       sequences can be complete genes or just partial orfs. For a new genome, this training data can consist of
       those genes with strong database hits as well as very long open reading  frames  that  are  statistically
       almost certain to be genes. The second program is glimmer, which uses this IMM to identify putative genes
       in an entire genome. tigr-glimmer automatically resolves conflicts  between  most  overlapping  genes  by
       choosing  one  of them. It also identifies genes that are suspected to truly overlap, and flags these for
       closer inspection by the user. These ``suspect'' gene candidates have been a very small percentage of the
       total for all the genomes analyzed thus far.  tigr-glimmer is a program that...

OPTIONS

       -C n      Use n as GC percentage of independent model

                 Note:  n should be a percentage, e.g., -C 45.2

       -f        Use ribosome-binding energy to choose start codon

       +f        Use first codon in orf as start codon

       -g n      Set minimum gene length to n

       -i filename
                 Use  filename    to  select  regions of bases that are off limits, so that no bases within that
                 area will be examined

       -l        Assume linear rather than circular genome, i.e., no wraparound

       -L filename
                 Use filename to specify a list of orfs that should be scored separately, with no overlap rules

       -M        Input is a multifasta file of separate genes to be scored separately, with no overlap rules

       -o n      Set minimum overlap length to n.  Overlaps shorter than this are ignored.

       -p n      Set minimum overlap percentage to n%.  Overlaps shorter than this percentage of *both*  strings
                 are ignored.

       -q n      Set  the  maximum  length orf that can be rejected because of the independent probability score
                 column to (n - 1)

       -r        Don't use independent probability score column

       +r        Use independent probability score column

       -r        Don't use independent probability score column

       -s s      Use string s as the ribosome binding pattern to find start codons.

       +S        Do use stricter independent intergenic model that doesn't give probabilities to  in-frame  stop
                 codons.  (Option is obsolete since this is now the only behaviour

       -t n      Set  threshold  score for calling as gene to n.  If the in-frame score >= n, then the region is
                 given a number and considered a potential gene.

       -w n      Use "weak" scores on  tentative  genes  n  or  longer.   Weak  scores  ignore  the  independent
                 probability score.

SEE ALSO

       tigr-adjust  (1),  tigr-anomaly   (1),  tigr-build-icm  (1),  tigr-check (1), tigr-codon-usage (1), tigr-
       compare-lists (1), tigr-extract (1), tigr-generate (1), tigr-get-len (1),  tigr-get-putative  (1),  tigr-
       glimmer3 (1), tigr-long-orfs (1)

       http://www.tigr.org/software/glimmer/

       Please see the readme in /usr/share/doc/glimmer for a description on how to use Glimmer.

AUTHOR

       This  manual  page was quickly copied from the glimmer web site by Steffen Moeller moeller@debian.org for
       the Debian system.

                                                                                                 TIGR-GLIMMER(1)