Provided by: gmap_2011-11-30-1_amd64 bug


       gmap - Genomic Mapping and Alignment Program


       gmap -dDB|-gFASTA [OPTION]... [QUERY]...


       Align  the sequences QUERY to the reference, specified with -d or -g.  With no QUERY, read
       standard input.


   Input options
       -D, --dir=directory
              Genome directory

       -d, --db=STRING
              Genome database. If argument is '?' (with the quotes), this command lists available

       -k, --kmer=INT
              kmer  size to use in genome database (allowed values: 12-15). If not specified, the
              program will find the highest available kmer size in the genome database

       -G, --genomefull
              Use full genome (all ASCII chars  allowed;  built  explicitly  during  setup),  not
              compressed version

       -g, --gseg=filename
              User-supplied genomic segment

       -2, --pairalign
              Align  two  sequences in FASTA format via stdin, first one being genomic and second
              one being cDNA

              Align these two sequences provided on the command line, first one being genomic and
              second one being cDNA

       -q, --part=INT/INT
              Process  only  the  i-th out of every n sequences e.g., 0/100 or 99/100 (useful for
              distributing jobs to a computer farm).

              Size of input buffer (program reads this many sequences at a time  for  efficiency)
              (default 1000)

   Computation options
       -B, --batch=INT
               Mode     Offsets       Positions       Genome
                 0      allocate      mmap            mmap
                 1      allocate      mmap & preload  mmap
                 2      allocate      mmap & preload  mmap & preload (default)
                 3      allocate      allocate        mmap & preload
                 4      allocate      allocate        allocate
                 5      expand        allocate        allocate

              Note:  For  a single sequence, all data structures use mmap.  If mmap not available
              and allocate not chosen, then will use fileio (very slow)

              Turns off splicing (useful for aligning genomic sequences onto a genome)

              Min length for one internal intron (default 9).  Below this  size,  a  genomic  gap
              will be considered a deletion rather than an intron.

       -K, --intronlength=INT
              Max length for one internal intron (default 1000000)

       -w, --localsplicedist=INT
              Max length for known splice sites at ends of sequence (default 200000)

       -L, --totallength=INT
              Max total intron length (default 2400000)

       -x, --chimera-margin=INT
              Amount  of  unaligned  sequence  that  triggers  search  for the remaining sequence
              (default 40).  Enables alignment of chimeric reads, and may  help  with  some  non-
              chimeric reads. To turn off, set to a large value (greater than the query length).

       -t, --nthreads=INT
              Number of worker threads

       -C, --chrsubsetfile=filename
              User-supplied chromosome subset file

       -c, --chrsubset=string
              Chromosome subset to search

       -z, --direction=STRING
              cDNA  direction  (sense_force,  antisense_force, sense_filter, antisense_filter, or
              auto (default))

       -H, --trimendexons=INT
              Trim end exons with fewer than given number of matches (in nt, default 12)

              For cross-species alignments, use a more sensitive search for canonical splicing

              Reward for  canonical  and  semi-canonical  introns  0=low  reward,  1=high  reward
              (default), 2=low reward for high-identity sequences and high reward otherwise

              Allow  an insertion and deletion close to each other (0=no, 1=yes (default), 2=only
              for high-quality alignments)

              Allow microexons only if one of the splice site probabilities is greater than  this
              value (default 0.90)

       -p, --prunelevel
              Pruning  level:  0=no pruning (default), 1=poor seqs, 2=repetitive seqs, 3=poor and

   Output types
       -S, --summary
              Show summary of alignments only

       -A, --align
              Show alignments

       -3, --continuous
              Show alignment in three continuous lines

       -4, --continuous-by-exon
              Show alignment in three lines per exon

       -Z, --compress
              Print output in compressed format

       -E, --exons=STRING
              Print exons ("cdna" or "genomic")

       -P, --protein_dna
              Print protein sequence (cDNA)

       -Q, --protein_gen
              Print protein sequence (genomic)

       -f, --format=INT
              Other format for output (also note the -A and -S options and other  options  listed
              under Output types):
               psl (or 1)= PSL (BLAT) format,
               gff3_gene (or 2)= GFF3 gene format,
               gff3_match_cdna (or 3)= GFF3 cDNA_match format,
               gff3_match_est (or 4) = GFF3 EST_match format,
               splicesites (or 6) = splicesites output (for GSNAP splicing file),
               introns = introns output (for GSNAP splicing file),
               map_exons (or 7) = IIT FASTA exon map format,
               map_genes (or 8) = IIT FASTA map format,
               coords (or 9) = coords in table format,
               sampe = SAM format (setting paired_read bit in flag),
               samse = SAM format (without setting paired_read bit)

   Output options
       -n, --npaths=INT
              Maximum number of paths to show. If set to 0, prints two paths if chimera detected,
              else one.

              If more than maximum number of paths are found, then nothing is printed.

              Report only paths whose score is within this value of the best path. By default, if
              this option is not provided, the program prints all paths found.

       -O, --ordered
              Print output in same order as input (relevant only if there is more than one worker

       -5, --md5
              Print MD5 checksum for each query sequence

       -o, --chimera-overlap
              Overlap to show, if any, at chimera breakpoint

              Print only failed alignments, those with no results

              Exclude printing of failed alignments

              Print completely failed alignments as input FASTA or FASTQ format  Allowed  values:
              yes, no

       -V, --usesnps=STRING
              Use  database  containing  known  SNPs  (in  <STRING>.iit,  built  previously using
              snpindex) for reporting output

              Basename for multiple-file output,  separately  for  nomapping,  uniq,  mult,  (and
              chimera, if --chimera-margin is selected)

              Buffer  size,  in  queries,  for  output  thread (default 1000). When the number of
              results to be printed exceeds this size, the worker threads are  halted  until  the
              backlog is cleared

       -F, --fulllength
              Assume full-length protein, starting with Met

              Translate codons from given nucleotide (1-based)

       -T, --truncate
              Truncate alignment around full-length protein, Met to Stop Implies -F flag.

       -Y, --tolerant
              Translates cDNA with corrections for frameshifts

   Options for SAM output
              Do not print headers beginning with '@'

              Value to put into read-group id (RG-ID) field

              Value to put into read-group name (RG-SM) field

              Value to put into read-group library (RG-LB) field

              Value to put into read-group library (RG-PL) field

   Options for quality scores
              Protocol for input quality scores. Allowed values:
               illumina (ASCII 64-126) (equivalent to -J 64 -j -31)
               sanger   (ASCII 33-126) (equivalent to -J 33 -j 0)

              Default  is  sanger  (no  quality print shift) SAM output files should have quality
              scores in sanger protocol.  Or you can specify the print shift with this flag:

       -j, --quality-print-shift=INT
              Shift FASTQ quality scores by this amount  in  output  (default  is  0  for  sanger
              protocol; to change Illumina input to Sanger output, select -31)

   External map file options
       -M, --mapdir=directory
              Map directory

       -m, --map=iitfile
              Map file. If argument is '?' (with the quotes), this lists available map files.

       -e, --mapexons
              Map each exon separately

       -b, --mapboth
              Report hits from both strands of genome

       -u, --flanking=INT
              Show flanking hits (default 0)

              Show comment line for each hit

   Alignment output options
       -N, --nolengths
              No intron lengths in alignment

       -I, --invertmode=INT
              Mode for alignments to genomic (-) strand:
               0=Don't invert the cDNA (default)
               1=Invert cDNA and print genomic (-) strand
               2=Invert cDNA and print genomic (+) strand

       -i, --introngap=INT
              Nucleotides to show on each end of intron (default=3)

       -l, --wraplength=INT
              Wrap length for alignment (default=50)

   Help options
              Show version

       --help Show this help message


       GMAPDB genome directory (eqivalent to -D)


              configuration file


       Thomas D. Wu and Colin K. Watanabe


       Report bugs to Thomas Wu <>.


       Copyright 2005 Genentech, Inc. All rights reserved.


       gmap_setup(1), gsnap(1)