Ubuntu Manpage: stellar - the SwifT Exact LocaL AligneR

Provided by: seqan-apps_2.4.0+dfsg-15ubuntu1_amd64

NAME

       stellar - the SwifT Exact LocaL AligneR

SYNOPSIS

       stellar [OPTIONS] <FASTA FILE 1> <FASTA FILE 2>

DESCRIPTION

       STELLAR  implements the SWIFT filter algorithm (Rasmussen et al., 2006) and a verification
       step for the SWIFT hits  that  applies  local  alignment,  gapped  X-drop  extension,  and
       extraction of the longest epsilon-match.

       Input  to  STELLAR  are  two files, each containing one or more sequences in FASTA format.
       Each sequence from file 1 will be compared to each sequence in file 2. The sequences  from
       file 1 are used as database, the sequences from file 2 as queries.

       (c) 2010-2012 by Birte Kehr

REQUIRED ARGUMENTS

       FASTA_FILE_1 INPUT_FILE
               Valid filetypes are: .fasta and .fa.

       FASTA_FILE_2 INPUT_FILE
               Valid filetypes are: .fasta and .fa.

OPTIONS

       -h, --help
              Display the help message.

       --version
              Display version information.

   Main Options:
       -e, --epsilon DOUBLE
              Maximal error rate (max 0.25). In range [0.0000001..0.25]. Default: 0.05.

       -l, --minLength INTEGER
              Minimal length of epsilon-matches. In range [0..inf]. Default: 100.

       -f, --forward
              Search only in forward strand of database.

       -r, --reverse
              Search only in reverse complement of database.

       -a, --alphabet STRING
              Alphabet type of input sequences (dna, rna, dna5, rna5, protein, char). One of dna,
              dna5, rna, rna5, protein, and char.

       -v, --verbose
              Set verbosity mode.

   Filtering Options:
       -k, --kmer INTEGER
              Length of the q-grams (max 32). In range [1..32].

       -rp, --repeatPeriod INTEGER
              Maximal period of low complexity repeats to be filtered. Default: 1.

       -rl, --repeatLength INTEGER
              Minimal length of low complexity repeats to be filtered. Default: 1000.

       -c, --abundanceCut DOUBLE
              k-mer overabundance cut ratio. In range [0..1]. Default: 1.

   Verification Options:
       -x, --xDrop DOUBLE
              Maximal x-drop for extension. Default: 5.

       -vs, --verification STRING
              Verification strategy: exact or bestLocal or bandedGlobal One of exact,  bestLocal,
              and bandedGlobal. Default: exact.

       -dt, --disableThresh INTEGER
              Maximal  number  of  verified  matches  before disabling verification for one query
              sequence (default infinity). In range [0..inf].

       -n, --numMatches INTEGER
              Maximal number of kept matches per  query  and  database.  If  STELLAR  finds  more
              matches, only the longest ones are kept. Default: 50.

       -s, --sortThresh INTEGER
              Number  of  matches  triggering  removal  of duplicates. Choose a smaller value for
              saving space. Default: 500.

   Output Options:
       -o, --out OUTPUT_FILE
              Name of output file. Valid filetypes are: .txt and .gff. Default: stellar.gff.

       -od, --outDisabled OUTPUT_FILE
              Name of output file for disabled query sequences. Valid  filetypes  are:  .sam[.*],
              .raw[.*],  .frn[.*], .fq[.*], .fna[.*], .ffn[.*], .fastq[.*], .fasta[.*], .faa[.*],
              .fa[.*], and .bam, where * is any of the following extensions: gz,  bz2,  and  bgzf
              for transparent (de)compression. Default: stellar.disabled.fasta.

REFERENCES

       Kehr,  B.,  Weese,  D.,  Reinert,  K.:  STELLAR:  fast  and  exact  local  alignments. BMC
       Bioinformatics, 12(Suppl 9):S15, 2011.