Ubuntu Manpage: fasts3, fasts3_t - compare several short peptide sequences against a protein database using a modified

Provided by: fasta3_36.3.8i.14-Nov-2020-3_amd64

NAME

       fasts3,  fasts3_t  -  compare several short peptide sequences against a protein database using a modified
       fasta algorithm.

       tfasts3, tfasts3_t - compare short pepides against a translated DNA database.

DESCRIPTION

       fasts3 and tfasts3 are designed to compare set of  (presumably  non-contiguous)  peptides  to  a  protein
       (fasts3)  or  translated  DNA  (tfasts3)  database.   fasts3/tfasts3  are designed particularly for short
       peptide data from mass-spec analysis of protein digests.  Unlike the  traditional  fasta3  search,  which
       uses a protein or DNA sequence, fasts3 and tfasts3 work with a query sequence of the form:
            >tests from mgstm1
            MLLE,
            MILGYW,
            MGADP,
            MLCYNP
(included with the distribution), the result is:
     testf    MILGYW----------MLLE------------MGDAP-----------
              ::::::          ::::            :::::
     GT8.7  MPMILGYWNVRGLTHPIRMLLEYTDSSYDEKRYTMGDAPDFDRSQWLNEK
                    10        20        30        40        50

     testf  --------------------------------------------------

     GT8.7  FKLGLDFPNLPYLIDGSHKITQSNAILRYLARKHHLDGETEEERIRADIV
                    60        70        80        90       100

                           20
     testf  ------------MLCYNP
                        ::::::
     GT8.7  ENQVMDTRMQLIMLCYNPDFEKQKPEFLKTIPEKMKLYSEFLGKRPWFAG
                   110       120       130       140       150

Options

fasts3 and tfasts3 can accept a query sequence from the unix "stdin" data stream. This makes it much
easier to use fasta3 and its relatives as part of a WWW page. To indicate that stdin is to be used, use
"-" or "@" as the query sequence file name.

-b # number of best scores to show (must be < -E cutoff)

-d # number of best alignments to show ( must be < -E cutoff)

-D turn on debugging mode. Enables checks on sequence alphabet that cause problems with tfastx3,
tfasty3, tfasta3.

-E # Expectation value limit for displaying scores and alignments. Expectation values for fasts3 and
tfasts3 are not as accurate as those for the other fasta3 programs.

-H turn off histogram display

-i compare against only the reverse complement of the library sequence.

-L report long sequence description in alignments

-m 0,1,2,3,4,5,6,9,10
alignment display options

-N # break long library sequences into blocks of # residues. Useful for bacterial genomes, which have
only one sequence entry. -N 2000 works well for well for bacterial genomes.

-O file
send output to file

-q/-Q quiet option; do not prompt for input

-R file
save all scores to statistics file

-S # offset substitution matrix values by a constant #

-s name
specify substitution matrix. BLOSUM50 is used by default; PAM250, PAM120, and BLOSUM62 can be
specified by setting -s P120, P250, or BL62. With this version, many more scoring matrices are
available, including BLOSUM80 (BL80), and MDM_10, MDM_20, MDM_40 (M10, M20, M40). Alternatively,
BLASTP1.4 format scoring matrix files can be specified.

-T # (threaded, parallel only) number of threads or workers to use (set by default to 4 at compile
time).

-t # Translation table - tfasts3 can use the BLAST tranlation tables. See
http://www.ncbi.nih.gov/htbin-post/Taxonomy/wprintgc?mode=c/.

-w # line width for similarity score, sequence alignment, output.

-x "#,#"
offsets query, library sequence for numbering alignments

-z # Specify statistical calculation. Default is -z 1, which uses regression against the length of the
library sequence. -z 0 disables statistics. -z 2 uses the ln() length correction. -z 3 uses
Altschul and Gish's statistical estimates for specific protein BLOSUM scoring matrices and gap
penalties. -z 4: an alternate regression method.

-Z db_size
Set the apparent database size used for expectation value calculations.

-3 (TFASTS3 only) use only forward frame translations

Environment variables:

       FASTLIBS
              location of library choice file (-l FASTLIBS)

       SMATRIX
              default scoring matrix (-s SMATRIX)

       SRCH_URL
              the format string used to define the option to re-search the database.

       REF_URL
              the  format  string  used  to  define the option to lookup the library sequence in entrez, or some
              other database.

AUTHOR

       Bill Pearson
       wrp@virginia.EDU

                                                      local                                    FASTS/TFASTSv3(1)