Provided by: fasta3_36.3.8i.14-Nov-2020-1_amd64
NAME
fasts3, fasts3_t - compare several short peptide sequences against a protein database using a modified fasta algorithm.
tfasts3, tfasts3_t - compare short peptides against a translated DNA database.
DESCRIPTION
fasts3 and tfasts3 are designed to compare a set of (presumably non-contiguous) peptides to a protein (fasts3) or translated DNA (tfasts3) database. fasts3/tfasts3 are designed particularly for short peptide data from mass-spec analysis of protein digests. Unlike the traditional fasta3 search, which uses a protein or DNA sequence, fasts3 and tfasts3 work with a query sequence of the form:

    >tests from mgstm1
    MLLE,
    MILGYW,
    MGADP,
    MLCYNP

The peptides are aligned against a library sequence as in the following example:

    testf    MILGYW----------MLLE------------MGDAP-----------
             ::::::          ::::            :::::
    GT8.7  MPMILGYWNVRGLTHPIRMLLEYTDSSYDEKRYTMGDAPDFDRSQWLNEK
                   10        20        30        40        50

    testf  --------------------------------------------------
    GT8.7  FKLGLDFPNLPYLIDGSHKITQSNAILRYLARKHHLDGETEEERIRADIV
                   60        70        80        90       100

                          20
    testf  ------------MLCYNP
                       ::::::
    GT8.7  ENQVMDTRMQLIMLCYNPDFEKQKPEFLKTIPEKMKLYSEFLGKRPWFAG
                  110       120       130       140       150
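A query in the form shown above can be generated from a list of peptides. The following is a minimal Python sketch, not part of the fasta3 distribution: the function name format_fasts_query is hypothetical, and placing each comma-terminated peptide on its own line is an assumption about the layout.

```python
def format_fasts_query(title, peptides):
    """Return query text: a '>' header line, then comma-separated peptides.

    Assumption: each peptide is written on its own line, and every peptide
    except the last is followed by a comma.
    """
    cleaned = [p.strip().upper() for p in peptides if p.strip()]
    if not cleaned:
        raise ValueError("at least one peptide is required")
    for peptide in cleaned:
        # Reject anything that is not a plain run of letters.
        if not peptide.isalpha():
            raise ValueError(f"peptide contains non-letter characters: {peptide!r}")
    return f">{title}\n" + ",\n".join(cleaned) + "\n"


query = format_fasts_query("tests from mgstm1", ["MLLE", "MILGYW", "MGADP", "MLCYNP"])
print(query, end="")
```

The returned text can be written to a file or sent to the program on stdin.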
Options
fasts3 and tfasts3 can accept a query sequence from the unix "stdin" data stream. This makes it much easier to use fasta3 and its relatives as part of a WWW page. To indicate that stdin is to be used, use "-" or "@" as the query sequence file name.

-b #
    number of best scores to show (must be < -E cutoff)
-d #
    number of best alignments to show (must be < -E cutoff)
-D
    turn on debugging mode. Enables checks on sequence alphabet that cause problems with tfastx3, tfasty3, tfasta3.
-E #
    Expectation value limit for displaying scores and alignments. Expectation values for fasts3 and tfasts3 are not as accurate as those for the other fasta3 programs.
-H
    turn off histogram display
-i
    compare against only the reverse complement of the library sequence.
-L
    report long sequence description in alignments
-m 0,1,2,3,4,5,6,9,10
    alignment display options
-N #
    break long library sequences into blocks of # residues. Useful for bacterial genomes, which have only one sequence entry. -N 2000 works well for bacterial genomes.
-O file
    send output to file
-q/-Q
    quiet option; do not prompt for input
-R file
    save all scores to statistics file
-S #
    offset substitution matrix values by a constant #
-s name
    specify substitution matrix. BLOSUM50 is used by default; PAM120, PAM250, and BLOSUM62 can be specified by setting -s P120, P250, or BL62. With this version, many more scoring matrices are available, including BLOSUM80 (BL80), and MDM_10, MDM_20, MDM_40 (M10, M20, M40). Alternatively, BLASTP1.4 format scoring matrix files can be specified.
-T #
    (threaded, parallel only) number of threads or workers to use (set by default to 4 at compile time).
-t #
    Translation table - tfasts3 can use the BLAST translation tables. See http://www.ncbi.nih.gov/htbin-post/Taxonomy/wprintgc?mode=c/.
-w #
    line width for similarity score, sequence alignment, output.
-x "#,#"
    offsets query, library sequence for numbering alignments
-z #
    Specify statistical calculation. Default is -z 1, which uses regression against the length of the library sequence. -z 0 disables statistics. -z 2 uses the ln() length correction. -z 3 uses Altschul and Gish's statistical estimates for specific protein BLOSUM scoring matrices and gap penalties. -z 4 uses an alternate regression method.
-Z db_size
    Set the apparent database size used for expectation value calculations.
-3
    (TFASTS3 only) use only forward frame translations
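
The stdin convention and a few of the options above can be combined when driving fasts3 from a script. The following Python sketch is illustrative only: the helper names build_fasts3_command and run_search and the library file name library.fa are hypothetical, and the placement of the query name before the library name on the command line is an assumption not stated on this page.

```python
import shutil
import subprocess


def build_fasts3_command(library, evalue=None, matrix=None,
                         histogram=True, program="fasts3"):
    """Assemble an argument list that reads the query from stdin ("-")."""
    cmd = [program, "-q"]               # -q: quiet, do not prompt for input
    if not histogram:
        cmd.append("-H")                # -H: turn off histogram display
    if evalue is not None:
        cmd += ["-E", str(evalue)]      # -E: expectation value limit
    if matrix is not None:
        cmd += ["-s", matrix]           # -s: substitution matrix, e.g. BL62
    # Assumption: the query file name ("-" for stdin) precedes the library.
    cmd += ["-", library]
    return cmd


def run_search(query_text, library, **options):
    """Run the search, piping query_text to stdin; return the program output."""
    cmd = build_fasts3_command(library, **options)
    if shutil.which(cmd[0]) is None:
        raise FileNotFoundError(f"{cmd[0]} was not found on PATH")
    result = subprocess.run(cmd, input=query_text, text=True,
                            capture_output=True, check=True)
    return result.stdout


cmd = build_fasts3_command("library.fa", evalue=10, matrix="BL62", histogram=False)
print(" ".join(cmd))
```

Passing the arguments as a list rather than a single shell string avoids quoting problems when a library path contains spaces.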
Environment variables:
FASTLIBS
    location of library choice file (-l FASTLIBS)
SMATRIX
    default scoring matrix (-s SMATRIX)
SRCH_URL
    the format string used to define the option to re-search the database.
REF_URL
    the format string used to define the option to look up the library sequence in Entrez, or some other database.
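
When the programs are started from a script, these variables can be set for the child process only. The following Python sketch assumes that SMATRIX takes the same kind of value as -s; the helper name fasta_env and the example path are hypothetical.

```python
import os


def fasta_env(fastlibs=None, smatrix=None, base=None):
    """Return a copy of the environment with FASTLIBS/SMATRIX set if given."""
    # Copy so that the caller's environment is left untouched.
    env = dict(os.environ if base is None else base)
    if fastlibs is not None:
        env["FASTLIBS"] = fastlibs      # location of the library choice file
    if smatrix is not None:
        env["SMATRIX"] = smatrix        # default scoring matrix (assumed: as for -s)
    return env


base = {"PATH": "/usr/bin"}
env = fasta_env(fastlibs="/path/to/fastlibs", smatrix="BL62", base=base)
print(env["FASTLIBS"], env["SMATRIX"])
```

The resulting dictionary can be passed as the env argument of subprocess.run.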
AUTHOR
Bill Pearson
wrp@virginia.EDU

local FASTS/TFASTSv3(1)