Provided by: fasta3_36.3.8i.14-Nov-2020-1_amd64
NAME
fastf3, fastf3_t - compare a mixed peptide sequence against a protein database using a modified fasta algorithm. tfastf3, tfastf3_t - compare a mixed pepide sequence against a translated DNA database.
DESCRIPTION
fastf3 and tfastf3 are designed to compare a sequence of mixed peptides to a protein (fastf3) or translated DNA (tfastf3) database. Unlike the traditional fasta3 search, which uses a protein or DNA sequence, fastf3 and tfastf3 work with a query sequence of the form: >testf from mgstm1 MGCEN, MIDYP, MLLAY, MLLGY testf MILGY-----------MLLEY-----------MGDAP----------- ::::: ::::: ::::: GT8.7 MPMILGYWNVRGLTHPIRMLLEYTDSSYDEKRYTMGDAPDFDRSQWLNEK 10 20 30 40 50 testf -------------------------------------------------- GT8.7 FKLGLDFPNLPYLIDGSHKITQSNAILRYLARKHHLDGETEEERIRADIV 60 70 80 90 100 20 testf ------------MLCYN ::::: GT8.7 ENQVMDTRMQLIMLCYNPDFEKQKPEFLKTIPEKMKLYSEFLGKRPWFAG 110 120 130 140 150
Options
fastf3 and tfastf3 can accept a query sequence from the unix "stdin" data stream. This makes it much easier to use fasta3 and its relatives as part of a WWW page. To indicate that stdin is to be used, use "-" or "@" as the query sequence file name. -b # number of best scores to show (must be < -E cutoff) -d # number of best alignments to show ( must be < -E cutoff) -D turn on debugging mode. Enables checks on sequence alphabet that cause problems with tfastx3, tfasty3, tfasta3. -E # Expectation value limit for displaying scores and alignments. Expectation values for fastf3 and tfastf3 are not as accurate as those for the other fasta3 programs. -H turn off histogram display -i compare against only the reverse complement of the library sequence. -L report long sequence description in alignments -m 0,1,2,3,4,5,6,10 alignment display options -n force query to nucleotide sequence -N # break long library sequences into blocks of # residues. Useful for bacterial genomes, which have only one sequence entry. -N 2000 works well for well for bacterial genomes. -O file send output to file -q/-Q quiet option; do not prompt for input -R file save all scores to statistics file -S # offset substitution matrix values by a constant # -s name specify substitution matrix. BLOSUM50 is used by default; PAM250, PAM120, and BLOSUM62 can be specified by setting -s P120, P250, or BL62. With this version, many more scoring matrices are available, including BLOSUM80 (BL80), and MDM_10, MDM_20, MDM_40 (M10, M20, M40). Alternatively, BLASTP1.4 format scoring matrix files can be specified. -T # (threaded, parallel only) number of threads or workers to use (set by default to 4 at compile time). -t # Translation table - tfastf3 can use the BLAST tranlation tables. See http://www.ncbi.nih.gov/htbin-post/Taxonomy/wprintgc?mode=c/. -w # line width for similarity score, sequence alignment, output. -x "#,#" offsets query, library sequence for numbering alignments -z # Specify statistical calculation. Default is -z 1, which uses regression against the length of the library sequence. -z 0 disables statistics. -z 2 uses the ln() length correction. -z 3 uses Altschul and Gish's statistical estimates for specific protein BLOSUM scoring matrices and gap penalties. -z 4: an alternate regression method. -Z db_size Set the apparent database size used for expectation value calculations. -1 Sort by "init1" score. -3 (TFASTF3 only) use only forward frame translations
Environment variables:
FASTLIBS location of library choice file (-l FASTLIBS) SMATRIX default scoring matrix (-s SMATRIX) SRCH_URL the format string used to define the option to re-search the database. REF_URL the format string used to define the option to lookup the library sequence in entrez, or some other database.
AUTHOR
Bill Pearson wrp@virginia.EDU local FASTF/TFASTFv3(1)