Provided by: cdbfasta_1.00+git20230710.da8f5ba+dfsg-1build1_amd64
NAME
cdbyank - Query an index file created with cdbfasta.
DESCRIPTION
Usage: cdbyank <index_file> [-d <fasta_file>] [-a <key>|-n|-l|-s] [-o <outfile>] [-q <char>|-Q][-F] [-R] [-P] [-x] [-w] [-z <dbfasta.cdbz> <index_file> is the index file created previously with cdbfasta (usually having a ".cidx" suffix) -a <key> the sequence name (accession) for a fasta record to be retrieved; if not given, a list of accessions is expected at stdin -d <fasta_file> is the fasta file to pull records from; if not specified, cdbyank will look in the same directory where <index_file> resides, for a file with the same name but without the ".cidx" suffix -o the records found are written to file <outfile> instead of stdout -x allows retrieval of multiple records per key, if the indexed database had records with the same key (non-unique keys); (without -x only one record for a given key is retrieved) -i case insensitive query (expects the <index_file> to have been created with cdbfasta -i option) -Q output the query key surrounded by character '%' before the corresponding record -q same as -Q but use character <char> instead of '%' -w enable warnings (sent to stderr) when a key is not found -F pulls only the defline for each record (discard the sequence) -P only displays the position(s) (file offset) within the database file, for the requested record(s) -R sequence range extraction: expects the input <key(s)> to have the format: '<seq_name> <start> <end>' and pulls only the specified sequence range -z decompress the entire file <dbfasta.cdbz> (assumes it was built using cdbfasta with '-z' option) -v show version number and exit Index file statistics (no database file needed): -n display the number of records indexed -l list all keys stored in <index_file> -s display indexing summary info