Provided by: ncbi-tools-bin_6.1.20170106+dfsg1-9_amd64
NAME
cleanasn - clean up irregularities in NCBI ASN.1 objects
SYNOPSIS
cleanasn [-] [-A filename] [-B str] [-C str] [-D str] [-F str] [-K str] [-L filename] [-M filename] [-N str] [-O str] [-P str] [-Q str] [-R] [-S str] [-T] [-U str] [-V str] [-X str] [-Z str] [-a str] [-b] [-c] [-d str] [-f str] [-i filename] [-j filename] [-k filename] [-m str] [-n path] [-o filename] [-p path] [-q path] [-r path] [-v path] [-x ext]
DESCRIPTION
cleanasn is a utility program to clean up irregularities in NCBI ASN.1 objects.
OPTIONS
A summary of options is included below. - Print usage message -A filename Accession list file -B str Branch, per the flags in str: c Has coding regions d No coding regions p Passes validation q Validator errors or rejects r Only pop/phy/mut/eco/WGS sets s Exclude pop/phy/mut/eco/WGS sets t Only nuc-prot sets u Exclude nuc-prot sets v Only segmented sequences w Exclude segmented sequences x Only segmented proteins y Exclude segmented proteins -C str Sequence operations, per the flags in str: c Compress d Decompress l Recalculated segmented sequence length v Virtual gaps inside segmented sequence s Convert segmented set to delta sequence t Non-NucProt segmented set to delta sequence u Improved non-NucProt segmented set to delta sequence g Raw to delta by assembly gap m Merge assembly gap features -D str Clean up descriptors, per the flags in str: t Remove Title c Remove Comment n Remove Nuc-Prot Set title e Remove Pop/Phy/Mut/Eco Set title m Remove mRNA title p Remove Protein title a Title to name b AutoDef title or name x Prefix title with organism name -F str Clean up features, per the flags in str: u Remove User-objects d Remove db_xrefs e Remove /evidence and /inference g Fuse multi-interval genes i Fuse adjacent-interval imported features r Remove redundant gene xrefs f Fuse duplicate features s Package features on referenced Bioseq k Package coding-region or parts features z Delete or update EC numbers b Set Best coding-region reading frame x Retranslate coding regions a Adjust for missing stop codon -K str Perform a general cleanup, per the flags in str: b BasicSeqEntryCleanup p C++ BasicCleanup (via an external utility) v AdvancedSeqEntryCleanup s SeriousSeqEntryCleanup x ExtendedSeqEntryCleanup g GpipeSeqEntryCleanup n Normalize descriptor order u Remove NcbiCleanup User Objects c Synchronize genetic Codes f CDS partial from translation e Impose CDS partials d Resynchronize CDS partials m Resynchronize mRNA partials t Resynchronize Peptide partials a Adjust consensus splice i Promote to "worst" Seq-ID r Reassign local IDs l Remove locus -L filename Log file -M filename Macro file -N str Clean up links, per the flags in str: o Link CDS mRNA by Overlap p Link CDS mRNA by Product l Link CDS mRNA by Label and Location r Reassign feature IDs m Merge colliding feature IDs f Fix missing reciprocal feature IDs c Clear feature IDs -O str Missing prot-ref name -P str Publication options: a Remove All publications s Remove Serial number f Remove Figure, numbering, and name r Remove Remark u Update PMID-only publication j Lookup ISO Journal title abbreviation m Merge identical publication features # Replace unpublished with PMID -Q str Report: c Record count r ASN.1 BSEC report s ASN.1 SSEC report n NORM vs. SSEC report e PopPhyMutEco AutoDef report o Overlap report l Latitude-longitude country diff d Log SSEC differences g GenBank SSEC diff f asn2gb/asn2flat diff h Seg-to-delta GenBank diff v Validator SSEC diff m Modernize Gene/RNA/PCR u Unpublished Pub lookup p Published Pub lookup j Unindexed Journal report t tRNA anticodon report w Component offset report x Custom scan -R Remote fetching from ID (NCBI sequence databases) -S str Selective difference filter (capital letters skip) s SSEC b BSEC A Author p Publication l Location r RNA q Qualifier sort order g Genbank block k Package CdRegion or parts features m Move publication o Leave duplicate Bioseq publication d Automatic definition line e Pop/Phy/Mut/Eco Set definition line -T Taxonomy Lookup -U str Modernize, per the flags in str: g Genes r RNA p PCR Primers -V str Remove features by validator severity: r Reject e Error w Warning i Info -X str Miscellaneous options, per str: d Automatic definition line s Automatic definition line with Source qualifiers e Pop/Phy/Mut/Eco Set definition line n Instantiate NC title m Instantiate NM titles x Special XM titles p Instantiate Protein titles g GPipe instantiate titles c Create mRNAs for coding sequences f Fix reciprocal protein_id/transcript_id v Revert preRNA or ncRNA transcript_id t Parse anticodon from Sequence b Batch cleanup of multireader output z Wrap SegSet with NucProt set w GFF/WGS genome cleanup -Z str Remove indicated User-object -a str ASN.1 type a Any (default) e Seq-entry b Bioseq s Bioseq-set m Seq-submit t Batch Bioseq-set u Batch Seq-submit -b Input ASN.1 is Binary -c Input ASN.1 is Compressed -d str Source database a Any (default) g GenBank e EMBL d DDBJ b EMBL or DDBJ i INSD r RefSeq n NCBI x Exclude EMBL/DDBJ y Exclude gbcon, gbest, gbgss, gbhtg, gbpat, gbsts -f str Substring filter -i filename Single input file (defaults to stdin) -j filename First filename -k filename Last filename -m str Flatfile mode: r Release e Entrez s Sequin d Dump -n path asn2flat executable (default is /netopt/ncbi_tools/bin/asn2flat) -o filename Single output file (defaults to stdout) -p path Process all matching files in path -q path ffdiff executable (default is /netopt/genbank/subtool/bin/ffdiff) -r path Path for results -v path asnval executable (default is /netopt/ncbi_tools/bin/asnval) -x ext File selection suffix for use with -p (defaults to .ent)
AUTHOR
The National Center for Biotechnology Information.
SEE ALSO
asndisc(1), asnval(1), sequin(1).