Ubuntu Manpage: split_fasta - Split a fasta file according to sequence and character counts

Provided by: coop-computing-tools_7.14.5-1build1_amd64

NAME

       split_fasta - Split a fasta file according to sequence and character counts

SYNOPSIS

       split_fasta query_granularity character_granularity fasta_file

DESCRIPTION

       split_fasta  is  a simple script to split a fasta file according to user provided parameters.  The script
       iterates over the given file, generating a new sub_file called input.i each  time  the  contents  of  the
       previous  file  (input.(i-1))  exceed  the  number of queries given by query_granularity or the number of
       characters given by character_granularity.

OPTIONS

EXIT STATUS

       On success, returns zero.  On failure, returns non-zero.

ENVIRONMENT VARIABLES

EXAMPLES

       To split a fasta file smallpks.fa into pieces no larger than 500 queries  and  with  no  piece  receiving
       additional sequences if it exceeds 10000 characters we would do:

               python split_fasta 500 10000 smallpks.fa

       This  would  generate  files  input.0,  input.1, ..., input.N where N is the number of appropriately con‐
       strained files necessary to contain all sequences in smallpks.fa.

COPYRIGHT

       The Cooperative Computing Tools are Copyright (C) 2022 The University of Notre Dame.   This  software  is
       distributed under the GNU General Public License.  See the file COPYING for details.

       •   Cooperative Computing Tools Documentation

       •   Makeflow User Manual

       •   makeflow(1)  makeflow_monitor(1)  makeflow_analyze(1) makeflow_viz(1) makeflow_graph_log(1) starch(1)
           makeflow_ec2_setup(1) makeflow_ec2_cleanup(1)

CCTools 7.14.5 FINAL                                                                              split_fasta(1)

NAME

SYNOPSIS

DESCRIPTION

OPTIONS

EXIT STATUS

ENVIRONMENT VARIABLES

EXAMPLES

COPYRIGHT

SEE ALSO