xenial (1) dsimulator.1.gz

Provided by: dazzdb_1.0-1_amd64 bug

NAME

       dsimulator - generate synthetic reads for a random genome

SYNOPSIS

       dsimulator genlen:double [-cdouble(20.)] [-bdouble(.5)] [-rint] [-mint(10000)] [-sint(2000)][-xint(4000)]
       [-edouble(.15)][-Mfile]

DESCRIPTION

       dsimulator first generates a fake genome of size genlen*1Mb long, that has an AT-bias  of  -b.   It  then
       generates  sample  reads  of mean length -m from a log-normal length distribution with standard deviation
       -s, but ignores reads of length less than -x.  It collects enough reads to cover the genome -c times  and
       introduces  -e fraction errors into each read where the ratio of insertions, deletions, and substitutions
       are set by defined constants INS_RATE (default 73%) and DEL_RATE (default 20%)  within  generate.c.   One
       can  also  control the rate at which reads are picked from the forward and reverse strands by setting the
       defined constant FLIP_RATE (default 50/50).  The -r option seeds the  random  number  generator  for  the
       generation of the genome so that one can reproducibly generate the same underlying genome to sample from.
       If this parameter is missing, then the job id of the invocation seeds the random number  generator.   The
       output  is  sent to the standard output (i.e.  it is a UNIX pipe).  The output is in Pacbio .fasta format
       suitable as input to fasta2DB(1).  Finally, the -M option requests that the coordinates from  which  each
       read  has  been  sampled are written to the indicated file, one line per read, ASCII encoded.  This "map"
       file essentially tells one where every read belongs in an assembly and is very useful for  debugging  and
       testing purposes.  If a read pair is say b,e then if b < e the read was sampled from [b,e] in the forward
       direction, and if b > e from [e,b] in the reverse direction.

SEE ALSO

       daligner(1)