Provided by: clonalframeml_1.12-2_amd64
NAME
ClonalFrameML - Efficient Inference of Recombination in Whole Bacterial Genomes
SYNOPSIS
ClonalFrameML newick_file fasta_file output_file [OPTIONS]
DESCRIPTION
ClonalFrameML is a software package that performs efficient inference of recombination in bacterial genomes. ClonalFrameML was created by Xavier Didelot and Daniel Wilson. ClonalFrameML can be applied to any type of aligned sequence data, but is especially aimed at analysis of whole genome sequences. It is able to compare hundreds of whole genomes in a matter of hours on a standard Desktop computer. There are three main outputs from a run of ClonalFrameML: a phylogeny with branch lengths corrected to account for recombination, an estimation of the key parameters of the recombination process, and a genomic map of where recombination took place for each branch of the phylogeny. ClonalFrameML is a maximum likelihood implementation of the Bayesian software ClonalFrame which was previously described by Didelot and Falush (2007). The recombination model underpinning ClonalFrameML is exactly the same as for ClonalFrame, but this new implementation is a lot faster, is able to deal with much larger genomic dataset, and does not suffer from MCMC convergence issues
OPTIONS
Options specifying the analysis type -em true (default) or false Estimate parameters by a Baum-Welch expectation maximization algorithm. -embranch true or false (default) Estimate parameters for each branch using the EM algorithm. -rescale_no_recombination true or false (default) Rescale branch lengths for given sites with no recombination model. -imputation_only true or false (default) Perform only ancestral state reconstruction and imputation. Options affecting all analyses -kappa value > 0 (default 2.0) Relative rate of transitions vs transversions in substitution model -fasta_file_list true or false (default) Take fasta_file to be a white-space separated file list. -xmfa_file true or false (default) Take fasta_file to be an XMFA file. -ignore_user_sites sites_file Ignore sites listed in whitespace-separated sites_file. -ignore_incomplete_sites true or false (default) Ignore sites with any ambiguous bases. -use_incompatible_sites true (default) or false Use homoplasious and multiallelic sites to correct branch lengths. -show_progress true or false (default) Output the progress of the maximum likelihood routines. -chromosome_name name, eg "chr" Output importation status file in BED format using given chromosome name. -min_branch_length value > 0 (default 1e-7) Minimum branch length. -reconstruct_invariant_sites true or false (default) Reconstruct the ancestral states at invariant sites. -label_uncorrected_tree true or false (default) Regurgitate the uncorrected Newick tree with internal nodes labelled. Options affecting -em and -embranch: -prior_mean df "0.1 0.001 0.1 0.0001" Prior mean for R/theta, 1/delta, nu and M. -prior_sd df "0.1 0.001 0.1 0.0001" Prior standard deviation for R/theta, 1/delta, nu and M. -initial_values default "0.1 0.001 0.05" Initial values for R/theta, 1/delta and nu. -guess_initial_m true (default) or false Initialize M and nu jointly in the EM algorithms. -emsim value >= 0 (default 0) Number of simulations to estimate uncertainty in the EM results. -embranch_dispersion value > 0 (default .01) Dispersion in parameters among branches in the -embranch model. Options affecting -rescale_no_recombination: -brent_tolerance tolerance (default .001) Set the tolerance of the Brent routine for -rescale_no_recombination. -powell_tolerance tolerance (default .001) Set the tolerance of the Powell routine for -rescale_no_recombination.
AUTHOR
This manpage was written by Andreas Tille for the Debian distribution and can be used for any other usage of the program.