oracular (3) Bio::Tools::Run::Phylo::Phyml.3pm.gz
NAME
Bio::Tools::Run::Phylo::Phyml - Wrapper for rapid reconstruction of phylogenies using Phyml
SYNOPSIS
use Bio::Tools::Run::Phylo::Phyml; # Make a Phyml factory $factory = Bio::Tools::Run::Phylo::Phyml->new(-verbose => 2); # it defaults to protein alignment # change parameters $factory->model('Dayhoff'); # Pass the factory an alignment and run $inputfilename = 't/data/protpars.phy'; $tree = $factory->run($inputfilename); # $tree is a Bio::Tree::Tree object. # or set parameters at object creation my %args = ( -data_type => 'dna', -model => 'HKY', -kappa => 4, -invar => 'e', -category_number => 4, -alpha => 'e', -tree => 'BIONJ', -opt_topology => '0', -opt_lengths => '1', ); $factory = Bio::Tools::Run::Phylo::Phyml->new(%args); # if you need the output files do $factory->save_tempfiles(1); $factory->tempdir($workdir); # and get a Bio::Align::AlignI (SimpleAlign) object from somewhere $tree = $factory->run($aln);
DESCRIPTION
This is a wrapper for running the phyml application by Stephane Guindon and Olivier Gascuel. You can download it from: http://atgc.lirmm.fr/phyml/ Installing After downloading, you need to rename a the copy of the program that runs under your operating system. I.e. "phyml_linux" into "phyml". You will need to help this Phyml wrapper to find the "phyml" program. This can be done in (at least) three ways: 1. Make sure the Phyml executable is in your path. Copy it to, or create a symbolic link from a directory that is in your path. 2. Define an environmental variable PHYMLDIR which is a directory which contains the 'phyml' application: In bash: export PHYMLDIR=/home/username/phyml_v2.4.4/exe In csh/tcsh: setenv PHYMLDIR /home/username/phyml_v2.4.4/exe 3. Include a definition of an environmental variable PHYMLDIR in every script that will use this Phyml wrapper module, e.g.: BEGIN { $ENV{PHYMLDIR} = '/home/username/phyml_v2.4.4/exe' } use Bio::Tools::Run::Phylo::Phyml; Running This wrapper has been tested with PHYML v2.4.4 and v.3.0. It may work with recent Phyml releases using a date format for the format, but the wrapper hasn't been extensively tested in these cases, so for the moment only the simpler numbered versions are supported. In its current state, the wrapper supports only input of one MSA and output of one tree. It can easily be extended to support more advanced capabilities of "phyml". Two convienience methods have been added on top of the standard BioPerl WrapperBase ones: stats() and tree_string(). You can call them to after running the phyml program to retrieve into a string the statistics and the tree in Newick format.
FEEDBACK
Mailing Lists User feedback is an integral part of the evolution of this and other Bioperl modules. Send your comments and suggestions preferably to the Bioperl mailing list. Your participation is much appreciated. bioperl-l@bioperl.org - General discussion http://bioperl.org/wiki/Mailing_lists - About the mailing lists Support Please direct usage questions or support issues to the mailing list: bioperl-l@bioperl.org rather than to the module maintainer directly. Many experienced and reponsive experts will be able look at the problem and quickly address it. Please include a thorough description of the problem with code and data examples if at all possible. Reporting Bugs Report bugs to the Bioperl bug tracking system to help us keep track of the bugs and their resolution. Bug reports can be submitted via the web: http://bugzilla.open-bio.org/
AUTHOR - Heikki Lehvaslaiho
heikki at bioperl dot org
APPENDIX
The rest of the documentation details each of the object methods. Internal methods are usually preceded with a _ new Title : new Usage : $factory = Bio::Tools::Run::Phylo::Phyml->new(@params) Function: creates a new Phyml factory Returns : Bio::Tools::Run::Phylo::Phyml Args : Optionally, provide any of the following (default in []): -data_type => 'dna' or 'protein', [protein] -dataset_count => integer, [1] -model => 'HKY'... , [HKY|JTT] -kappa => 'e' or float, [e] -invar => 'e' or float, [e] -category_number => integer, [1] -alpha => 'e' or float (int v3),[e] -tree => 'BIONJ' or your own, [BION] -bootstrap => integer [123] -opt_topology => boolean [1] -opt_lengths => boolean [1] -no_memory_check => boolean [1] -program_name => string program_name Title : program_name Usage : $factory>program_name() Function: holds the program name Returns : string Args : None program_dir Title : program_dir Usage : $factory->program_dir(@params) Function: returns the program directory, obtained from ENV variable. Returns : string Args : None version Title : version Usage : exit if $prog->version < 1.8 Function: Determine the version number of the program Example : Returns : float or undef Args : none Phyml before 3.0 did not display the version. Assume 2.44 when can not determine it. Some releases do not state version number, only date, so the version might have to be inferred from this date. run Title : run Usage : $factory->run($aln_file); $factory->run($align_object); Function: Runs Phyml to generate a tree Returns : Bio::Tree::Tree object Args : file name for your input alignment in a format recognised by AlignIO, OR Bio::Align::AlignI compliant object (eg. Bio::SimpleAlign). stats Title : stats Usage : $factory->stats; Function: Returns the contents of the phyml '_phyml_stat.txt' output file Returns : string with statistics about the run, undef before run() Args : none tree_string Title : tree_string Usage : $factory->tree_string; $factory->run($align_object); Function: Returns the contents of the phyml '_phyml_tree.txt' output file Returns : string with tree in Newick format, undef before run() Args : none Getsetters These methods are used to set and get program parameters before running. data_type Title : data_type Usage : $phyml->data_type('nt'); Function: Sets sequence alphabet to 'dna' (nt in v3) or 'aa' If leaved unset, will be set automatically Returns : set value, defaults to 'protein' Args : None to get, 'dna' ('nt') or 'aa' to set. data_format Title : data_format Usage : $phyml->data_format('s'); Function: Sets PHYLIP format to 'i' interleaved or 's' sequential Returns : set value, defaults to 'i' Args : None to get, 'i' or 's' to set. dataset_count Title : dataset_count Usage : $phyml->dataset_count(3); Function: Sets dataset number to deal with Returns : set value, defaults to 1 Args : None to get, positive integer to set. model Title : model Usage : $phyml->model('HKY'); Function: Choose the substitution model to use. One of JC69 | K2P | F81 | HKY | F84 | TN93 | GTR (DNA) JTT | MtREV | Dayhoff | WAG (amino acids) v3.0: HKY85 (default) | JC69 | K80 | F81 | F84 | TN93 | GTR (DNA) LG (default) | WAG | JTT | MtREV | Dayhoff | DCMut | RtREV | CpREV | VT | Blosum62 | MtMam | MtArt | HIVw | HIVb (amino acids) Returns : Name of the model, v2.4.4 defaults to {HKY|JTT} Args : None to get, string to set. kappa Title : kappa Usage : $phyml->kappa(4); Function: Sets transition/transversion ratio, leave unset to estimate Returns : set value, defaults to 'e' Args : None to get, float or integer to set. invar Title : invar Usage : $phyml->invar(.3); Function: Sets proportion of invariable sites, leave unset to estimate Returns : set value, defaults to 'e' Args : None to get, float or integer to set. category_number Title : category_number Usage : $phyml->category_number(4); Function: Sets number of relative substitution rate categories Returns : set value, defaults to 1 Args : None to get, integer to set. alpha Title : alpha Usage : $phyml->alpha(1.0); Function: Sets gamma distribution parameter, leave unset to estimate Returns : set value, defaults to 'e' Args : None to get, float or integer to set. tree Title : tree Usage : $phyml->tree('/tmp/tree.nwk'); Function: Sets starting tree, leave unset to estimate a distance tree Returns : set value, defaults to 'BIONJ' Args : None to get, newick tree file name to set. v2 options These methods can be used with PhyML v2* only. opt_topology Title : opt_topology Usage : $factory->opt_topology(1); Function: Choose to optimise the tree topology Returns : 1 or 0. Default is 1. Args : None to get, boolean to set. v2.* only opt_lengths Title : opt_lengths Usage : $factory->opt_lengths(0); Function: Choose to optimise branch lengths and rate parameters Returns : 1 or 0. Default is 1. Args : None to get, boolean to set. v2.* only v3 options These methods can be used with PhyML v3* only. freq Title : freq Usage : $phyml->freq(e); $phyml->freq("0.2, 0.6, 0.6, 0.2"); Function: Sets nucleotide frequences or asks residue to be estimated according to two models: e or d Returns : set value, Args : None to get, string to set. v3 only. opt Title : opt Usage : $factory->opt(1); Function: Optimise tree parameters: tlr|tl|tr|l|n Returns : {value|n} (default n) Args : None to get, string to set. v3.* only search Title : search Usage : $factory->search(SPR); Function: Tree topology search operation algorithm: NNI|SPR|BEST Returns : string (defaults to NNI) Args : None to get, string to set. v3.* only rand_start Title : rand_start Usage : $factory->rand_start(1); Function: Sets the initial SPR tree to random. Returns : boolean (defaults to false) Args : None to get, boolean to set. v3.* only; only meaningful if $prog->search is 'SPR' rand_starts Title : rand_starts Usage : $factory->rand_starts(10); Function: Sets the number of initial random SPR trees Returns : integer (defaults to 1) Args : None to get, integer to set. v3.* only; only valid if $prog->search is 'SPR' rand_seed Title : rand_seed Usage : $factory->rand_seed(1769876); Function: Seeds the random number generator Returns : random integer Args : None to get, integer to set. v3.* only; only valid if $prog->search is 'SPR' Uses perl rand() to initialize if not explicitely set. no_memory_check Title : no_memory_check Usage : $factory->no_memory_check(1); Function: Returns : boolean (defaults to false) Args : None to get, integer to set. bootstrap Title : bootstrap Usage : $factory->bootstrap(100); Function: Set number of bootstraps Returns : Args : None to get, integer to set. command Title : command Usage : $factory->command(...); Function: Returns : string Args : None to get, integer to set. Internal methods These methods are private and should not be called outside this class. _setparams Title : _setparams Usage : Internal function, not to be called directly Function: Creates a string of params to be used in the command string Returns : string of params Args : none _write_phylip_align_file Title : _write_phylip_align_file Usage : obj->__write_phylip_align_file($aln) Function: Internal (not to be used directly) Writes the alignment into the tmp directory in PHYLIP interlieved format Returns : filename Args : Bio::Align::AlignI