BEST Bayesian

From Research Computing Center Wiki
Jump to navigation Jump to search

Category

Bioinformatics

Program On

zcluster

Version

2.3.1

Author / Distributor

Liang Liu

Description

BEST website

Running Program

BEST is a free phylogenetics program to estimate the joint posterior distribution of gene trees and species tree using multilocus molecular data that accounts for deep coalescence but not for other issues such as horizontal transfer or gene duplication.

This code works as a modification of the popular phylogenetics software package MrBayes

mbbest, /usr/local/BEST/latest/mbbest are always pointed to the latest update version.

Version: /usr/local/BEST/2.3.1/mbbest /usr/local/BEST/2.2/mbbest

Example of interactive program. Please refer to here about running jobs at interactive nodes.

qlogin
mbbest
...
exit

run the mpirun command with:


Below is an example file defines the data. Save this file as e.x. yh.nex

cp /usr/local/mrbayes/primates.nex yh.nex


Content of yh.nex is:

#NEXUS
begin data;
dimensions ntax=12 nchar=898;
format datatype=dna interleave=no gap=-;
matrix
Tarsius_syrichta AAGTTTCATTGGAGCCACCACTCTTATAATTGCCCATGGCCTCACCTCCTCCCTATTATTTTGC
TACGAACGAGTCCACAGTCGAACAATAGCACTAGCCCGTGGCCTTCAAACCCTATTACCTCTTGCAGCAACATGA
.... 
end;


Define all the parameters in the .par file, e.x. yh.par. Notice the yh.nex is a data file defined above. This example performs three single-run analyses of the data yh.nex.

set autoclose=yes nowarn=yes;
execute yh.nex;
lset nst=6 rates=gamma;
mcmc nruns=1 ngen=10000 samplefreq=10 file=yh.nex1;
mcmc file=yh.nex2;
mcmc file=yh.nex3;


Below is a script subp.sh to the batch queue, yh.par is the file defined above

#!/bin/bash
cd my-working-dir
/usr/local/mpich2/1.4/gcc_4.1.2/bin/mpirun -np $NSLOTS /usr/local/BEST/latest/mbbest < yh.par > myout

Then submit all these to the queue as

qsub -q queuename -pe mpi 2 ./subp.sh

the -pe mpi 2 is number of processors for the MPIRUN, it has to be matched with the number of processors (-np) in the subp.sh.

Documentation

Online tutorial available at BEST website.

Commands that are available from the command                                  
   line or from a MrBayes block include:                                         
                                                                                 
   About           -- Describes the program
   Acknowledgments -- Shows program acknowledgments
   Charset         -- Assigns a group of sites to a set
   Charstat        -- Shows status of characters
   Citations       -- Appropriate citation of program
   Comparetree     -- Compares the trees from two tree files
   Constraint      -- Defines a constraint on tree topology
   Ctype           -- Assigns ordering for the characters
   Databreaks      -- Defines nucleotide pairs (doublets) for stem models
   Delete          -- Deletes taxa from the analysis
   Deroot          -- Deroots user tree
   Disclaimer      -- Describes program disclaimer
   Exclude         -- Excludes sites from the analysis
   Execute         -- Executes a file
   Help            -- Provides detailed description of commands
   Include         -- Includes sites
   Link            -- Links parameters across character partitions
   Log             -- Logs screen output to a file
   Lset            -- Sets the parameters of the likelihood model
   Manual          -- Prints a command reference to a text file
   Mcmc            -- Starts Markov chain Monte Carlo analysis
   Mcmcp           -- Sets the parameters of a chain (without starting analysis)
   Outgroup        -- Changes outgroup taxon
   Pairs           -- Defines nucleotide pairs (doublets) for stem models
   Partition       -- Assigns a character partition
   Plot            -- Plots parameters from MCMC analysis
   Prset           -- Sets the priors for the parameters
   Props           -- Set proposal probabilities
   Quit            -- Quits the program
   Report          -- Controls how model parameters are reported
   Restore         -- Restores taxa
   Root            -- Roots user tree
   Set             -- Sets run conditions and defines active data partition
   Showmatrix      -- Shows current character matrix
   Showmodel       -- Shows model settings
   Showtree        -- Shows user tree
   Sump            -- Summarizes parameters from MCMC analysis
   Sumt            -- Summarizes trees from MCMC analysis
   Taxastat        -- Shows status of taxa
   Taxset          -- Assigns a group of taxa to a set
   Unlink          -- Unlinks parameters across character partitions
   Usertree        -- Defines a single user tree
   Version         -- Shows program version
                                                                                 
   Commands that should be in a NEXUS file (data                                 
   block or trees block) include:                                                
                                                                                 
   Begin           -- Denotes beginning of block in file
   Dimensions      -- Defines size of character matrix
   End             -- Denotes end of a block in file
   Endblock        -- Alternative way of denoting end of a block
   Format          -- Defines character format in data block
   Matrix          -- Defines matrix of characters in data block
   Translate       -- Defines alternative names for taxa
   Tree            -- Defines a tree from MCMC analysis
                                                                                 
   Note that this program supports the use of the shortest unambiguous           
   spelling of the above commands (e.g., "exe" instead of "execute"). 

Installation

Source downloaded from BEST website.

System

64-bit Linux