IQ-Tree-Sapelo2: Difference between revisions

From Research Computing Center Wiki
Jump to navigation Jump to search
(Created page with "Category:TeachingCategory:SoftwareCategory:Bioinformatics === Category === Bioinformatics === Program On === Sapelo2 === Version === 1.6.12, 2.2.2.6 ===Author / Distributor=== Please see http://www.iqtree.org/ ===Description=== Efficient phylogenomic software by maximum likelihood. More information: http://www.iqtree.org/ === Running Program === *Version 2.2.2.6, installed in /apps/eb/IQ-TREE/2.2.2.6-gompi-2022a To use this version of IQ-TREE,...")
 
No edit summary
 
Line 11: Line 11:
=== Version ===
=== Version ===


1.6.12, 2.2.2.6
2.2.2.7, 2.3.5, 2.3.6


===Author / Distributor===
===Author / Distributor===
Line 23: Line 23:
=== Running Program ===
=== Running Program ===
   
   
*Version 2.2.2.6, installed in /apps/eb/IQ-TREE/2.2.2.6-gompi-2022a
*Version 2.2.2.7, installed in /apps/eb/IQ-TREE/2.2.2.7-gompi-2023a


To use this version of IQ-TREE, please first load the module with
To use this version of IQ-TREE, please first load the module with
<pre class="gscript">
<pre class="gscript">
module load IQ-TREE/2.2.2.6-gompi-2022a
module load IQ-TREE/2.2.2.7-gompi-2023a
</pre>
</pre>
*Version 1.6.12, installed in /apps/eb/IQ-TREE/1.6-12-foss-2019b
 
*Version 2.3.5, installed in /apps/eb/IQ-TREE/2.3.5-gompi-2023a
 
To use this version of IQ-TREE, please first load the module with
<pre class="gscript">
module load IQ-TREE/2.3.5-gompi-2023a
</pre>
 
*Version 2.3.6, installed in /apps/eb/IQ-TREE/2.3.6-gompi-2023a


To use this version of IQ-TREE, please first load the module with
To use this version of IQ-TREE, please first load the module with
<pre class="gscript">
<pre class="gscript">
module load IQ-TREE/1.6.12-foss-2019b
module load IQ-TREE/2.3.6-gompi-2023a
</pre>  
</pre>


'''Please note:''' The name of the executable for IQ-TREE version 2+ is iqtree2 and iqtree2-mpi (MPI version).




Sample job submission script (sub.sh) to run IQ-Tree version. 1.6.12:
Sample job submission script (sub.sh) to run IQ-TREE version 2.3.6:


<div class="gscript2">
<div class="gscript2">
Line 47: Line 56:
<nowiki>#</nowiki>SBATCH --mail-user=<u>username@uga.edu</u><br>   
<nowiki>#</nowiki>SBATCH --mail-user=<u>username@uga.edu</u><br>   
<nowiki>#</nowiki>SBATCH --ntasks=<u>1</u><br>
<nowiki>#</nowiki>SBATCH --ntasks=<u>1</u><br>
<nowiki>#</nowiki>SBATCH --cpus-per-task=<u>4</u><br>
<nowiki>#</nowiki>SBATCH --mem=<u>2gb</u><br>     
<nowiki>#</nowiki>SBATCH --mem=<u>2gb</u><br>     
<nowiki>#</nowiki>SBATCH --time=<u>04:00:00</u><br>   
<nowiki>#</nowiki>SBATCH --time=<u>04:00:00</u><br>   
Line 54: Line 64:
cd $SLURM_SUBMIT_DIR<br>
cd $SLURM_SUBMIT_DIR<br>


module load IQ-TREE/1.6.12-foss-2019b<br>     
module load IQ-TREE/2.3.6-gompi-2023a<br>     
iqtree <u>[options]</u><br>   
iqtree2 <u>[options]</u><br>   
</div>
</div>


In the real submission script, at least all the above underlined values need to be reviewed or to be replaced by the proper values.
In the real submission script, at least all the above underlined values need to be reviewed or to be replaced by the proper values. If you request multiple cores per tasks in order to have iqtree2 run with multiple threads, then please use the -T option to specify how many threads iqtree2 should use. We recommend setting the number of threads to match the number of cores requested, e.g. '''-T 4''' in this example. 
 


Please refer to [[Running Jobs on Sapelo2]] for more information on running jobs on the Sapelo2 cluster.
Please refer to [[Running Jobs on Sapelo2]] for more information on running jobs on the Sapelo2 cluster.
Line 70: Line 81:


<pre class="gcomment">
<pre class="gcomment">
module load IQ-TREE/1.6.12-foss-2019b
[shtsai@b1-24 ~]$ ml IQ-TREE/2.3.6-gompi-2023a
[shtsai@b1-24 ~]$ iqtree2 -h
IQ-TREE multicore version 2.3.6 for Linux x86 64-bit built May  5 2025
Developed by Bui Quang Minh, Nguyen Lam Tung, Olga Chernomor, Heiko Schmidt,
Dominik Schrempf, Michael Woodhams, Ly Trong Nhan, Thomas Wong


iqtree -h
Usage: iqtree [-s ALIGNMENT] [-p PARTITION] [-m MODEL] [-t TREE] ...
IQ-TREE multicore version 1.6.12 for Linux 64-bit built Jul  9 2020
Developed by Bui Quang Minh, Nguyen Lam Tung, Olga Chernomor,
Heiko Schmidt, Dominik Schrempf, Michael Woodhams.


Usage: iqtree -s <alignment> [OPTIONS]
GENERAL OPTIONS:
  -h, --help          Print (more) help usages
  -s FILE[,...,FILE]  PHYLIP/FASTA/NEXUS/CLUSTAL/MSF alignment file(s)
  -s DIR              Directory of alignment files
  --seqtype STRING    BIN, DNA, AA, NT2AA, CODON, MORPH (default: auto-detect)
  -t FILE|PARS|RAND    Starting tree (default: 99 parsimony and BIONJ)
  -o TAX[,...,TAX]    Outgroup taxon (list) for writing .treefile
  --prefix STRING      Prefix for all output files (default: aln/partition)
  --seed NUM          Random seed number, normally used for debugging purpose
  --safe              Safe likelihood kernel to avoid numerical underflow
  --mem NUM[G|M|%]     Maximal RAM usage in GB | MB | %
  --runs NUM          Number of indepedent runs (default: 1)
  -v, --verbose        Verbose mode, printing more messages to screen
  -V, --version        Display version number
  --quiet              Quiet mode, suppress printing to screen (stdout)
  -fconst f1,...,fN    Add constant patterns into alignment (N=no. states)
  --epsilon NUM        Likelihood epsilon for parameter estimate (default 0.01)
  -T NUM|AUTO          No. cores/threads or AUTO-detect (default: 1)
  --threads-max NUM    Max number of threads for -T AUTO (default: all cores)


GENERAL OPTIONS:
CHECKPOINT:
   -? or -h            Print this help dialog
   --redo              Redo both ModelFinder and tree search
   -version            Display version number
   --redo-tree         Restore ModelFinder and only redo tree search
  -s <alignment>      Input alignment in PHYLIP/FASTA/NEXUS/CLUSTAL/MSF format
   --undo              Revoke finished run, used when changing some options
  -st <data_type>      BIN, DNA, AA, NT2AA, CODON, MORPH (default: auto-detect)
   --cptime NUM        Minimum checkpoint interval (default: 60 sec and adapt)
  -q <partition_file>  Edge-linked partition model (file in NEXUS/RAxML format)
-spp <partition_file> Like -q option but allowing partition-specific rates
  -sp <partition_file> Edge-unlinked partition model (like -M option of RAxML)
  -t <start_tree_file> or -t BIONJ or -t RANDOM
                      Starting tree (default: 99 parsimony tree and BIONJ)
  -te <user_tree_file> Like -t but fixing user tree (no tree search performed)
   -o <outgroup_taxon>  Outgroup taxon name for writing .treefile
  -pre <PREFIX>        Prefix for all output files (default: aln/partition)
  -nt <num_threads>    Number of cores/threads or AUTO for automatic detection
  -ntmax <max_threads> Max number of threads by -nt AUTO (default: #CPU cores)
  -seed <number>      Random seed number, normally used for debugging purpose
  -v, -vv, -vvv        Verbose mode, printing more messages to screen
  -quiet              Quiet mode, suppress printing to screen (stdout)
   -keep-ident          Keep identical sequences (default: remove & finally add)
  -safe                Safe likelihood kernel to avoid numerical underflow
  -mem RAM            Maximal RAM usage for memory saving mode
  --runs NUMBER        Number of indepedent runs (default: 1)


CHECKPOINTING TO RESUME STOPPED RUN:
PARTITION MODEL:
   -redo                Redo analysis even for successful runs (default: resume)
   -p FILE|DIR          NEXUS/RAxML partition file or directory with alignments
   -cptime <seconds>    Minimum checkpoint time interval (default: 60 sec)
                      Edge-linked proportional partition model
  -q FILE|DIR          Like -p but edge-linked equal partition model
  -Q FILE|DIR          Like -p but edge-unlinked partition model
  -S FILE|DIR          Like -p but separate tree inference
  --subsample NUM      Randomly sub-sample partitions (negative for complement)
   --subsample-seed NUM Random number seed for --subsample


LIKELIHOOD MAPPING ANALYSIS:
LIKELIHOOD/QUARTET MAPPING:
   -lmap <#quartets>    Number of quartets for likelihood mapping analysis
   --lmap NUM          Number of quartets for likelihood mapping analysis
   -lmclust <clustfile> NEXUS file containing clusters for likelihood mapping
   --lmclust FILE      NEXUS file containing clusters for likelihood mapping
   -wql                Print quartet log-likelihoods to .quartetlh file
   --quartetlh          Print quartet log-likelihoods to .quartetlh file


NEW STOCHASTIC TREE SEARCH ALGORITHM:
TREE SEARCH ALGORITHM:
   -ninit <number>      Number of initial parsimony trees (default: 100)
   --ninit NUM          Number of initial parsimony trees (default: 100)
   -ntop <number>      Number of top initial trees (default: 20)
   --ntop NUM          Number of top initial trees (default: 20)
   -nbest <number>      Number of best trees retained during search (defaut: 5)
   --nbest NUM          Number of best trees retained during search (defaut: 5)
   -n <#iterations>    Fix number of iterations to stop (default: auto)
   -n NUM              Fix number of iterations to stop (default: OFF)
   -nstop <number>      Number of unsuccessful iterations to stop (default: 100)
   --nstop NUM          Number of unsuccessful iterations to stop (default: 100)
   -pers <proportion>  Perturbation strength for randomized NNI (default: 0.5)
   --perturb NUM        Perturbation strength for randomized NNI (default: 0.5)
   -sprrad <number>    Radius for parsimony SPR search (default: 6)
   --radius NUM        Radius for parsimony SPR search (default: 6)
   -allnni             Perform more thorough NNI search (default: off)
   --allnni             Perform more thorough NNI search (default: OFF)
   -g <constraint_tree> (Multifurcating) topological constraint tree file
   -g FILE              (Multifurcating) topological constraint tree file
   -fast               Fast search to resemble FastTree
   --fast               Fast search to resemble FastTree
  --polytomy          Collapse near-zero branches into polytomy
  --tree-fix          Fix -t tree (no tree search performed)
  --treels            Write locally optimal trees into .treels file
  --show-lh            Compute tree likelihood without optimisation
  --terrace            Check if the tree lies on a phylogenetic terrace


ULTRAFAST BOOTSTRAP:
ULTRAFAST BOOTSTRAP/JACKKNIFE:
   -bb <#replicates>   Ultrafast bootstrap (>=1000)
   -B, --ufboot NUM    Replicates for ultrafast bootstrap (>=1000)
   -bsam GENE|GENESITE Resample GENE or GENE+SITE for partition (default: SITE)
  -J, --ufjack NUM    Replicates for ultrafast jackknife (>=1000)
   -wbt                Write bootstrap trees to .ufboot file (default: none)
   --jack-prop NUM      Subsampling proportion for jackknife (default: 0.5)
   -wbtl               Like -wbt but also writing branch lengths
  --sampling STRING    GENE|GENESITE resampling for partitions (default: SITE)
   -nm <#iterations>    Maximum number of iterations (default: 1000)
   --boot-trees        Write bootstrap trees to .ufboot file (default: none)
   -nstep <#iterations> #Iterations for UFBoot stopping rule (default: 100)
   --wbtl               Like --boot-trees but also writing branch lengths
   -bcor <min_corr>    Minimum correlation coefficient (default: 0.99)
   --nmax NUM          Maximum number of iterations (default: 1000)
   -beps <epsilon>      RELL epsilon to break tie (default: 0.5)
   --nstep NUM          Iterations for UFBoot stopping rule (default: 100)
   -bnni               Optimize UFBoot trees by NNI on bootstrap alignment
   --bcor NUM          Minimum correlation coefficient (default: 0.99)
  -j <jackknife>      Proportion of sites for jackknife (default: NONE)
   --beps NUM          RELL epsilon to break tie (default: 0.5)
   --bnni               Optimize UFBoot trees by NNI on bootstrap alignment


STANDARD NON-PARAMETRIC BOOTSTRAP:
NON-PARAMETRIC BOOTSTRAP/JACKKNIFE:
   -b <#replicates>    Bootstrap + ML tree + consensus tree (>=100)
   -b, --boot NUM      Replicates for bootstrap + ML tree + consensus tree
   -bc <#replicates>    Bootstrap + consensus tree
  -j, --jack NUM      Replicates for jackknife + ML tree + consensus tree
   -bo <#replicates>    Bootstrap only
  --jack-prop NUM      Subsampling proportion for jackknife (default: 0.5)
   --bcon NUM          Replicates for bootstrap + consensus tree
   --bonly NUM          Replicates for bootstrap only
  --tbe                Transfer bootstrap expectation


SINGLE BRANCH TEST:
SINGLE BRANCH TEST:
   -alrt <#replicates>  SH-like approximate likelihood ratio test (SH-aLRT)
   --alrt NUM          Replicates for SH approximate likelihood ratio test
   -alrt 0             Parametric aLRT test (Anisimova and Gascuel 2006)
   --alrt 0             Parametric aLRT test (Anisimova and Gascuel 2006)
   -abayes             approximate Bayes test (Anisimova et al. 2011)
   --abayes             approximate Bayes test (Anisimova et al. 2011)
   -lbp <#replicates>  Fast local bootstrap probabilities
   --lbp NUM            Replicates for fast local bootstrap probabilities


MODEL-FINDER:
MODEL-FINDER:
Line 151: Line 178:
   -m MF                Extended model selection with FreeRate heterogeneity
   -m MF                Extended model selection with FreeRate heterogeneity
   -m MFP              Extended model selection followed by tree inference
   -m MFP              Extended model selection followed by tree inference
   -m TESTMERGEONLY    Find best partition scheme (like PartitionFinder)
   -m ...+LM            Additionally test Lie Markov models
   -m TESTMERGE        Find best partition scheme followed by tree inference
   -m ...+LMRY          Additionally test Lie Markov models with RY symmetry
   -m MF+MERGE         Find best partition scheme incl. FreeRate heterogeneity
   -m ...+LMWS         Additionally test Lie Markov models with WS symmetry
   -m MFP+MERGE        Like -m MF+MERGE followed by tree inference
   -m ...+LMMK          Additionally test Lie Markov models with MK symmetry
   -rcluster <percent>  Percentage of partition pairs (relaxed clustering alg.)
   -m ...+LMSS          Additionally test strand-symmetric models
  -rclusterf <perc.>  Percentage of partition pairs (fast relaxed clustering)
   --mset STRING       Restrict search to models supported by other programs
   -rcluster-max <num>  Max number of partition pairs (default: 10*#partitions)
                       (raxml, phyml, mrbayes, beast1 or beast2)
  -mset program       Restrict search to models supported by other programs
   --mset STR,...       Comma-separated model list (e.g. -mset WAG,LG,JTT)
                       (raxml, phyml or mrbayes)
   --msub STRING        Amino-acid model source
   -mset <lm-subset>    Restrict search to a subset of the Lie-Markov models
                      Options for lm-subset are:
                      liemarkov, liemarkovry, liemarkovws, liemarkovmk, strandsymmetric
  -mset m1,...,mk      Restrict search to models in a comma-separated list
                      (e.g. -mset WAG,LG,JTT)
   -msub source         Restrict search to AA models for specific sources
                       (nuclear, mitochondrial, chloroplast or viral)
                       (nuclear, mitochondrial, chloroplast or viral)
   -mfreq f1,...,fk    Restrict search to using a list of state frequencies
   --mfreq STR,...     List of state frequencies
                      (default AA: -mfreq FU,F; codon: -mfreq ,F1x4,F3x4,F)
  --mrate STR,...     List of rate heterogeneity among sites
  -mrate r1,...,rk    Restrict search to a list of rate-across-sites models
                       (e.g. -mrate E,I,G,I+G,R is used for -m MF)
                       (e.g. -mrate E,I,G,I+G,R is used for -m MF)
   -cmin <kmin>        Min #categories for FreeRate model [+R] (default: 2)
   --cmin NUM          Min categories for FreeRate model [+R] (default: 2)
   -cmax <kmax>        Max #categories for FreeRate model [+R] (default: 10)
   --cmax NUM          Max categories for FreeRate model [+R] (default: 10)
   -merit AIC|AICc|BIC  Optimality criterion to use (default: all)
   --merit AIC|AICc|BIC  Akaike|Bayesian information criterion (default: BIC)
   -mtree               Perform full tree search for each model considered
   --mtree             Perform full tree search for every model
   -mredo              Ignore model results computed earlier (default: reuse)
   --madd STR,...       List of mixture models to consider
  -madd mx1,...,mxk    List of mixture models to also consider
   --mdef FILE          Model definition NEXUS file (see Manual)
   -mdef <nexus_file>  A model definition NEXUS file (see Manual)
  --modelomatic        Find best codon/protein/DNA models (Whelan et al. 2015)
 
PARTITION-FINDER:
  --merge              Merge partitions to increase model fit
  --merge greedy|rcluster|rclusterf
                      Set merging algorithm (default: rclusterf)
  --merge-model 1|all  Use only 1 or all models for merging (default: 1)
  --merge-model STR,...
                      Comma-separated model list for merging
  --merge-rate 1|all  Use only 1 or all rate heterogeneity (default: 1)
  --merge-rate STR,...
                      Comma-separated rate list for merging
  --rcluster NUM      Percentage of partition pairs for rcluster algorithm
  --rclusterf NUM      Percentage of partition pairs for rclusterf algorithm
  --rcluster-max NUM  Max number of partition pairs (default: 10*partitions)


SUBSTITUTION MODEL:
SUBSTITUTION MODEL:
   -m <model_name>
   -m STRING            Model name string (e.g. GTR+F+I+G)
                  DNA: HKY (default), JC, F81, K2P, K3P, K81uf, TN/TrN, TNef,
                DNA: HKY (default), JC, F81, K2P, K3P, K81uf, TN/TrN, TNef,
                       TIM, TIMef, TVM, TVMef, SYM, GTR, or 6-digit model
                       TIM, TIMef, TVM, TVMef, SYM, GTR, or 6-digit model
                       specification (e.g., 010010 = HKY)
                       specification (e.g., 010010 = HKY)
              Protein: LG (default), Poisson, cpREV, mtREV, Dayhoff, mtMAM,
            Protein: LG (default), Poisson, cpREV, mtREV, Dayhoff, mtMAM,
                       JTT, WAG, mtART, mtZOA, VT, rtREV, DCMut, PMB, HIVb,
                       JTT, WAG, mtART, mtZOA, VT, rtREV, DCMut, PMB, HIVb,
                       HIVw, JTTDCMut, FLU, Blosum62, GTR20, mtMet, mtVer, mtInv
                       HIVw, JTTDCMut, FLU, Blosum62, GTR20, mtMet, mtVer, mtInv, FLAVI,
      Protein mixture: C10,...,C60, EX2, EX3, EHO, UL2, UL3, EX_EHO, LG4M, LG4X
Q.LG, Q.pfam, Q.pfam_gb, Q.bird, Q.mammal, Q.insect, Q.plant, Q.yeast
              Binary: JC2 (default), GTR2
    Protein mixture: C10,...,C60, EX2, EX3, EHO, UL2, UL3, EX_EHO, LG4M, LG4X
      Empirical codon: KOSI07, SCHN05
              Binary: JC2 (default), GTR2
    Mechanistic codon: GY (default), MG, MGK, GY0K, GY1KTS, GY1KTV, GY2K,
    Empirical codon: KOSI07, SCHN05
  Mechanistic codon: GY (default), MG, MGK, GY0K, GY1KTS, GY1KTV, GY2K,
                       MG1KTS, MG1KTV, MG2K
                       MG1KTS, MG1KTV, MG2K
Semi-empirical codon: XX_YY where XX is empirical and YY is mechanistic model
Semi-empirical codon: XX_YY where XX is empirical and YY is mechanistic model
      Morphology/SNP: MK (default), ORDERED, GTR
      Morphology/SNP: MK (default), ORDERED, GTR
      Lie Markov DNA: One of the following, optionally prefixed by RY, WS or MK:
      Lie Markov DNA: 1.1, 2.2b, 3.3a, 3.3b, 3.3c, 3.4, 4.4a, 4.4b, 4.5a,
                      1.1, 2.2b, 3.3a, 3.3b, 3.3c,
                      4.5b, 5.6a, 5.6b, 5.7a, 5.7b, 5.7c, 5.11a, 5.11b, 5.11c,
                      3.4, 4.4a, 4.4b, 4.5a, 4.5b,
                      5.16, 6.6, 6.7a, 6.7b, 6.8a, 6.8b, 6.17a, 6.17b, 8.8,
                      5.6a, 5.6b, 5.7a, 5.7b, 5.7c,
                      8.10a, 8.10b, 8.16, 8.17, 8.18, 9.20a, 9.20b, 10.12,
                      5.11a,5.11b,5.11c,5.16, 6.6,
                       10.34, 12.12 (optionally prefixed by RY, WS or MK)
                      6.7a, 6.7b, 6.8a, 6.8b, 6.17a,
      Non-reversible: STRSYM (strand symmetric model, equiv. WS6.6),
                      6.17b,8.8, 8.10a,8.10b, 8.16,
                      NONREV, UNREST (unrestricted model, equiv. 12.12)
                      8.17, 8.18, 9.20a,9.20b,10.12,
                      NQ.pfam, NQ.bird, NQ.mammal, NQ.insect, NQ.plant, NQ.yeast
                       10.34,12.12
           Otherwise: Name of file containing user-model parameters
      Non-reversible: STRSYM (strand symmetric model, synonymous with WS6.6)
      Non-reversible: UNREST (most general unrestricted model, functionally equivalent to 12.12)
      Models can have parameters appended in brackets.
          e.g. '-mRY3.4{0.2,-0.3}+I' specifies parameters for
          RY3.4 model but leaves proportion of invariant sites
          unspecified. '-mRY3.4{0.2,-0.3}+I{0.5} gives both.
           When this is done, the given parameters will be taken
          as fixed (default) or as start point for optimization
          (if -optfromgiven option supplied)
 
        Otherwise: Name of file containing user-model parameters
                  (rate parameters and state frequencies)


STATE FREQUENCY:
STATE FREQUENCY:
   Append one of the following +F... to -m <model_name>
   -m ...+F             Empirically counted frequencies from alignment
  +F                   Empirically counted frequencies from alignment
   -m ...+FO           Optimized frequencies by maximum-likelihood
   +FO (letter-O)      Optimized frequencies by maximum-likelihood
   -m ...+FQ           Equal frequencies
   +FQ                 Equal frequencies
   -m ...+FRY           For DNA, freq(A+G)=1/2=freq(C+T)
   +FRY, +FWS, +FMK    For DNA models only, +FRY is freq(A+G)=1/2=freq(C+T),
  -m ...+FWS           For DNA, freq(A+T)=1/2=freq(C+G)
                      +FWS is freq(A+T)=1/2=freq(C+G), +FMK is freq(A+C)=1/2=freq(G+T).
  -m ...+FMK           For DNA, freq(A+C)=1/2=freq(G+T)
   +F####              where # are digits - for DNA models only, for basis in ACGT order,
   -m ...+Fabcd        4-digit constraint on ACGT frequency
                       digits indicate which frequencies are constrained to be the same.
                       (e.g. +F1221 means f_A=f_T, f_C=f_G)
                      E.g. +F1221 means freq(A)=freq(T), freq(C)=freq(G).
   -m ...+FU           Amino-acid frequencies given protein matrix
   +FU                 Amino-acid frequencies by the given protein matrix
   -m ...+F1x4         Equal NT frequencies over three codon positions
   +F1x4 (codon model)  Equal NT frequencies over three codon positions
   -m ...+F3x4         Unequal NT frequencies over three codon positions
   +F3x4 (codon model)  Unequal NT frequencies over three codon positions
 
MIXTURE MODEL:
  -m "MIX{model1,...,modelK}"  Mixture model with K components
  -m "FMIX{freq1,...freqK}"    Frequency mixture model with K components
  -mwopt              Turn on optimizing mixture weights (default: none)


RATE HETEROGENEITY AMONG SITES:
RATE HETEROGENEITY AMONG SITES:
   -m modelname+I       A proportion of invariable sites
   -m ...+I             A proportion of invariable sites
   -m modelname+G[n]   Discrete Gamma model with n categories (default n=4)
   -m ...+G[n]         Discrete Gamma model with n categories (default n=4)
   -m modelname*G[n]   Discrete Gamma model with unlinked model parameters
   -m ...*G[n]         Discrete Gamma model with unlinked model parameters
   -m modelname+I+G[n] Invariable sites plus Gamma model with n categories
   -m ...+I+G[n]       Invariable sites plus Gamma model with n categories
   -m modelname+R[n]   FreeRate model with n categories (default n=4)
   -m ...+R[n]         FreeRate model with n categories (default n=4)
   -m modelname*R[n]   FreeRate model with unlinked model parameters
   -m ...*R[n]         FreeRate model with unlinked model parameters
   -m modelname+I+R[n] Invariable sites plus FreeRate model with n categories
   -m ...+I+R[n]       Invariable sites plus FreeRate model with n categories
   -m modelname+Hn     Heterotachy model with n classes
   -m ...+Hn           Heterotachy model with n classes
   -m modelname*Hn     Heterotachy model with n classes and unlinked parameters
   -m ...*Hn           Heterotachy model with n classes and unlinked parameters
   -a <Gamma_shape>    Gamma shape parameter for site rates (default: estimate)
   --alpha-min NUM      Min Gamma shape parameter for site rates (default: 0.02)
  -amin <min_shape>    Min Gamma shape parameter for site rates (default: 0.02)
   --gamma-median      Median approximation for +G site rates (default: mean)
   -gmedian            Median approximation for +G site rates (default: mean)
   --rate              Write empirical Bayesian site rates to .rate file
   --opt-gamma-inv      More thorough estimation for +I+G model parameters
   --mlrate            Write maximum likelihood site rates to .mlrate file
  -i <p_invar>        Proportion of invariable sites (default: estimate)
  -wsr                Write site rates to .rate file
   -mh                  Computing site-specific rates to .mhrate file using
                      Meyer & von Haeseler (2003) method


POLYMORPHISM AWARE MODELS (PoMo):
POLYMORPHISM AWARE MODELS (PoMo):
-s <counts_file>      Input counts file (see manual)
  -s FILE              Input counts file (see manual)
-m <MODEL>+P         DNA substitution model (see above) used with PoMo
  -m ...+P             DNA substitution model (see above) used with PoMo
  +N<POPSIZE>         Virtual population size (default: 9)
  -m ...+N<POPSIZE>   Virtual population size (default: 9)
  +[WB|WH|S]         Sampling method (default: +WB), WB: Weighted binomial,
  -m ...+WB|WH|S]     Weighted binomial sampling
                      WH: Weighted hypergeometric S: Sampled sampling
  -m ...+WH           Weighted hypergeometric sampling
  +G[n]               Discrete Gamma rate model with n categories (default n=4)
  -m ...+S             Sampled sampling
  -m ...+G[n]         Discrete Gamma rate with n categories (default n=4)
 
COMPLEX MODELS:
  -m "MIX{m1,...,mK}"  Mixture model with K components
  -m "FMIX{f1,...fK}"  Frequency mixture model with K components
  --mix-opt            Optimize mixture weights (default: detect)
  -m ...+ASC          Ascertainment bias correction
  --tree-freq FILE    Input tree to infer site frequency model
  --site-freq FILE    Input site frequency model file
  --freq-max          Posterior maximum instead of mean approximation


ASCERTAINMENT BIAS CORRECTION:
TREE TOPOLOGY TEST:
   -m modelname+ASC    Correction for absence of invariant sites in alignment
   --trees FILE        Set of trees to evaluate log-likelihoods
  --test NUM          Replicates for topology test
  --test-weight        Perform weighted KH and SH tests
  --test-au            Approximately unbiased (AU) test (Shimodaira 2002)
  --sitelh            Write site log-likelihoods to .sitelh file


SINGLE TOPOLOGY HETEROTACHY MODEL:
ANCESTRAL STATE RECONSTRUCTION:
-m <model_name>+H[k]  Heterotachy model mixed branch lengths with k classes
  --ancestral          Ancestral state reconstruction by empirical Bayes
-m "MIX{m1,...mK}+H"
  --asr-min NUM        Min probability of ancestral state (default: equil freq)
-nni-eval <m>        Loop m times for NNI evaluation (default m=1)


SITE-SPECIFIC FREQUENCY MODEL:
TEST OF SYMMETRY:
   -ft <tree_file>      Input tree to infer site frequency model
  --symtest              Perform three tests of symmetry
   -fs <in_freq_file>   Input site frequency model file
  --symtest-only          Do --symtest then exist
   -fmax                Posterior maximum instead of mean approximation
  --symtest-remove-bad    Do --symtest and remove bad partitions
   --symtest-remove-good  Do --symtest and remove good partitions
   --symtest-type MAR|INT  Use MARginal/INTernal test when removing partitions
   --symtest-pval NUMER    P-value cutoff (default: 0.05)
   --symtest-keep-zero    Keep NAs in the tests


CONSENSUS RECONSTRUCTION:
CONCORDANCE FACTOR ANALYSIS:
   -t <tree_file>      Set of input trees for consensus reconstruction
   -t FILE              Reference tree to assign concordance factor
   -minsup <threshold>  Min split support in range [0,1]; 0.5 for majority-rule
  --gcf FILE          Set of source trees for gene concordance factor (gCF)
                      consensus (default: 0, i.e. extended consensus)
   --df-tree            Write discordant trees associated with gDF1
   -bi <burnin>        Discarding <burnin> trees at beginning of <treefile>
  --scf NUM            Number of quartets for site concordance factor (sCF)
   -con                Computing consensus tree to .contree file
   --scfl NUM          Like --scf but using likelihood (recommended)
   -net                Computing consensus network to .nex file
   -s FILE              Sequence alignment for --scf
   -sup <target_tree>  Assigning support values for <target_tree> to .suptree
   -p FILE|DIR          Partition file or directory for --scf
   -suptag <name>      Node name (or ALL) to assign tree IDs where node occurs
   --cf-verbose        Write CF per tree/locus to cf.stat_tree/_loci
   --cf-quartet        Write sCF for all resampled quartets to .cf.quartet


ROBINSON-FOULDS DISTANCE:
ALISIM: ALIGNMENT SIMULATOR
  -rf_all              Computing all-to-all RF distances of trees in <treefile>
  -rf <treefile2>      Computing all RF distances between two sets of trees
                      stored in <treefile> and <treefile2>
  -rf_adj              Computing RF distances of adjacent trees in <treefile>


TREE TOPOLOGY TEST:
Usage: iqtree --alisim <OUTPUT_PREFIX> [-m MODEL] [-t TREE] ...
   -z <trees_file>     Evaluating a set of user trees
 
   -zb <#replicates>   Performing BP,KH,SH,ELW tests for trees passed via -z
  --alisim OUTPUT_ALIGNMENT Activate AliSim and specify the output alignment filename
   -zw                  Also performing weighted-KH and weighted-SH tests
  -t TREE_FILE              Set the input tree file name
   -au                  Also performing approximately unbiased (AU) test
  --length LENGTH          Set the length of the root sequence
  --num-alignments NUMBER  Set the number of output datasets
  --seqtype STRING          BIN, DNA, AA, CODON, MORPH{NUM_STATES} (default: auto-detect)
                            For morphological data, 0<NUM_STATES<=32
  --m MODEL_STRING          Specify the evolutionary model. See Manual for more detail
  --mdef FILE              Name of a NEXUS model file to define new models (see Manual)
  --fundi TAXA_LIST,RHO    Specify a list of taxa, and Rho (Fundi weight) for FunDi model
   --indel <INS>,<DEL>      Set the insertion and deletion rate of the indel model,
                            relative to the substitution rate
  --indel-size <INS_DIS>,<DEL_DIS> Set the insertion and deletion size distributions
  --sub-level-mixture      Enable the feature to simulate substitution-level mixture model
  --no-unaligned            Disable outputing a file of unaligned sequences
                            when using indel models
  --root-seq FILE,SEQ_NAME  Specify the root sequence from an alignment
  -s FILE                  Specify the input sequence alignment
  --no-copy-gaps            Disable copying gaps from input alignment (default: false)
  --site-freq <OPTION>      Specify the option (MEAN (default), or SAMPLING, or MODEL)
                            to mimic the site-frequencies for mixture models from
                            the input alignment (see Manual)
   --site-rate <OPTION>     Specify the option (MEAN (default), or SAMPLING, or MODEL)
                            to mimic the discrete rate heterogeneity from
                            the input alignment (see Manual)
  -t RANDOM{MODEL,NUM_TAXA} Specify the model and the number of taxa to generate a random tree
  -rlen MIN MEAN MAX        Specify three numbers: minimum, mean and maximum branch lengths
                            when generating a random tree
  -p FILE                  NEXUS/RAxML partition file
                            Edge-linked proportional partition model
  -q FILE                  Like -p but edge-linked equal partition model
  -Q FILE                  Like -p but edge-unlinked partition model
  --distribution FILE      Supply a definition file of distributions,
                            which could be used to generate random model parameters
  --branch-distribution DIS Specify a distribution, from which branch lengths of the input trees
                            are randomly generated and overridden.
  --branch-scale SCALE      Specify a value to scale all branch lengths
  --single-output          Output all alignments into a single file
  --write-all              Enable outputting internal sequences
  --seed NUM                Random seed number (default: CPU clock)
                            Be careful to make the AliSim reproducible,
                            users should specify the seed number
  -gz                      Enable output compression but taking longer running time
   -af phy|fasta            Set the output format (default: phylip)
  User Manual is available at http://www.iqtree.org/doc/alisim
 
ANALYSIS WITH GENTRIUS ALGORITHM:
  --gentrius FILE      File must contain either a single species-tree or a set of subtrees.
  -pr_ab_matrix FILE  Presence-absence matrix of loci coverage.
  -s FILE              PHYLIP/FASTA/NEXUS/CLUSTAL/MSF alignment file(s)
  -p FILE              NEXUS/RAxML partition file
  -g_stop_t NUM        Stop after NUM species-trees were generated, or use 0 to turn off this stopping rule. Default: 1MLN trees.
  -g_stop_i NUM        Stop after NUM intermediate trees were visited, or use 0 to turn off this stopping rule. Default: 10MLN trees.
   -g_stop_h NUM        Stop after NUM hours (CPU time), or use 0 to turn off this stopping rule. Default: 7 days.
  -g_non_stop          Turn off all stopping rules.
  -g_query FILE        Species-trees to test for identical set of subtrees.
  -g_print            Write all generated species-trees. WARNING: there might be millions of trees!
  -g_print_lim NUM    Limit on the number of species-trees to be written.
  -g_print_induced    Write induced partition subtrees.
  -g_print_m          Write presence-absence matrix.
  -g_rm_leaves NUM    Invoke reverse analysis for complex datasets.
 
GENOMIC EPIDEMIOLOGICAL ANALYSIS:
  --pathogen          Apply CMAPLE tree search algorithm if sequence
                      divergence is low, otherwise, apply IQ-TREE algorithm.
  --pathogen-force    Apply CMAPLE tree search algorithm regardless
                      of sequence divergence.


ANCESTRAL STATE RECONSTRUCTION:
TIME TREE RECONSTRUCTION:
   -asr                Ancestral state reconstruction by empirical Bayes
   --date FILE          File containing dates of tips or ancestral nodes
   -asr-min <prob>      Min probability of ancestral state (default: equil freq)
  --date TAXNAME      Extract dates from taxon names after last '|'
  --date-tip STRING    Tip dates as a real number or YYYY-MM-DD
  --date-root STRING  Root date as a real number or YYYY-MM-DD
   --date-ci NUM        Number of replicates to compute confidence interval
  --clock-sd NUM      Std-dev for lognormal relaxed clock (default: 0.2)
  --date-no-outgroup  Exclude outgroup from time tree
  --date-outlier NUM  Z-score cutoff to remove outlier tips/nodes (e.g. 3)
  --date-options ".."  Extra options passing directly to LSD2
  --dating STRING      Dating method: LSD for least square dating (default)


GENERATING RANDOM TREES:
  -r <num_taxa>        Create a random tree under Yule-Harding model
  -ru <num_taxa>      Create a random tree under Uniform model
  -rcat <num_taxa>    Create a random caterpillar tree
  -rbal <num_taxa>    Create a random balanced tree
  -rcsg <num_taxa>    Create a random circular split network
  -rlen <min_len> <mean_len> <max_len> 
                      min, mean, and max branch lengths of random trees


MISCELLANEOUS:
  -wt                  Write locally optimal trees into .treels file
  -blfix              Fix branch lengths of user tree passed via -te
  -blscale            Scale branch lengths of user tree passed via -t
  -blmin              Min branch length for optimization (default 0.000001)
  -blmax              Max branch length for optimization (default 100)
  -wsr                Write site rates and categories to .rate file
  -wsl                Write site log-likelihoods to .sitelh file
  -wslr                Write site log-likelihoods per rate category
  -wslm                Write site log-likelihoods per mixture class
  -wslmr              Write site log-likelihoods per mixture+rate class
  -wspr                Write site probabilities per rate category
  -wspm                Write site probabilities per mixture class
  -wspmr              Write site probabilities per mixture+rate class
  -wpl                Write partition log-likelihoods to .partlh file
  -fconst f1,...,fN    Add constant patterns into alignment (N=#nstates)
  -me <epsilon>        LogL epsilon for parameter estimation (default 0.01)
  --no-outfiles        Suppress printing output files
  --eigenlib          Use Eigen3 library
  -alninfo            Print alignment sites statistics to .alninfo
  -czb                Collapse zero branches in final tree
  --show-lh            Compute tree likelihood without optimisation
</pre>
</pre>
[[#top|Back to Top]]
[[#top|Back to Top]]

Latest revision as of 12:35, 2 September 2025


Category

Bioinformatics

Program On

Sapelo2

Version

2.2.2.7, 2.3.5, 2.3.6

Author / Distributor

Please see http://www.iqtree.org/

Description

Efficient phylogenomic software by maximum likelihood. More information: http://www.iqtree.org/

Running Program

  • Version 2.2.2.7, installed in /apps/eb/IQ-TREE/2.2.2.7-gompi-2023a

To use this version of IQ-TREE, please first load the module with

module load IQ-TREE/2.2.2.7-gompi-2023a
  • Version 2.3.5, installed in /apps/eb/IQ-TREE/2.3.5-gompi-2023a

To use this version of IQ-TREE, please first load the module with

module load IQ-TREE/2.3.5-gompi-2023a
  • Version 2.3.6, installed in /apps/eb/IQ-TREE/2.3.6-gompi-2023a

To use this version of IQ-TREE, please first load the module with

module load IQ-TREE/2.3.6-gompi-2023a

Please note: The name of the executable for IQ-TREE version 2+ is iqtree2 and iqtree2-mpi (MPI version).


Sample job submission script (sub.sh) to run IQ-TREE version 2.3.6:

#!/bin/bash
#SBATCH --job-name=jobName
#SBATCH --partition=batch
#SBATCH --mail-type=ALL
#SBATCH --mail-user=username@uga.edu
#SBATCH --ntasks=1
#SBATCH --cpus-per-task=4
#SBATCH --mem=2gb
#SBATCH --time=04:00:00
#SBATCH --output=IQTREE.%j.out
#SBATCH --error=IQTREE.%j.err

cd $SLURM_SUBMIT_DIR

module load IQ-TREE/2.3.6-gompi-2023a
iqtree2 [options]

In the real submission script, at least all the above underlined values need to be reviewed or to be replaced by the proper values. If you request multiple cores per tasks in order to have iqtree2 run with multiple threads, then please use the -T option to specify how many threads iqtree2 should use. We recommend setting the number of threads to match the number of cores requested, e.g. -T 4 in this example.


Please refer to Running Jobs on Sapelo2 for more information on running jobs on the Sapelo2 cluster.

Here is an example of job submission command:

sbatch ./sub.sh 

Documentation

[shtsai@b1-24 ~]$ ml IQ-TREE/2.3.6-gompi-2023a
[shtsai@b1-24 ~]$ iqtree2 -h
IQ-TREE multicore version 2.3.6 for Linux x86 64-bit built May  5 2025
Developed by Bui Quang Minh, Nguyen Lam Tung, Olga Chernomor, Heiko Schmidt,
Dominik Schrempf, Michael Woodhams, Ly Trong Nhan, Thomas Wong

Usage: iqtree [-s ALIGNMENT] [-p PARTITION] [-m MODEL] [-t TREE] ...

GENERAL OPTIONS:
  -h, --help           Print (more) help usages
  -s FILE[,...,FILE]   PHYLIP/FASTA/NEXUS/CLUSTAL/MSF alignment file(s)
  -s DIR               Directory of alignment files
  --seqtype STRING     BIN, DNA, AA, NT2AA, CODON, MORPH (default: auto-detect)
  -t FILE|PARS|RAND    Starting tree (default: 99 parsimony and BIONJ)
  -o TAX[,...,TAX]     Outgroup taxon (list) for writing .treefile
  --prefix STRING      Prefix for all output files (default: aln/partition)
  --seed NUM           Random seed number, normally used for debugging purpose
  --safe               Safe likelihood kernel to avoid numerical underflow
  --mem NUM[G|M|%]     Maximal RAM usage in GB | MB | %
  --runs NUM           Number of indepedent runs (default: 1)
  -v, --verbose        Verbose mode, printing more messages to screen
  -V, --version        Display version number
  --quiet              Quiet mode, suppress printing to screen (stdout)
  -fconst f1,...,fN    Add constant patterns into alignment (N=no. states)
  --epsilon NUM        Likelihood epsilon for parameter estimate (default 0.01)
  -T NUM|AUTO          No. cores/threads or AUTO-detect (default: 1)
  --threads-max NUM    Max number of threads for -T AUTO (default: all cores)

CHECKPOINT:
  --redo               Redo both ModelFinder and tree search
  --redo-tree          Restore ModelFinder and only redo tree search
  --undo               Revoke finished run, used when changing some options
  --cptime NUM         Minimum checkpoint interval (default: 60 sec and adapt)

PARTITION MODEL:
  -p FILE|DIR          NEXUS/RAxML partition file or directory with alignments
                       Edge-linked proportional partition model
  -q FILE|DIR          Like -p but edge-linked equal partition model 
  -Q FILE|DIR          Like -p but edge-unlinked partition model
  -S FILE|DIR          Like -p but separate tree inference
  --subsample NUM      Randomly sub-sample partitions (negative for complement)
  --subsample-seed NUM Random number seed for --subsample

LIKELIHOOD/QUARTET MAPPING:
  --lmap NUM           Number of quartets for likelihood mapping analysis
  --lmclust FILE       NEXUS file containing clusters for likelihood mapping
  --quartetlh          Print quartet log-likelihoods to .quartetlh file

TREE SEARCH ALGORITHM:
  --ninit NUM          Number of initial parsimony trees (default: 100)
  --ntop NUM           Number of top initial trees (default: 20)
  --nbest NUM          Number of best trees retained during search (defaut: 5)
  -n NUM               Fix number of iterations to stop (default: OFF)
  --nstop NUM          Number of unsuccessful iterations to stop (default: 100)
  --perturb NUM        Perturbation strength for randomized NNI (default: 0.5)
  --radius NUM         Radius for parsimony SPR search (default: 6)
  --allnni             Perform more thorough NNI search (default: OFF)
  -g FILE              (Multifurcating) topological constraint tree file
  --fast               Fast search to resemble FastTree
  --polytomy           Collapse near-zero branches into polytomy
  --tree-fix           Fix -t tree (no tree search performed)
  --treels             Write locally optimal trees into .treels file
  --show-lh            Compute tree likelihood without optimisation
  --terrace            Check if the tree lies on a phylogenetic terrace

ULTRAFAST BOOTSTRAP/JACKKNIFE:
  -B, --ufboot NUM     Replicates for ultrafast bootstrap (>=1000)
  -J, --ufjack NUM     Replicates for ultrafast jackknife (>=1000)
  --jack-prop NUM      Subsampling proportion for jackknife (default: 0.5)
  --sampling STRING    GENE|GENESITE resampling for partitions (default: SITE)
  --boot-trees         Write bootstrap trees to .ufboot file (default: none)
  --wbtl               Like --boot-trees but also writing branch lengths
  --nmax NUM           Maximum number of iterations (default: 1000)
  --nstep NUM          Iterations for UFBoot stopping rule (default: 100)
  --bcor NUM           Minimum correlation coefficient (default: 0.99)
  --beps NUM           RELL epsilon to break tie (default: 0.5)
  --bnni               Optimize UFBoot trees by NNI on bootstrap alignment

NON-PARAMETRIC BOOTSTRAP/JACKKNIFE:
  -b, --boot NUM       Replicates for bootstrap + ML tree + consensus tree
  -j, --jack NUM       Replicates for jackknife + ML tree + consensus tree
  --jack-prop NUM      Subsampling proportion for jackknife (default: 0.5)
  --bcon NUM           Replicates for bootstrap + consensus tree
  --bonly NUM          Replicates for bootstrap only
  --tbe                Transfer bootstrap expectation

SINGLE BRANCH TEST:
  --alrt NUM           Replicates for SH approximate likelihood ratio test
  --alrt 0             Parametric aLRT test (Anisimova and Gascuel 2006)
  --abayes             approximate Bayes test (Anisimova et al. 2011)
  --lbp NUM            Replicates for fast local bootstrap probabilities

MODEL-FINDER:
  -m TESTONLY          Standard model selection (like jModelTest, ProtTest)
  -m TEST              Standard model selection followed by tree inference
  -m MF                Extended model selection with FreeRate heterogeneity
  -m MFP               Extended model selection followed by tree inference
  -m ...+LM            Additionally test Lie Markov models
  -m ...+LMRY          Additionally test Lie Markov models with RY symmetry
  -m ...+LMWS          Additionally test Lie Markov models with WS symmetry
  -m ...+LMMK          Additionally test Lie Markov models with MK symmetry
  -m ...+LMSS          Additionally test strand-symmetric models
  --mset STRING        Restrict search to models supported by other programs
                       (raxml, phyml, mrbayes, beast1 or beast2)
  --mset STR,...       Comma-separated model list (e.g. -mset WAG,LG,JTT)
  --msub STRING        Amino-acid model source
                       (nuclear, mitochondrial, chloroplast or viral)
  --mfreq STR,...      List of state frequencies
  --mrate STR,...      List of rate heterogeneity among sites
                       (e.g. -mrate E,I,G,I+G,R is used for -m MF)
  --cmin NUM           Min categories for FreeRate model [+R] (default: 2)
  --cmax NUM           Max categories for FreeRate model [+R] (default: 10)
  --merit AIC|AICc|BIC  Akaike|Bayesian information criterion (default: BIC)
  --mtree              Perform full tree search for every model
  --madd STR,...       List of mixture models to consider
  --mdef FILE          Model definition NEXUS file (see Manual)
  --modelomatic        Find best codon/protein/DNA models (Whelan et al. 2015)

PARTITION-FINDER:
  --merge              Merge partitions to increase model fit
  --merge greedy|rcluster|rclusterf
                       Set merging algorithm (default: rclusterf)
  --merge-model 1|all  Use only 1 or all models for merging (default: 1)
  --merge-model STR,...
                       Comma-separated model list for merging
  --merge-rate 1|all   Use only 1 or all rate heterogeneity (default: 1)
  --merge-rate STR,...
                       Comma-separated rate list for merging
  --rcluster NUM       Percentage of partition pairs for rcluster algorithm
  --rclusterf NUM      Percentage of partition pairs for rclusterf algorithm
  --rcluster-max NUM   Max number of partition pairs (default: 10*partitions)

SUBSTITUTION MODEL:
  -m STRING            Model name string (e.g. GTR+F+I+G)
                 DNA:  HKY (default), JC, F81, K2P, K3P, K81uf, TN/TrN, TNef,
                       TIM, TIMef, TVM, TVMef, SYM, GTR, or 6-digit model
                       specification (e.g., 010010 = HKY)
             Protein:  LG (default), Poisson, cpREV, mtREV, Dayhoff, mtMAM,
                       JTT, WAG, mtART, mtZOA, VT, rtREV, DCMut, PMB, HIVb,
                       HIVw, JTTDCMut, FLU, Blosum62, GTR20, mtMet, mtVer, mtInv, FLAVI,
			Q.LG, Q.pfam, Q.pfam_gb, Q.bird, Q.mammal, Q.insect, Q.plant, Q.yeast
     Protein mixture:  C10,...,C60, EX2, EX3, EHO, UL2, UL3, EX_EHO, LG4M, LG4X
              Binary:  JC2 (default), GTR2
     Empirical codon:  KOSI07, SCHN05
   Mechanistic codon:  GY (default), MG, MGK, GY0K, GY1KTS, GY1KTV, GY2K,
                       MG1KTS, MG1KTV, MG2K
Semi-empirical codon:  XX_YY where XX is empirical and YY is mechanistic model
      Morphology/SNP:  MK (default), ORDERED, GTR
      Lie Markov DNA:  1.1, 2.2b, 3.3a, 3.3b, 3.3c, 3.4, 4.4a, 4.4b, 4.5a,
                       4.5b, 5.6a, 5.6b, 5.7a, 5.7b, 5.7c, 5.11a, 5.11b, 5.11c,
                       5.16, 6.6, 6.7a, 6.7b, 6.8a, 6.8b, 6.17a, 6.17b, 8.8,
                       8.10a, 8.10b, 8.16, 8.17, 8.18, 9.20a, 9.20b, 10.12,
                       10.34, 12.12 (optionally prefixed by RY, WS or MK)
      Non-reversible:  STRSYM (strand symmetric model, equiv. WS6.6),
                       NONREV, UNREST (unrestricted model, equiv. 12.12)
                       NQ.pfam, NQ.bird, NQ.mammal, NQ.insect, NQ.plant, NQ.yeast
           Otherwise:  Name of file containing user-model parameters

STATE FREQUENCY:
  -m ...+F             Empirically counted frequencies from alignment
  -m ...+FO            Optimized frequencies by maximum-likelihood
  -m ...+FQ            Equal frequencies
  -m ...+FRY           For DNA, freq(A+G)=1/2=freq(C+T)
  -m ...+FWS           For DNA, freq(A+T)=1/2=freq(C+G)
  -m ...+FMK           For DNA, freq(A+C)=1/2=freq(G+T)
  -m ...+Fabcd         4-digit constraint on ACGT frequency
                       (e.g. +F1221 means f_A=f_T, f_C=f_G)
  -m ...+FU            Amino-acid frequencies given protein matrix
  -m ...+F1x4          Equal NT frequencies over three codon positions
  -m ...+F3x4          Unequal NT frequencies over three codon positions

RATE HETEROGENEITY AMONG SITES:
  -m ...+I             A proportion of invariable sites
  -m ...+G[n]          Discrete Gamma model with n categories (default n=4)
  -m ...*G[n]          Discrete Gamma model with unlinked model parameters
  -m ...+I+G[n]        Invariable sites plus Gamma model with n categories
  -m ...+R[n]          FreeRate model with n categories (default n=4)
  -m ...*R[n]          FreeRate model with unlinked model parameters
  -m ...+I+R[n]        Invariable sites plus FreeRate model with n categories
  -m ...+Hn            Heterotachy model with n classes
  -m ...*Hn            Heterotachy model with n classes and unlinked parameters
  --alpha-min NUM      Min Gamma shape parameter for site rates (default: 0.02)
  --gamma-median       Median approximation for +G site rates (default: mean)
  --rate               Write empirical Bayesian site rates to .rate file
  --mlrate             Write maximum likelihood site rates to .mlrate file

POLYMORPHISM AWARE MODELS (PoMo):
  -s FILE              Input counts file (see manual)
  -m ...+P             DNA substitution model (see above) used with PoMo
  -m ...+N<POPSIZE>    Virtual population size (default: 9)
  -m ...+WB|WH|S]      Weighted binomial sampling
  -m ...+WH            Weighted hypergeometric sampling
  -m ...+S             Sampled sampling
  -m ...+G[n]          Discrete Gamma rate with n categories (default n=4)

COMPLEX MODELS:
  -m "MIX{m1,...,mK}"  Mixture model with K components
  -m "FMIX{f1,...fK}"  Frequency mixture model with K components
  --mix-opt            Optimize mixture weights (default: detect)
  -m ...+ASC           Ascertainment bias correction
  --tree-freq FILE     Input tree to infer site frequency model
  --site-freq FILE     Input site frequency model file
  --freq-max           Posterior maximum instead of mean approximation

TREE TOPOLOGY TEST:
  --trees FILE         Set of trees to evaluate log-likelihoods
  --test NUM           Replicates for topology test
  --test-weight        Perform weighted KH and SH tests
  --test-au            Approximately unbiased (AU) test (Shimodaira 2002)
  --sitelh             Write site log-likelihoods to .sitelh file

ANCESTRAL STATE RECONSTRUCTION:
  --ancestral          Ancestral state reconstruction by empirical Bayes
  --asr-min NUM        Min probability of ancestral state (default: equil freq)

TEST OF SYMMETRY:
  --symtest               Perform three tests of symmetry
  --symtest-only          Do --symtest then exist
  --symtest-remove-bad    Do --symtest and remove bad partitions
  --symtest-remove-good   Do --symtest and remove good partitions
  --symtest-type MAR|INT  Use MARginal/INTernal test when removing partitions
  --symtest-pval NUMER    P-value cutoff (default: 0.05)
  --symtest-keep-zero     Keep NAs in the tests

CONCORDANCE FACTOR ANALYSIS:
  -t FILE              Reference tree to assign concordance factor
  --gcf FILE           Set of source trees for gene concordance factor (gCF)
  --df-tree            Write discordant trees associated with gDF1
  --scf NUM            Number of quartets for site concordance factor (sCF)
  --scfl NUM           Like --scf but using likelihood (recommended)
  -s FILE              Sequence alignment for --scf
  -p FILE|DIR          Partition file or directory for --scf
  --cf-verbose         Write CF per tree/locus to cf.stat_tree/_loci
  --cf-quartet         Write sCF for all resampled quartets to .cf.quartet

ALISIM: ALIGNMENT SIMULATOR

Usage: iqtree --alisim <OUTPUT_PREFIX> [-m MODEL] [-t TREE] ...

  --alisim OUTPUT_ALIGNMENT Activate AliSim and specify the output alignment filename
  -t TREE_FILE              Set the input tree file name
  --length LENGTH           Set the length of the root sequence
  --num-alignments NUMBER   Set the number of output datasets
  --seqtype STRING          BIN, DNA, AA, CODON, MORPH{NUM_STATES} (default: auto-detect)
                            For morphological data, 0<NUM_STATES<=32
  --m MODEL_STRING          Specify the evolutionary model. See Manual for more detail
  --mdef FILE               Name of a NEXUS model file to define new models (see Manual)
  --fundi TAXA_LIST,RHO     Specify a list of taxa, and Rho (Fundi weight) for FunDi model
  --indel <INS>,<DEL>       Set the insertion and deletion rate of the indel model,
                            relative to the substitution rate
  --indel-size <INS_DIS>,<DEL_DIS> Set the insertion and deletion size distributions
  --sub-level-mixture       Enable the feature to simulate substitution-level mixture model
  --no-unaligned            Disable outputing a file of unaligned sequences 
                            when using indel models
  --root-seq FILE,SEQ_NAME  Specify the root sequence from an alignment
  -s FILE                   Specify the input sequence alignment
  --no-copy-gaps            Disable copying gaps from input alignment (default: false)
  --site-freq <OPTION>      Specify the option (MEAN (default), or SAMPLING, or MODEL)
                            to mimic the site-frequencies for mixture models from
                            the input alignment (see Manual)
  --site-rate <OPTION>      Specify the option (MEAN (default), or SAMPLING, or MODEL)
                            to mimic the discrete rate heterogeneity from
                            the input alignment (see Manual)
  -t RANDOM{MODEL,NUM_TAXA} Specify the model and the number of taxa to generate a random tree
  -rlen MIN MEAN MAX        Specify three numbers: minimum, mean and maximum branch lengths
                            when generating a random tree
  -p FILE                   NEXUS/RAxML partition file
                            Edge-linked proportional partition model
  -q FILE                   Like -p but edge-linked equal partition model 
  -Q FILE                   Like -p but edge-unlinked partition model
  --distribution FILE       Supply a definition file of distributions,
                            which could be used to generate random model parameters
  --branch-distribution DIS Specify a distribution, from which branch lengths of the input trees
                            are randomly generated and overridden.
  --branch-scale SCALE      Specify a value to scale all branch lengths
  --single-output           Output all alignments into a single file
  --write-all               Enable outputting internal sequences
  --seed NUM                Random seed number (default: CPU clock)
                            Be careful to make the AliSim reproducible,
                            users should specify the seed number
  -gz                       Enable output compression but taking longer running time
  -af phy|fasta             Set the output format (default: phylip)
  User Manual is available at http://www.iqtree.org/doc/alisim

ANALYSIS WITH GENTRIUS ALGORITHM:
  --gentrius FILE      File must contain either a single species-tree or a set of subtrees.
  -pr_ab_matrix FILE   Presence-absence matrix of loci coverage.
  -s FILE              PHYLIP/FASTA/NEXUS/CLUSTAL/MSF alignment file(s)
  -p FILE              NEXUS/RAxML partition file
  -g_stop_t NUM        Stop after NUM species-trees were generated, or use 0 to turn off this stopping rule. Default: 1MLN trees.
  -g_stop_i NUM        Stop after NUM intermediate trees were visited, or use 0 to turn off this stopping rule. Default: 10MLN trees.
  -g_stop_h NUM        Stop after NUM hours (CPU time), or use 0 to turn off this stopping rule. Default: 7 days.
  -g_non_stop          Turn off all stopping rules.
  -g_query FILE        Species-trees to test for identical set of subtrees.
  -g_print             Write all generated species-trees. WARNING: there might be millions of trees!
  -g_print_lim NUM     Limit on the number of species-trees to be written.
  -g_print_induced     Write induced partition subtrees.
  -g_print_m           Write presence-absence matrix.
  -g_rm_leaves NUM     Invoke reverse analysis for complex datasets.

GENOMIC EPIDEMIOLOGICAL ANALYSIS:
  --pathogen           Apply CMAPLE tree search algorithm if sequence
                       divergence is low, otherwise, apply IQ-TREE algorithm.
  --pathogen-force     Apply CMAPLE tree search algorithm regardless
                       of sequence divergence.

TIME TREE RECONSTRUCTION:
  --date FILE          File containing dates of tips or ancestral nodes
  --date TAXNAME       Extract dates from taxon names after last '|'
  --date-tip STRING    Tip dates as a real number or YYYY-MM-DD
  --date-root STRING   Root date as a real number or YYYY-MM-DD
  --date-ci NUM        Number of replicates to compute confidence interval
  --clock-sd NUM       Std-dev for lognormal relaxed clock (default: 0.2)
  --date-no-outgroup   Exclude outgroup from time tree
  --date-outlier NUM   Z-score cutoff to remove outlier tips/nodes (e.g. 3)
  --date-options ".."  Extra options passing directly to LSD2
  --dating STRING      Dating method: LSD for least square dating (default)


Back to Top

Installation

IQ-Tree


System

64-bit Linux