OrthoFinder-Teaching: Difference between revisions
(Created page with "Category:TeachingCategory:SoftwareCategory:Bioinformatics === Category === Bioinformatics === Program On === Teaching === Version === 2.1.2 === A...") |
No edit summary |
||
(2 intermediate revisions by the same user not shown) | |||
Line 23: | Line 23: | ||
The last version of this application is at /usr/local/apps/gb/orthofinder/2.1.2 | The last version of this application is at /usr/local/apps/gb/orthofinder/2.1.2 | ||
To use this version, please | To use this version, please load the module with | ||
<pre class="gscript"> | <pre class="gscript"> | ||
ml orthofinder/2.1.2 | ml orthofinder/2.1.2 | ||
</pre> | </pre> | ||
Here is an example of a shell script, sub.sh, to run on | Here is an example of a shell script, sub.sh, to run on the batch queue: | ||
<div class="gscript2"> | <div class="gscript2"> | ||
Line 40: | Line 40: | ||
<nowiki>#</nowiki>SBATCH --time=<u>08:00:00</u><br> | <nowiki>#</nowiki>SBATCH --time=<u>08:00:00</u><br> | ||
<nowiki>#</nowiki>SBATCH --output=OrthoFinder.%j.out<br> | <nowiki>#</nowiki>SBATCH --output=OrthoFinder.%j.out<br> | ||
<nowiki>#</nowiki>SBATCH --error=OrthoFinder.%j.err<br> | |||
cd $SLURM_SUBMIT_DIR<br> | cd $SLURM_SUBMIT_DIR<br> | ||
ml orthofinder/2.1.2<br> | ml orthofinder/2.1.2<br> | ||
orthofinder.py | orthofinder.py <u>[options]</u><br> | ||
</div> | </div> | ||
In the real submission script, at least all the above underlined values need to be reviewed or to be replaced by the proper values. | In the real submission script, at least all the above underlined values need to be reviewed or to be replaced by the proper values. | ||
Line 59: | Line 60: | ||
<pre class="gcommand"> | <pre class="gcommand"> | ||
ml orthofinder/2.1.2 | ml orthofinder/2.1.2 | ||
orthofinder.py | orthofinder.py -h | ||
[ | OrthoFinder version 2.1.2 Copyright (C) 2014 David Emms | ||
SIMPLE USAGE: | |||
Run full OrthoFinder analysis on FASTA format proteomes in <dir> | |||
orthofinder [options] -f <dir> | |||
Add new species in <dir1> to previous run in <dir2> and run new analysis | |||
orthofinder [options] -f <dir1> -b <dir2> | |||
OPTIONS: | |||
-t <int> Number of parallel sequence search threads [Default = 16] | |||
-a <int> Number of parallel analysis threads [Default = 1] | |||
-M <txt> Method for gene tree inference. Options 'dendroblast' & 'msa' | |||
[Default = dendroblast] | |||
-S <txt> Sequence search program [Default = blast] | |||
Options: blast, blast_gz, diamond | |||
-A <txt> MSA program, requires '-M msa' [Default = mafft] | |||
Options: mafft, muscle, mafft | |||
-T <txt> Tree inference method, requires '-M msa' [Default = fasttree] | |||
Options: mafft, iqtree, fasttree, raxml | |||
-R <txt> Tree reconciliation method [Default = of_recon] | |||
Options: of_recon, dlcpar, dlcpar_deepsearch | |||
-s <file> User-specified rooted species tree | |||
-I <int> MCL inflation parameter [Default = 1.5] | |||
-x <file> Info for outputting results in OrthoXML format | |||
-p <dir> Write the temporary pickle files to <dir> | |||
-1 Only perform one-way sequence search | |||
-n <txt> Name to append to the results directory | |||
-h Print this help text | |||
WORKFLOW STOPPING OPTIONS: | |||
-op Stop after preparing input files for BLAST | |||
-og Stop after inferring orthogroups | |||
-os Stop after writing sequence files for orthogroups | |||
(requires '-M msa') | |||
-oa Stop after inferring alignments for orthogroups | |||
(requires '-M msa') | |||
-ot Stop after inferring gene trees for orthogroups | |||
WORKFLOW RESTART COMMANDS: | |||
-b <dir> Start OrthoFinder from pre-computed BLAST results in <dir> | |||
-fg <dir> Start OrthoFinder from pre-computed orthogroups in <dir> | |||
-ft <dir> Start OrthoFinder from pre-computed gene trees in <dir> | |||
LICENSE: | |||
Distributed under the GNU General Public License (GPLv3). See License.md | |||
CITATION: | |||
When publishing work that uses OrthoFinder please cite: | |||
Emms D.M. & Kelly S. (2015), Genome Biology 16:157 | |||
</pre> | </pre> | ||
[[#top|Back to Top]] | [[#top|Back to Top]] |
Latest revision as of 11:21, 15 August 2018
Category
Bioinformatics
Program On
Teaching
Version
2.1.2
Author / Distributor
Description
"OrthoFinder is a fast, accurate and comprehensive analysis tool for comparative genomics. It finds orthologues and orthogroups infers rooted gene trees for all orthogroups and infers a rooted species tree for the species being analysed. OrthoFinder also provides comprehensive statistics for comparative genomic analyses. OrthoFinder is simple to use and all you need to run it is a set of protein sequence files (one per species) in FASTA format." More details are at OrthoFinder
Running Program
The last version of this application is at /usr/local/apps/gb/orthofinder/2.1.2
To use this version, please load the module with
ml orthofinder/2.1.2
Here is an example of a shell script, sub.sh, to run on the batch queue:
#!/bin/bash
#SBATCH --job-name=j_OrthoFinder
#SBATCH --partition=batch
#SBATCH --mail-type=ALL
#SBATCH --mail-user=username@uga.edu
#SBATCH --ntasks=1
#SBATCH --mem=10gb
#SBATCH --time=08:00:00
#SBATCH --output=OrthoFinder.%j.out
#SBATCH --error=OrthoFinder.%j.err
cd $SLURM_SUBMIT_DIR
ml orthofinder/2.1.2
orthofinder.py [options]
In the real submission script, at least all the above underlined values need to be reviewed or to be replaced by the proper values.
Please refer to Running_Jobs_on_the_teaching_cluster, Run X window Jobs and Run interactive Jobs for more details of running jobs at Teaching cluster.
Here is an example of job submission command:
sbatch ./sub.sh
Documentation
ml orthofinder/2.1.2 orthofinder.py -h OrthoFinder version 2.1.2 Copyright (C) 2014 David Emms SIMPLE USAGE: Run full OrthoFinder analysis on FASTA format proteomes in <dir> orthofinder [options] -f <dir> Add new species in <dir1> to previous run in <dir2> and run new analysis orthofinder [options] -f <dir1> -b <dir2> OPTIONS: -t <int> Number of parallel sequence search threads [Default = 16] -a <int> Number of parallel analysis threads [Default = 1] -M <txt> Method for gene tree inference. Options 'dendroblast' & 'msa' [Default = dendroblast] -S <txt> Sequence search program [Default = blast] Options: blast, blast_gz, diamond -A <txt> MSA program, requires '-M msa' [Default = mafft] Options: mafft, muscle, mafft -T <txt> Tree inference method, requires '-M msa' [Default = fasttree] Options: mafft, iqtree, fasttree, raxml -R <txt> Tree reconciliation method [Default = of_recon] Options: of_recon, dlcpar, dlcpar_deepsearch -s <file> User-specified rooted species tree -I <int> MCL inflation parameter [Default = 1.5] -x <file> Info for outputting results in OrthoXML format -p <dir> Write the temporary pickle files to <dir> -1 Only perform one-way sequence search -n <txt> Name to append to the results directory -h Print this help text WORKFLOW STOPPING OPTIONS: -op Stop after preparing input files for BLAST -og Stop after inferring orthogroups -os Stop after writing sequence files for orthogroups (requires '-M msa') -oa Stop after inferring alignments for orthogroups (requires '-M msa') -ot Stop after inferring gene trees for orthogroups WORKFLOW RESTART COMMANDS: -b <dir> Start OrthoFinder from pre-computed BLAST results in <dir> -fg <dir> Start OrthoFinder from pre-computed orthogroups in <dir> -ft <dir> Start OrthoFinder from pre-computed gene trees in <dir> LICENSE: Distributed under the GNU General Public License (GPLv3). See License.md CITATION: When publishing work that uses OrthoFinder please cite: Emms D.M. & Kelly S. (2015), Genome Biology 16:157
Installation
Source code is obtained from OrthoFinder
System
64-bit Linux