OrthoFinder-Sapelo2: Difference between revisions
No edit summary |
|||
(3 intermediate revisions by the same user not shown) | |||
Line 9: | Line 9: | ||
=== Version === | === Version === | ||
2.5.4 | |||
=== Author / Distributor === | === Author / Distributor === | ||
Line 23: | Line 23: | ||
Please refer to [[Running Jobs on Sapelo2]] | Please refer to [[Running Jobs on Sapelo2]] | ||
*Version 2. | *Version 2.5.4, installed at /apps/eb/OrthoFinder/2.5.4-foss-2022a/ | ||
To use the version 2.5.4, please first load the module with | |||
To use the version 2. | |||
<pre class="gscript"> | <pre class="gscript"> | ||
ml OrthoFinder/2. | ml OrthoFinder/2.5.4-foss-2022a | ||
</pre> | </pre> | ||
Here is an example of a shell script sub.sh to run orthofinder of version 2.5.4 at the batch queue: | |||
Here is an example of a shell script sub.sh to run orthofinder | |||
<pre class="gscript"> | <pre class="gscript"> | ||
#!/bin/bash | #!/bin/bash | ||
Line 53: | Line 38: | ||
#SBATCH --partition=batch | #SBATCH --partition=batch | ||
#SBATCH --ntasks=1 | #SBATCH --ntasks=1 | ||
#SBATCH --cpus-per-task= | #SBATCH --cpus-per-task=8 | ||
#SBATCH --mem= | #SBATCH --mem=32gb | ||
#SBATCH --time=120:00:00 | #SBATCH --time=120:00:00 | ||
#SBATCH --output=log.%j.out | #SBATCH --output=log.%j.out | ||
Line 63: | Line 48: | ||
cd $SLURM_SUBMIT_DIR | cd $SLURM_SUBMIT_DIR | ||
ml OrthoFinder/2.5. | ml OrthoFinder/2.5.4-foss-2022a | ||
orthofinder | orthofinder -t 8 -a 8 [options] | ||
</pre> | </pre> | ||
Line 76: | Line 61: | ||
=== Documentation === | === Documentation === | ||
<pre | <pre class="gcommand"> | ||
ml OrthoFinder/2.5. | ml OrthoFinder/2.5.4-foss-2022a | ||
orthofinder -h | |||
OrthoFinder version 2.5. | OrthoFinder version 2.5.4 Copyright (C) 2014 David Emms | ||
SIMPLE USAGE: | SIMPLE USAGE: | ||
Line 90: | Line 75: | ||
OPTIONS: | OPTIONS: | ||
-t <int> Number of parallel sequence search threads [Default = | -t <int> Number of parallel sequence search threads [Default = 16] | ||
-a <int> Number of parallel analysis threads | -a <int> Number of parallel analysis threads | ||
-d Input is DNA sequences | -d Input is DNA sequences |
Revision as of 10:46, 5 September 2023
Category
Bioinformatics
Program On
Sapelo2
Version
2.5.4
Author / Distributor
Description
"OrthoFinder is a fast, accurate and comprehensive analysis tool for comparative genomics. It finds orthologues and orthogroups infers rooted gene trees for all orthogroups and infers a rooted species tree for the species being analysed. OrthoFinder also provides comprehensive statistics for comparative genomic analyses. OrthoFinder is simple to use and all you need to run it is a set of protein sequence files (one per species) in FASTA format." More details are at OrthoFinder
Running Program
Please refer to Running Jobs on Sapelo2
- Version 2.5.4, installed at /apps/eb/OrthoFinder/2.5.4-foss-2022a/
To use the version 2.5.4, please first load the module with
ml OrthoFinder/2.5.4-foss-2022a
Here is an example of a shell script sub.sh to run orthofinder of version 2.5.4 at the batch queue:
#!/bin/bash #SBATCH --job-name=j_OrthoFinder #SBATCH --partition=batch #SBATCH --ntasks=1 #SBATCH --cpus-per-task=8 #SBATCH --mem=32gb #SBATCH --time=120:00:00 #SBATCH --output=log.%j.out #SBATCH --error=log.%j.err #SBATCH --mail-user=username@uga.edu #SBATCH --mail-type=ALL cd $SLURM_SUBMIT_DIR ml OrthoFinder/2.5.4-foss-2022a orthofinder -t 8 -a 8 [options]
Here is an example of job submission
sbatch ./sub.sh
Documentation
ml OrthoFinder/2.5.4-foss-2022a orthofinder -h OrthoFinder version 2.5.4 Copyright (C) 2014 David Emms SIMPLE USAGE: Run full OrthoFinder analysis on FASTA format proteomes in <dir> orthofinder [options] -f <dir> Add new species in <dir1> to previous run in <dir2> and run new analysis orthofinder [options] -f <dir1> -b <dir2> OPTIONS: -t <int> Number of parallel sequence search threads [Default = 16] -a <int> Number of parallel analysis threads -d Input is DNA sequences -M <txt> Method for gene tree inference. Options 'dendroblast' & 'msa' [Default = dendroblast] -S <txt> Sequence search program [Default = diamond] Options: blast, diamond, diamond_ultra_sens, blast_gz, mmseqs, blast_nucl -A <txt> MSA program, requires '-M msa' [Default = mafft] Options: mafft, muscle -T <txt> Tree inference method, requires '-M msa' [Default = fasttree] Options: fasttree, raxml, raxml-ng, iqtree -s <file> User-specified rooted species tree -I <int> MCL inflation parameter [Default = 1.5] -x <file> Info for outputting results in OrthoXML format -p <dir> Write the temporary pickle files to <dir> -1 Only perform one-way sequence search -X Don't add species names to sequence IDs -y Split paralogous clades below root of a HOG into separate HOGs -z Don't trim MSAs (columns>=90% gap, min. alignment length 500) -n <txt> Name to append to the results directory -o <txt> Non-default results directory -h Print this help text WORKFLOW STOPPING OPTIONS: -op Stop after preparing input files for BLAST -og Stop after inferring orthogroups -os Stop after writing sequence files for orthogroups (requires '-M msa') -oa Stop after inferring alignments for orthogroups (requires '-M msa') -ot Stop after inferring gene trees for orthogroups WORKFLOW RESTART COMMANDS: -b <dir> Start OrthoFinder from pre-computed BLAST results in <dir> -fg <dir> Start OrthoFinder from pre-computed orthogroups in <dir> -ft <dir> Start OrthoFinder from pre-computed gene trees in <dir> LICENSE: Distributed under the GNU General Public License (GPLv3). See License.md CITATION: When publishing work that uses OrthoFinder please cite: Emms D.M. & Kelly S. (2019), Genome Biology 20:238 If you use the species tree in your work then please also cite: Emms D.M. & Kelly S. (2017), MBE 34(12): 3267-3278 Emms D.M. & Kelly S. (2018), bioRxiv https://doi.org/10.1101/267914
Installation
source code from OrthoFinder
System
64-bit Linux