BUSCO-Sapelo2: Difference between revisions
No edit summary |
|||
Line 101: | Line 101: | ||
<pre class="gcommand"> | <pre class="gcommand"> | ||
singularity exec / | singularity exec /apps/singularity-images/busco-4.0.5.simg run_busco [options] | ||
</pre> | </pre> | ||
To get busco help info: | To get busco help info: | ||
<pre class="gcommand"> | <pre class="gcommand"> | ||
singularity exec / | singularity exec /apps/singularity-images/busco-4.0.5.simg run_busco -h | ||
</pre> | </pre> | ||
To check busco version info: | To check busco version info: | ||
<pre class="gcommand"> | <pre class="gcommand"> | ||
singularity exec / | singularity exec /apps/singularity-images/busco-4.0.5.simg run_busco -v | ||
</pre> | </pre> | ||
To have a check on other programs included in this busco singularity image: | To have a check on other programs included in this busco singularity image: | ||
<pre class="gcommand"> | <pre class="gcommand"> | ||
singularity exec / | singularity exec /apps/singularity-images ls /usr/local/bin | ||
singularity exec / | singularity exec /apps/singularity-images ls /augustus | ||
singularity exec / | singularity exec /apps/singularity-images ls /ncbi-blast-2.2.31+/bin | ||
singularity exec / | singularity exec /apps/singularity-images ls /hmmer-3.2.1 | ||
singularity exec / | singularity exec /apps/singularity-images ls /prodigal | ||
</pre> | </pre> | ||
Revision as of 11:27, 23 October 2020
Category
Bioinformatics
Program On
Sap2test
Version
4.0.5, 4.0.6
Author / Distributor
Description
"BUSCO - Benchmarking sets of Universal Single-Copy Orthologs." More details are at BUSCO
Running Program
Version 4.0.5
- Version 4.0.5, is installed at /apps/eb/BUSCO/4.0.5-foss-2019b-Python-3.7.4
BLAST+ v2.9.0 is loaded with this application. This version of Blast+ enables the multiple cores function for busco. AUGUSTUS v3.3.3 is also loaded with AUGUSTUS_CONFIG_PATH set correctly.
To use this version of BUSCO, please load the module with
ml BUSCO/4.0.5-foss-2019b-Python-3.7.4
Before run the program, please copy the BUSCO config file config.ini to your current working folder and modify the input file value and other values as needed in it. Please also copy AUGUSTUS config folder to the place:
cp -r /apps/eb/AUGUSTUS/3.3.3-foss-2019b/config config_augustus export AUGUSTUS_CONFIG_PATH=config_augustus cp /apps/eb/BUSCO/4.0.5-foss-2019b-Python-3.7.4/config/config.ini config.ini vim config.ini export BUSCO_CONFIG_FILE=config.ini
Version 4.0.6
- Version 4.0.6, is installed at /apps/eb/BUSCO/4.0.6-foss-2019b-Python-3.7.4
To use this version of BUSCO, please first load the module with
ml BUSCO/4.0.6-foss-2019b-Python-3.7.4
BLAST+ v2.9.0 is loaded with this application. This version of Blast+ enables the multiple cores function for busco. AUGUSTUS v3.3.3 is also loaded with AUGUSTUS_CONFIG_PATH set correctly.
Before run the program, please copy the BUSCO config file config.ini to your current working folder and modify the input file value and other values as needed in it. Please also copy AUGUSTUS config folder to the place:
cp -r /apps/eb/AUGUSTUS/3.3.3-foss-2019b/config config_augustus export AUGUSTUS_CONFIG_PATH=config_augustus cp /apps/eb/BUSCO/4.0.6-foss-2019b-Python-3.7.4/config/config.ini config.ini vim config.ini export BUSCO_CONFIG_FILE=config.ini
Example shell script sub.sh to run BUSCO/4.0.6 on the batch partition:
#!/bin/bash #SBATCH --job-name=busco # Job name #SBATCH --partition=batch # Partition (queue) name #SBATCH --ntasks=1 # Run a single task #SBATCH --cpus-per-task=4 # Number of CPU cores per task #SBATCH --mem=10gb # Job memory request #SBATCH --time=48:00:00 # Time limit hrs:min:sec #SBATCH --output=log.%j.out # Standard output log #SBATCH --error=log.%j.err # Standard error log #SBATCH --export=NONE # Don't export user's explicit env variables to compute node #SBATCH --mail-type=END,FAIL # Mail events (NONE, BEGIN, END, FAIL, ALL) #SBATCH --mail-user=username@uga.edu # Where to send mail cd $SLURM_SUBMIT_DIR ml BUSCO/4.0.6-foss-2019b-Python-3.7.4 # load BUSCO/4.0.6 module export AUGUSTUS_CONFIG_PATH=config_augustus export BUSCO_CONFIG_FILE=config.ini time busco --config ./config.ini --cpu 4 [options]
where [options] need to be replaced by the options (command and arguments) you want to use. Other parameters of the job, such as the time limit, maximum memory, number of cores, and the job name need to be modified appropriately as well.
Version 4.0.5, Singularity Image
- Version 4.0.5, is installed as a singularity image at /apps/singularity-images/busco-4.0.5.simg
To run this singularity image:
singularity exec /apps/singularity-images/busco-4.0.5.simg run_busco [options]
To get busco help info:
singularity exec /apps/singularity-images/busco-4.0.5.simg run_busco -h
To check busco version info:
singularity exec /apps/singularity-images/busco-4.0.5.simg run_busco -v
To have a check on other programs included in this busco singularity image:
singularity exec /apps/singularity-images ls /usr/local/bin singularity exec /apps/singularity-images ls /augustus singularity exec /apps/singularity-images ls /ncbi-blast-2.2.31+/bin singularity exec /apps/singularity-images ls /hmmer-3.2.1 singularity exec /apps/singularity-images ls /prodigal
Before run busco singularity container, please copy AUGUSTUS config folder to your current working folder:
cp -r /apps/eb/AUGUSTUS/3.3.3-foss-2019b/config config_augustus export AUGUSTUS_CONFIG_PATH=config_augustus
Please also copy the BUSCO config file config.ini from its singularity image to your current working folder and modify the input file value and other values as needed in it.
singularity exec /apps/singularity-images/busco-4.0.5.simg cp /busco/config/config.ini . vim config.ini export BUSCO_CONFIG_FILE=config.ini
Example shell script sub.sh to run BUSCO/4.0.5 singularity container:
#!/bin/bash #SBATCH --job-name=busco # Job name #SBATCH --partition=batch # Partition (queue) name #SBATCH --ntasks=1 # Run a single task #SBATCH --cpus-per-task=4 # Number of CPU cores per task #SBATCH --mem=10gb # Job memory request #SBATCH --time=24:00:00 # Time limit hrs:min:sec #SBATCH --output=log.%j.out # Standard output log #SBATCH --error=log.%j.err # Standard error log #SBATCH --export=NONE # Don't export user's explicit env variables to compute node #SBATCH --mail-type=END,FAIL # Mail events (NONE, BEGIN, END, FAIL, ALL) #SBATCH --mail-user=username@uga.edu # Where to send mail cd $SLURM_SUBMIT_DIR export AUGUSTUS_CONFIG_PATH=config_augustus export BUSCO_CONFIG_FILE=config.ini time singularity exec --bind ./config_augustus/:/augustus/config /apps/singularity-images/busco-4.0.5.simg run_busco --config ./config.ini --cpu 4 [options]
where [options] need to be replaced by the options (command and arguments) you want to use. Other parameters of the job, such as the maximum wall clock time, maximum memory, the number of cores per node, and the job name need to be modified appropriately as well.
Here is an example of job submission command:
sbatch sub.sh
Documentation
ml BUSCO/4.0.6-foss-2019b-Python-3.7.4 busco -h usage: busco -i [SEQUENCE_FILE] -l [LINEAGE] -o [OUTPUT_NAME] -m [MODE] [OTHER OPTIONS] Welcome to BUSCO 4.0.6: the Benchmarking Universal Single-Copy Ortholog assessment tool. For more detailed usage information, please review the README file provided with this distribution and the BUSCO user guide. optional arguments: -i FASTA FILE, --in FASTA FILE Input sequence file in FASTA format. Can be an assembled genome or transcriptome (DNA), or protein sequences from an annotated gene set. -c N, --cpu N Specify the number (N=integer) of threads/cores to use. -o OUTPUT, --out OUTPUT Give your analysis run a recognisable short name. Output folders and files will be labelled with this name. WARNING: do not provide a path --out_path OUTPUT_PATH Optional location for results folder, excluding results folder name. Default is current working directory. -e N, --evalue N E-value cutoff for BLAST searches. Allowed formats, 0.001 or 1e-03 (Default: 1e-03) -m MODE, --mode MODE Specify which BUSCO analysis mode to run. There are three valid modes: - geno or genome, for genome assemblies (DNA) - tran or transcriptome, for transcriptome assemblies (DNA) - prot or proteins, for annotated gene sets (protein) -l LINEAGE, --lineage_dataset LINEAGE Specify the name of the BUSCO lineage to be used. -f, --force Force rewriting of existing files. Must be used when output files with the provided name already exist. --limit REGION_LIMIT How many candidate regions (contig or transcript) to consider per BUSCO (default: 3) --long Optimization mode Augustus self-training (Default: Off) adds considerably to the run time, but can improve results for some non-model organisms -q, --quiet Disable the info logs, displays only errors --augustus_parameters AUGUSTUS_PARAMETERS Pass additional arguments to Augustus. All arguments should be contained within a single pair of quotation marks, separated by commas. E.g. '--param1=1,--param2=2' --augustus_species AUGUSTUS_SPECIES Specify a species for Augustus training. --auto-lineage Run auto-lineage to find optimum lineage path --auto-lineage-prok Run auto-lineage just on non-eukaryote trees to find optimum lineage path --auto-lineage-euk Run auto-placement just on eukaryote tree to find optimum lineage path --update-data Download and replace with last versions all lineages datasets and files necessary to their automated selection --offline To indicate that BUSCO cannot attempt to download files --config CONFIG_FILE Provide a config file -v, --version Show this version and exit -h, --help Show this help message and exit --list-datasets Print the list of available BUSCO datasets
Installation
Source code is obtained from BUSCO
System
64-bit Linux