PfamDB-Sapelo2: Difference between revisions

From Research Computing Center Wiki
Jump to navigation Jump to search
(Created page with "Category:Sapelo2Category:SoftwareCategory:BioinformaticsCategory:Bioinformatics Database A database for Hmmer and Pfam_scan program There are mult...")
 
No edit summary
Line 3: Line 3:


There are multiple versions under /db/pfam
There are multiple versions under /db/pfam
The pfam database must be formatted to work with certain software such as HMMER. The non-formulated downloads are numbered with the version of the pfam database.The currently available versions are 27.0 33.0 and  34.0. The versions which have been formatted to work with HMMER are listed with the pfam version followed by the HMMER version they have been formatted for. The currently available versions are 27.0-hmmer3.0 and 34.0-hmmer3.3.2.


<pre class="gscript">
<pre class="gscript">
Line 8: Line 9:
</pre>
</pre>


The command generated the data is documented at UGA file sitting at the same directory with data:
The commands to generate the data is documented at UGA file sitting at the same directory with data:
<pre class="gcommand">
<pre class="gcommand">
formatdb -i Pfam-A.fasta -n Pfam-A -t Pfam-A -l Pfam-A.log -p T
formatdb -i Pfam-A.fasta -n Pfam-A -t Pfam-A -l Pfam-A.log -p T
Line 18: Line 19:
xdformat -p -x Pfam-A.fasta
xdformat -p -x Pfam-A.fasta
xdformat -p Pfam-B.fasta
xdformat -p Pfam-B.fasta
</pre>
If you would like a particular version of the pfam database to work with a particular version of HMMER you can download or prepare this database yourself. To format pfam for use with HMMER you can use the function, hmmpress, which comes built into HMMER. The hmmpress step is required for hmmscan to work.
The following commands can be used to prepare pfam.hmm data.
<pre class="gcommand">
ml HMMER/3.1b2-foss-2016b #use whichever version of HMMER you would like to use for hmmscan
hmmpress Pfam-A.hmm
hmmpress Pfam-B.hmm
</pre>
</pre>



Revision as of 12:58, 13 October 2021

A database for Hmmer and Pfam_scan program

There are multiple versions under /db/pfam The pfam database must be formatted to work with certain software such as HMMER. The non-formulated downloads are numbered with the version of the pfam database.The currently available versions are 27.0 33.0 and 34.0. The versions which have been formatted to work with HMMER are listed with the pfam version followed by the HMMER version they have been formatted for. The currently available versions are 27.0-hmmer3.0 and 34.0-hmmer3.3.2.

/db/pfam/

The commands to generate the data is documented at UGA file sitting at the same directory with data:

formatdb -i Pfam-A.fasta -n Pfam-A -t Pfam-A -l Pfam-A.log -p T
formatdb -i Pfam-B.fasta -n Pfam-A -t Pfam-A -l Pfam-A.log -p T
ml HMMER/3.1b2-foss-2016b
hmmpress Pfam-A.hmm
hmmpress Pfam-B.hmm
ml WU-Blast/2.2.6-foss-2016b
xdformat -p -x Pfam-A.fasta
xdformat -p Pfam-B.fasta

If you would like a particular version of the pfam database to work with a particular version of HMMER you can download or prepare this database yourself. To format pfam for use with HMMER you can use the function, hmmpress, which comes built into HMMER. The hmmpress step is required for hmmscan to work. The following commands can be used to prepare pfam.hmm data.

ml HMMER/3.1b2-foss-2016b #use whichever version of HMMER you would like to use for hmmscan
hmmpress Pfam-A.hmm
hmmpress Pfam-B.hmm

version built with hmmer 3.0 (Pfam_scan friendly)

/db/pfam/27.0-hmmer3.0/Pfam-A
/db/pfam/27.0-hmmer3.0/Pfam-B
/db/pfam/27.0-hmmer3.0/Pfam-A.hmm
/db/pfam/27.0-hmmer3.0/Pfam-B.hmm

Databases are download install files from sanger website.