Name: uniclust-pipeline
Owner: Söding Lab
Description: null
Created: 2016-08-15 10:24:27.0
Updated: 2017-08-01 21:25:01.0
Pushed: 2017-05-29 17:24:37.0
Homepage: null
Size: 66
Language: Shell
GitHub Committers
User | Most Recent Commit | # Commits |
---|
Other Committers
User | Most Recent Commit | # Commits |
---|
Make sure to install the HH-Suite3 and MMseqs2 and adjust the paths in paths.sh
.
Also make sure that awk, tar, pigz, cstranslate_mpi, sed, md5deep, clustalo, kalign, timeout
are all installed and available in PATH.
To build your own databases based on the uniclust pipeline you can use the following three scripts:
run_main.sh
: Run Main does the clustering, builds the uniclust30/50/90
and does the sequence enrichment of the uniboost10/20/30
databases.run_hhdatabase.sh
: Builds the uniclust30_hhsuite
databaserun_annotate.sh
: Builds the annotation filesMake sure to run the scripts in this order.
The pipeline was custom build for out LSF cluster computing environment and can be submitted to the LSF with bsub < run_mpi.sh
for example.
Please adjust the LSF parameters at the beginning of the run_
scripts. The pipeline assumes a shared file system between the computing nodes.
We provide a webserver and the Uniclust based on the UniProtKB on https://uniclust.mmseqs.com/.