soedinglab/hh-suite

Name: hh-suite

Owner: Söding Lab

Description: Remote protein homology detection suite.

Created: 2015-05-04 12:00:48.0

Updated: 2017-12-05 09:16:44.0

Pushed: 2017-09-19 15:04:26.0

Homepage: http://www.nature.com/nmeth/journal/v9/n2/full/nmeth.1818.html

Size: 22942

Language: C++

GitHub Committers

UserMost Recent Commit# Commits
James Hetherington2015-12-22 13:13:09.01
Milot Mirdita2017-09-19 15:03:47.062
Stefan Seemayer2015-08-11 10:54:38.02
David Miller2017-04-21 16:57:26.01
Martin Steinegger2017-08-10 15:37:29.04
Daniel2016-09-09 11:22:51.01
Gary Macindoe2016-04-19 15:54:53.06
Lukas Zimmermann2017-02-08 20:18:49.02
Seung-Zin Nam2016-09-25 20:17:16.01
Markus Meier2017-07-14 12:05:42.0136
Christian Roth2017-05-03 20:34:46.07

Other Committers

UserEmailMost Recent Commit# Commits
Andreas Biegertandreas@andreaspc.lmb.uni-muenchen.de2008-11-19 13:32:39.04
Andreas Biegertandreas@hn01.cluster.local2008-12-10 14:52:54.07
Andreas Biegertandreas@lmu.fms-install.local2008-12-05 13:02:21.02
Andreas Hauserandy@hauserws.genzentrum.lmu.de2011-09-13 22:11:27.01
Angermueller Christofangermue@in.tum.de2013-02-22 14:57:09.018
Armin Meierarmin@hn01.cluster.local2010-08-05 12:52:07.01
null voiddmiller4232017-04-25 22:29:13.01
Andreas Hauserhauser@dbs1.genzentrum.lmu.de2012-01-16 12:22:24.01
Andreas Hauserhauser@fs03.compbio.genzentrum.lmu.de2011-11-15 13:44:11.07
Andy Hauserhauser@genzentrum.lmu.de2013-05-23 14:28:32.0141
Andy Hauserhauser@genzentrum.lmu.de2013-05-23 14:28:32.0141
mergemaster (andy)hhsuite-devel@genzentrum.lmu.de2013-04-29 12:28:33.03
Martin Steineggermad@dhcp-10-176-85-96.dynamic.eduroam.mwn.de2015-01-06 22:19:42.01
Martin Steineggermad@martins-macbook-pro-3.local2014-11-05 04:06:19.021
madmad@martins-macbook-pro.local2016-07-21 09:47:16.020
Martin Steineggermad@martins-mbp-3.fritz.box2015-01-08 13:47:27.022
Martin Steineggermad@martins-mbp-3.railnet.train2014-12-30 22:57:10.09
Managermanager@fs02.compbio.genzentrum.lmu.de2012-02-22 15:05:57.020
Managermanager@hn01.cluster.local2011-11-28 12:37:47.026
Managermanager@lmu.fms-install.local2009-12-07 15:17:10.07
Managermanager@michaelpc.(none)2010-01-15 12:03:37.01
Maria Hausermaria@cn27.cluster.local2011-04-14 14:03:14.01
Markus Meiermarkus.meier@mpibpc.mpg.de2015-07-22 15:00:01.08
Markus Meiermarkus.meier@mpibpc.mpg.de2015-07-22 15:00:01.08
martin.steineggermartin.steinegger@campus.lmu.de2014-06-30 01:14:34.01
meiermarkmeiermark@10.163.241.1652013-07-15 10:59:45.01
meiermarkmeiermark@genzentrum.lmu.de2016-10-14 19:11:01.0170
meiermarkmeiermark@genzentrum.lmu.de2016-10-14 19:11:01.0170
meiermarkmeiermark@hn02.compbio.gcm.genzentrum.lmu.de2013-07-27 08:45:11.01
Markus Meiermeiermark@mars.(none)2012-10-15 08:21:28.01
Maria Hausermhauser@genzentrum.lmu.de2012-10-25 12:26:15.01
Michael Remmertmichael@hn01.cluster.local2011-11-28 12:36:37.046
Michael Remmertmichael@lmu.fms-install.local2010-11-30 10:26:39.02
Michael Remmertmichael@michaelpc.lmb.uni-muenchen.de2009-03-02 12:44:31.01
Michael Remmertmichael@michaelpc.(none)2010-10-27 11:28:12.062
Martin Steineggermsteine1@gwdu102.global.gwdg.cluster2015-08-03 12:35:27.01
Michael Remmertremmert@genzentrum.lmu.de2011-10-13 09:00:57.037
Harald Sagar Vöhringersagar@b1161-c542.mpibpc.intern2016-08-17 12:15:08.02
Harald Sagar Vöhringersagar@pool-164-16.mpibpc.intern2016-08-11 14:30:03.01
Harald Sagar Vöhringersagar@pool-164-21.mpibpc.intern2015-06-17 14:24:59.01
soedingsoeding@genzentrum.lmu.de2013-03-15 06:27:40.0156
Johannes Soedingsoeding@m1161-js.fritz.box2014-12-30 12:15:18.06
Johannes Soedingsoeding@soedingpc.lmb.uni-muenchen.de2010-02-11 13:57:54.025
Soedingsoeding@soedingpc.(none)2011-11-30 14:04:48.07

README

Beta-Test

HHsuite for sensitive sequence searching version 3.0-beta.3 (14-07-2017)

© Johannes Soeding, Markus Meier, Martin Steinegger, Michael Remmert, Andreas Hauser, Andreas Biegert 2015

Codeship Status for soedinglab/hh-suite

Build Status

The HH-suite is an open-source software package for sensitive protein sequence searching based on the pairwise alignment of hidden Markov models (HMMs).

WARNING

We had to rename our repository for ffindex, so it might be necessary to run the following commands to update the submodule in your clone:

git pull
git submodule deinit
git submodule init
Requirements

To compile from source, you will need:

Installation

We recommend compiling HHsuite on the machine that should run the computations so that it can be optimized for the appropriate CPU architecture.

Packages

Some distributions incorporate HHsuite on their own:

Release tarballs

The release tarballs should contain all required source files. Simply download and extract

Cloning from GIT

If you want to compile the most recent version, simply clone the git repository. Then, from the repository root, initialize the ffindex submodule:

git submodule init
git submodule update
Compilation

With the sourcecode ready, simply run cmake with the default settings and libraries should be auto-detected:

mkdir build
cd build
cmake -DCMAKE_BUILD_TYPE=RelWithDebInfo -G "Unix Makefiles" -DCMAKE_INSTALL_PREFIX=${INSTALL_BASE_DIR} ..
make
make install
Setting paths
Setting environment variables

In your shell set environment variable HHLIB to ${INSTALL_BASE_DIR}, e.g (for bash, zsh, ksh):

export HHLIB=${INSTALL_BASE_DIR}

HHsearch and HHblits look for the column state library file cs219.lib and the context library file context_data.lib in ${HHLIB}/data/. The HHsuite scripts also read HHLIB to locate the perl modules Align.pm and HHPaths.pm in ${HHLIB]/scripts/.

Add the location of HHsuite binaries and scripts to your search PATH variable

export PATH=${PATH}:${INSTALL_BASE_DIR}/bin:${INSTALL_BASE_DIR}/scripts
Specify BLAST, PSIPRED, PDB, DSSP paths

Specify paths in ${INSTALL_BASE_DIR}/scripts/HHPaths.pm where they are read by HHsuite's perl scripts.

Download Databases

Download current databases from our server To build up multiple sequences alignments using HHblits uniprot20 is sufficient.

Usage

For performing a single search iteration of HHblits, run HHblits with the following command:

hhblits -i <input-file> -o <result-file> -n 1 -d <database-basename>

For generating an alignment of homologous sequences:

hhblits -i <input-file> -o <result-file> -oa3m <result-alignment> -d <database-basename>

You can get a detailed list of options for HHblits by running HHblits with the “-h” option.

Building packages for a release

It might be good to do the following steps in a fresh VM with Ubuntu.

  1. Adjust /README.md (see TODOs) The following line has to be updated

        HHsuite for sensitive sequence searching version current_version (release_date)
    
  2. Adjust /CMakeLists.txt

    1. Update the version number

      set (HHSUITE_VERSION_MAJOR 3)
      set (HHSUITE_VERSION_MINOR 0)
      set (HHSUITE_VERSION_PATCH 3)
      
    2. Update the release date

      set (HHSUITE_DATE "14-07-2017")
      
    3. Check the package version

      set (CPACK_PACKAGE_VERSION_MAJOR "${HHSUITE_VERSION_MAJOR}")
      

      The beta is required for the current beta releases

      set (CPACK_PACKAGE_VERSION_MINOR "${HHSUITE_VERSION_MINOR}-beta")
      set (CPACK_PACKAGE_VERSION_MINOR "${HHSUITE_VERSION_PATCH}")
      
  3. Build the packages

    rm -rf build
    mkdir build
    cd build
    
    cmake -DCMAKE_INSTALL_PREFIX=/home/mmeier/opt/hh-suite -DHAVE_SSE2=1 \
    -DCMAKE_BUILD_TYPE=Release -DBUILD_SHARED_LIBS=OFF \
    -DCMAKE_EXE_LINKER_FLAGS_RELEASE=-static -static-libgcc \
    -static-libstdc++ -DCMAKE_FIND_LIBRARY_SUFFIXES=.a ..
    
    make -j 16
    make package
    make package_source
    

The generated packages can be found in /build

TODO
License

The HHsearch/HHblits software package is distributed under Gnu Public Licence, Version 3. This means that the HH-suite is free software: you can redistribute it and/or modify it under the terms of the GNU General Public License as published by the Free Software Foundation, either version 3 of the License, or (at your option) any later version.

This program is distributed in the hope that it will be useful, but WITHOUT ANY WARRANTY; without even the implied warranty of MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU General Public License for more details.

See the copy of the GNU General Public License in the LICENSE file. If you do not have this file, see http://www.gnu.org/licenses/

Notes

For full documentation see the user guide in hhsuite-userguide.pdf

We are very grateful for bug reports! Please contact us at soeding@mpibpc.mpg.de

Links
Acknowledgements

The hhsuite contains in file hhprefilter.cpp code adapted from Michael Farrar (http://sites.google.com/site/farrarmichael/smith-waterman). His code is marked in the file hhprefilter.cpp. For the copy right of that code, please see the LICENSE file that comes with HHsuite. Reference: Farrar M. Striped Smith-Waterman speeds database searches six times over other SIMD implementations. Bioinformatics. 2007, 23:156-61. Many posthumous thanks to Michael Farrar for his great code!


This work is supported by the National Institutes of Health's National Center for Advancing Translational Sciences, Grant Number U24TR002306. This work is solely the responsibility of the creators and does not necessarily represent the official views of the National Institutes of Health.