BackofenLab/GraphClust

Name: GraphClust

Owner: Bioinformatics Lab - Department of Computer Science - University Freiburg

Description: Structural clustering of local RNA secondary structures

Created: 2016-03-31 15:23:58.0

Updated: 2017-01-17 15:03:10.0

Pushed: 2017-11-19 13:52:03.0

Homepage: null

Size: 1144

Language: Perl

GitHub Committers

UserMost Recent Commit# Commits

Other Committers

UserEmailMost Recent Commit# Commits

README

GraphClust

GraphClust can be used for structural clustering of RNA sequences. Especially it can be used for clustering of very large dataset with thousands of RNAs.

Motivation: Clustering according to sequence?structure similarity has now become a generally accepted scheme for ncRNA annotation. Its application to complete genomic sequences as well as whole transcriptomes is therefore desirable but hindered by extremely high computational costs.

Results: We present a novel linear-time, alignment-free method for comparing and clustering RNAs according to sequence and structure. The approach scales to datasets of hundreds of thousands of sequences. The quality of the retrieved clusters has been benchmarked against known ncRNA datasets and is comparable to state-of-the-art sequence?structure methods although achieving speedups of several orders of magnitude. A selection of applications aiming at the detection of novel structural ncRNAs are presented. Exemplarily, we predicted local structural elements specific to lincRNAs likely functionally associating involved transcripts to vital processes of the human nervous system. In total, we predicted 349 local structural RNA elements.

Contribution

Feel free to contribute to this project by raising Issues with feature requests or bug reports.

Cite

If you use GraphClust, please cite our article:

 10.1093/bioinformatics/bts224

This work is supported by the National Institutes of Health's National Center for Advancing Translational Sciences, Grant Number U24TR002306. This work is solely the responsibility of the creators and does not necessarily represent the official views of the National Institutes of Health.