Name: LSA-pipeline
Owner: Hurwitz Lab
Description: LatentStrainAnalysis to separate out fastas into genome bins which can then be assembled individually
Created: 2015-10-27 22:07:04.0
Updated: 2015-10-27 22:09:06.0
Pushed: 2015-10-30 23:09:52.0
Homepage: null
Size: 34732
Language: Python
GitHub Committers
User | Most Recent Commit | # Commits |
---|
Other Committers
User | Most Recent Commit | # Commits |
---|
Copy of the fork https://github.com/scottdaniel/LatentStrainAnalysis
this will be remade to work with fasta files and with the PBS compute cluster software at the University of Arizona
Welcome to the Latent Strain Analysis (LSA) code repository!
LSA was developed as a pre-assembly tool for partitioning metagenomic reads. It uses a hyperplane hashing function and streaming SVD in order to find covariance relations between k-mers. The code, and the process outline in LSFScripts in particular, have been optimized to scale to massive data sets in fixed memory with a highly distributed computing environment.
Documentation for LSA, including a “getting started” tutorial with accompanying test data, and step-by-step instructions for analyzing large collections, can be found at: http://latentstrainanalysis.readthedocs.org/