hurwitzlab/LSA-pipeline

Name: LSA-pipeline

Owner: Hurwitz Lab

Description: LatentStrainAnalysis to separate out fastas into genome bins which can then be assembled individually

Created: 2015-10-27 22:07:04.0

Updated: 2015-10-27 22:09:06.0

Pushed: 2015-10-30 23:09:52.0

Homepage: null

Size: 34732

Language: Python

GitHub Committers

UserMost Recent Commit# Commits

Other Committers

UserEmailMost Recent Commit# Commits

README

Copy of the fork https://github.com/scottdaniel/LatentStrainAnalysis

this will be remade to work with fasta files and with the PBS compute cluster software at the University of Arizona

Welcome to the Latent Strain Analysis (LSA) code repository!

LSA was developed as a pre-assembly tool for partitioning metagenomic reads. It uses a hyperplane hashing function and streaming SVD in order to find covariance relations between k-mers. The code, and the process outline in LSFScripts in particular, have been optimized to scale to massive data sets in fixed memory with a highly distributed computing environment.

Documentation

Documentation for LSA, including a “getting started” tutorial with accompanying test data, and step-by-step instructions for analyzing large collections, can be found at: http://latentstrainanalysis.readthedocs.org/


This work is supported by the National Institutes of Health's National Center for Advancing Translational Sciences, Grant Number U24TR002306. This work is solely the responsibility of the creators and does not necessarily represent the official views of the National Institutes of Health.