Duke-GCB/MMAP

Name: MMAP

Owner: Duke Center for Genomic and Computational Biology

Description: Project_Lemur

Created: 2015-02-17 16:33:41.0

Updated: 2015-05-06 02:08:30.0

Pushed: 2015-05-06 02:08:30.0

Homepage:

Size: 34736

Language: Python

GitHub Committers

UserMost Recent Commit# Commits

Other Committers

UserEmailMost Recent Commit# Commits

README

MMAP: Microbial Metagenomic Analysis Pipeline

Developing a software package to analysis metagenomics data

Language: Python 2.7 Required package

Required software

Note: These software distributed here might not work with your OS!

INSTALL
Directory list

Default/Recommended folder structure. This structure is used in default control file. If Genovo/Glimmer/MINE/BLAST are located at other destination, please update the control file.

Generating local blast database

The BLAST component queries annotated Gene Ontology sequences downloaded from http://geneontology.org. Prior to running an analysis, you must download the sequences and convert them to a BLAST+ formatted database.

A bash script is provided that does this automatically: [src/scripts/makeblastdb-go.sh]. It uses curl and makeblastdb to download a FASTA-formatted protein sequence and convert it to NCBI BLAST+ format.

It accepts one argument - the output directory for the GO Sequence Database.

$ ./makeblastdb-go.sh /data/go-seqdb

After generating the local database, add its path to the control file, so that the pipeline knows where to find it.

USAGE
t the control file "data/example/control"; Update "parent_directory=" and point that to the data/ directory. Update "go_blastdb=" to point to the converted BLAST DB directory
MAP;
on src/core/main.py data/example/control
Directory list

This work is supported by the National Institutes of Health's National Center for Advancing Translational Sciences, Grant Number U24TR002306. This work is solely the responsibility of the creators and does not necessarily represent the official views of the National Institutes of Health.