Name: MMAP
Owner: Duke Center for Genomic and Computational Biology
Description: Project_Lemur
Created: 2015-02-17 16:33:41.0
Updated: 2015-05-06 02:08:30.0
Pushed: 2015-05-06 02:08:30.0
Size: 34736
Language: Python
GitHub Committers
User | Most Recent Commit | # Commits |
---|
Other Committers
User | Most Recent Commit | # Commits |
---|
Developing a software package to analysis metagenomics data
Language: Python 2.7 Required package
Required software
Note: These software distributed here might not work with your OS!
Default/Recommended folder structure. This structure is used in default control file. If Genovo/Glimmer/MINE/BLAST are located at other destination, please update the control file.
blastx
, which is a symlink to /usr/bin/blastx
The BLAST component queries annotated Gene Ontology sequences downloaded from http://geneontology.org. Prior to running an analysis, you must download the sequences and convert them to a BLAST+ formatted database.
A bash script is provided that does this automatically: [src/scripts/makeblastdb-go.sh]. It uses curl and makeblastdb to download a FASTA-formatted protein sequence and convert it to NCBI BLAST+ format.
It accepts one argument - the output directory for the GO Sequence Database.
$ ./makeblastdb-go.sh /data/go-seqdb
After generating the local database, add its path to the control file, so that the pipeline knows where to find it.
t the control file "data/example/control"; Update "parent_directory=" and point that to the data/ directory. Update "go_blastdb=" to point to the converted BLAST DB directory
MAP;
on src/core/main.py data/example/control