Name: docker-midas
Owner: Fred Hutchinson Cancer Research Center
Description: Docker image running MIDAS
Created: 2018-01-30 23:45:42.0
Updated: 2018-01-31 00:51:56.0
Pushed: 2018-01-31 00:58:37.0
Homepage: null
Size: 8
Language: Python
GitHub Committers
User | Most Recent Commit | # Commits |
---|
Other Committers
User | Most Recent Commit | # Commits |
---|
This repository provides a Docker image for running MIDAS that is compatible with automated analysis on AWS Batch. Specifically, this image includes a wrapper script that:
Downloads reference databases
Downloads input data
Runs MIDAS
Saves the outputs to persistent storage
In order to be compatible with AWS Batch, all of these steps are parameterizable and are run with a single command.
Specifies the set of FASTQ reads that will be aligned. Supports files from SRA, S3, or FTP. Use the file prefix to specify the source (s3://
, sra://
, or ftp://
). Note that for SRA, just provide the accession (e.g. sra://SRR123456
).
Path to the MIDAS reference database (folder). Supports s3://
, or a local path.
Folder to place the output in, supporting either s3://
or a local path. Output files will take the form of <prefix>.json.gz
, where <prefix>
is the SRA accession (if specified), or otherwise the prefix of the input file from S3 or ftp.
Number of threads used by MIDAS during alignment, defaults to 16.
nput_path": <PATH_TO_INPUT_DATA>,
nput": <INPUT_DATA_PREFIX>,
utput_folder": <OUTPUT_FOLDER_PATH>,
ogs": <ANALYSIS_LOGS>,
ef_db": <LOCAL_PATH_TO_REF_DB>,
ef_db_url": <URL_FOR_REF_DB>,
otal_reads": <INT>,
ime_elapsed": <FLOAT_SECONDS>,
esults": {
"species": [
{
"species_id": "Clostridium_botulinum_57664",
"relative_abundance": 0.966368434694,
"count_reads": 5631,
"coverage": 95.0686106346
},
...species level data for more species...
],
"genes": [
{
"species_id": "Clostridium_sporogenes_58038",
"marker_coverage": 0,
"fraction_covered": 0.112485472356,
"aligned_reads": 98310,
"covered_genes": 1355,
"mean_coverage": 6.65663712323,
"mapped_reads": 49030,
"pangenome_size": 12046
"genes": [
{
"copy_number": 0,
"count_reads": 47,
"coverage": 5.62211614956,
"gene_id": "1075091.3.peg.1003",
"annot": {
"figfam": [
"FIG00517132"
]
}
},
...more genes...
]
},
...gene level data for more species...
]
}