Name: Genie
Owner: Sage Bionetworks
Description: For analyzing data in www.synapse.org/genie
Created: 2016-10-20 20:22:55.0
Updated: 2017-09-01 04:43:09.0
Pushed: 2018-01-17 23:52:35.0
Homepage: null
Size: 5152
Language: Jupyter Notebook
GitHub Committers
User | Most Recent Commit | # Commits |
---|
Other Committers
User | Most Recent Commit | # Commits |
---|
This repository documents code used to gather, QC, standardize, and analyze data uploaded by institutes participating in AACR's Project GENIE (Genomics, Evidence, Neoplasia, Information, Exchange). Follow instructions below to download and populate the folder data
with the core Genie dataset. And then review the contents of the analyses
folder, which contains a sub-folder each, for various analyses that can be performed on those datasets.
These are tools or packages you will need, to be able to reproduce these results:
bsub
) or SGE (qsub
)pip install synapseclient
)pip install pandas
)Download the core dataset into data/cbioportal
. This data feeds the web-based UI at cbioportal.org/genie:
pse get -r --downloadLocation data/cbioportal syn5521835
on sage_processing/validateGENIE.py -h
e: validateGENIE.py [-h]
{maf,clinical,fusions,cnv,vcf,seg,bed} file [file ...]
{MSK,GRCC,DFCI,NKI,JHU,MDA,VICC,UHN}
date GENIE files
tional arguments:
af,clinical,fusions,cnv,vcf,seg,bed}
File type that you are validating: maf, clinical,
fusions, cnv, vcf, seg, bed
le File(s) that you are validating. If you validation
your clinical files and you have both sample and
patient files, you must provide both
SK,GRCC,DFCI,NKI,JHU,MDA,VICC,UHN}
Contributing Center
onal arguments:
, --help show this help message and exit
Examples:
on validateGENIE.py clinical data_clinical_supp_NKI.txt NKI
on validateGENIE.py clinical data_clinical_supp_patient_VICC.txt data_clinical_supp_sample_VICC.txt VICC
Go onto GENIE ec2
Run shell script - .sage_processing/processGENIE.sh
Currently processGENIE.sh is set up with default parameters and set to upload into the staging site. Make sure you modify these lines below can be modified in the shell script. (Working on being able to pass in arguments into the shell script)
on database_to_staging.py Jul-2017 ~/cbioportal/ 1.1.0 --consortiumReleaseCutOff 183
on consortium_to_public.py Jul-2017 ~/cbioportal/ 1.1.0 --publicReleaseCutOff 365
on ../sage_dashboard/dashboardTableUpdater.py 1.1.0
ipt ../sage_dashboard/public_dashboard.R 1.1.0
ipt ../sage_dashboard/clinicalImages.R
on database_to_staging.py Jan-2017 ~/cbioportal/ 1.1.0 --skipMutationsInCis
on consortium_to_public.py Jul-2017 ~/cbioportal/ 2.0.0
on database_to_staging.py Jul-2017 ~/cbioportal/ 2.1.0
on consortium_to_public.py Jan-2018 ~/cbioportal/ 3.0.0
on database_to_staging.py Jan-2018 ~/cbioportal/ 3.1.0
on consortium_to_public.py Jul-2018 ~/cbioportal/ 4.0.0
stop ecs
rm -rf /var/lib/ecs/data/ecs_agent_data.json
Please double check and triple check your bug, and if you're super certain that we're not infallible, then click here to report the bug, along with a clear explanation of how we can find or reproduce it.