hammerlab/tcga-blca

Name: tcga-blca

Owner: Hammer Lab

Description: Example analysis using Cohorts & TCGA-BLCA data

Created: 2016-09-13 04:15:14.0

Updated: 2016-09-27 20:46:38.0

Pushed: 2016-09-27 20:41:10.0

Homepage:

Size: 763

Language: Jupyter Notebook

GitHub Committers

UserMost Recent Commit# Commits
Jacki Buros Novik2016-09-27 20:41:05.082

Other Committers

UserEmailMost Recent Commit# Commits

README

tcga-blca

Example using Cohorts to manage TCGA-BLCA for analysis

  1. Query GDC for clinical and sample datasets for TCGA-BLCA data (query code to be merged into pygdc)
  2. Set up a Cohort using Cohorts to manage these data
  3. Mock-analysis of said Cohort to show functionality of Cohorts.

Setup

There are a few steps you will have to follow before using this code.

Once you have these items set up, you can run one or both of the refresh_*.py scripts to fetch data from the GDC portal.

Then, you can try out the various *.ipynbs in the repo for yourself, or use them as a starting point for further analysis.

query_tcga

The refresh_*.py scripts make use of the query_tcga package. This cannot currently be installed via pip.

Instead, you will want to install as follows:

pip install git+git://github.com/jburos/query_tcga

This code will eventually be merged into the cleaner pygdc package. For now, the merge of these codebases is a WIP.


This work is supported by the National Institutes of Health's National Center for Advancing Translational Sciences, Grant Number U24TR002306. This work is solely the responsibility of the creators and does not necessarily represent the official views of the National Institutes of Health.