verilylifesciences/variant-annotation

Name: variant-annotation

Owner: Verily Life Sciences

Description: Use cloud technology to annotate human sequence variants in parallel.

Created: 2017-08-24 19:41:29.0

Updated: 2017-08-28 17:17:31.0

Pushed: 2017-09-05 23:50:12.0

Homepage:

Size: 48

Language: Jupyter Notebook

GitHub Committers

UserMost Recent Commit# Commits

Other Committers

UserEmailMost Recent Commit# Commits

README

Disclaimer

This is not an official Verily product.

variant-annotation

This repository contains code to annotate human sequence variants using cloud technology to perform analyses in parallel.

Sub-projects:

The code in this repository is designed for use with genomic variants stored in Google BigQuery in a particular variant table format.

Processing uses Google Container Builder, Docker, and dsub for batch processing. We suggest working through the introductory materials for each tool before working with the code in this repository.

For interactive annotation, parallelism is accomplished due to the use of BigQuery. For batch annotation, parallelism is accomplished due to the use of dsub to run annotation in parallel on small shards of the input file(s).


This work is supported by the National Institutes of Health's National Center for Advancing Translational Sciences, Grant Number U24TR002306. This work is solely the responsibility of the creators and does not necessarily represent the official views of the National Institutes of Health.