GoogleCloudPlatform/Data-Pipeline

Name: Data-Pipeline

Owner: Google Cloud Platform

Description: Data Pipeline is a tool for running data loading pipelines. It is an open-source App Engine app that users can extend to suit their own needs. Out of the box, it loads files from a source, transforms them, and then outputs them (for example, by writing to a file or loading them into a data analysis tool). It is designed to be modular and to support various sources, transformation technologies, and output types. Transformations can be chained together to form complex pipelines.
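
The source, transform, output flow described above can be illustrated with a minimal Python sketch. This is not the Data-Pipeline API: every name here (source_lines, strip_blank, to_upper, run_pipeline, output_text) is hypothetical and the in-memory sample data is invented for illustration. It only shows the general idea of chaining stages so that each one consumes the previous stage's output.

```python
from typing import Callable, Iterable, List

# Hypothetical stage type (an assumption, not part of Data-Pipeline):
# a stage takes an iterable of text lines and returns a list of text lines.
Stage = Callable[[Iterable[str]], List[str]]


def source_lines() -> List[str]:
    """Stand-in for a source stage; a real one might read files from storage."""
    return ["  alice,3 ", "bob,5", "", "carol,7"]


def strip_blank(lines: Iterable[str]) -> List[str]:
    """Transformation: trim surrounding whitespace and drop empty lines."""
    return [line.strip() for line in lines if line.strip()]


def to_upper(lines: Iterable[str]) -> List[str]:
    """Transformation: upper-case every line."""
    return [line.upper() for line in lines]


def run_pipeline(lines: Iterable[str], stages: List[Stage]) -> List[str]:
    """Chain the stages: the output of each stage is the input of the next."""
    for stage in stages:
        lines = stage(lines)
    return list(lines)


def output_text(lines: Iterable[str]) -> str:
    """Stand-in for an output stage: join the lines into one text blob."""
    return "\n".join(lines)


result = output_text(run_pipeline(source_lines(), [strip_blank, to_upper]))
print(result)
```

Because each stage shares the same signature in this sketch, stages can be reordered or swapped without touching run_pipeline, which is one simple way to get the modularity the description mentions.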

Created: 2013-11-21 18:51:11.0

Updated: 2018-05-17 08:23:57.0

Pushed: 2014-02-11 18:32:58.0

Homepage: null

Size: 1335

Language: Python

GitHub Committers

User | Most Recent Commit | # Commits

Other Committers

User | Email | Most Recent Commit | # Commits


This work is supported by the National Institutes of Health's National Center for Advancing Translational Sciences, Grant Number U24TR002306. This work is solely the responsibility of the creators and does not necessarily represent the official views of the National Institutes of Health.