GoogleCloudPlatform/DataflowPythonSDK

Name: DataflowPythonSDK

Owner: Google Cloud Platform

Description: Google Cloud Dataflow provides a simple, powerful model for building both batch and streaming parallel data processing pipelines.

Created: 2016-02-19 23:24:03.0

Updated: 2018-05-23 15:45:28.0

Pushed: 2017-05-31 17:16:50.0

Homepage: http://cloud.google.com/dataflow

Size: 898

Language: null

GitHub Committers

UserMost Recent Commit# Commits

Other Committers

UserEmailMost Recent Commit# Commits

README

Google Cloud Dataflow SDK for Python

Google Cloud Dataflow SDK for Python is based on Apache Beam and targeted for executing Python pipelines on Google Cloud Dataflow.

Getting Started
We moved to Apache Beam!

Google Cloud Dataflow for Python is now Apache Beam Python SDK and the code development moved to the Apache Beam repo.

If you want to contribute to the project (please do!) use this Apache Beam contributor's guide

Contact Us

We welcome all usage-related questions on Stack Overflow tagged with google-cloud-dataflow.

Please use the issue tracker on Apache JIRA (sdk-py component) to report any bugs, comments or questions regarding SDK development.


This work is supported by the National Institutes of Health's National Center for Advancing Translational Sciences, Grant Number U24TR002306. This work is solely the responsibility of the creators and does not necessarily represent the official views of the National Institutes of Health.