bomboradata/pubsub-to-bigquery

Name: pubsub-to-bigquery

Owner: Bombora

Description: A highly configurable Google Cloud Dataflow pipeline that writes data from Pub/Sub into a Google BigQuery table

Created: 2017-08-07 22:15:46.0

Updated: 2018-05-13 21:57:11.0

Pushed: 2018-04-23 17:34:55.0

Homepage:

Size: 17

Language: Java

README

PubSubToBigQuery

A highly configurable Google Cloud Dataflow pipeline that writes data from Pub/Sub into a Google BigQuery table.

Compile this Dataflow pipeline into a runnable JAR (pubsub-to-bq.jar), then run the JAR with the following parameters:

java.exe -jar "C:\Jars\pubsub-to-bq.jar" --runner=BlockingDataflowPipelineRunner --params="<params><workingBucket>gs://your_bucket</workingBucket><maxNumWorkers>1</maxNumWorkers><diskSizeGb>250</diskSizeGb><machineType>n1-standard-1</machineType><keyFile>C:\KeyFiles\YourFile.json</keyFile><accountEmail>your_account@developer.gserviceaccount.com</accountEmail><projectId>your_project_id</projectId><pipelineName>your_pipeline_name</pipelineName><pubSubTopic>your_pub_topic</pubSubTopic><bqDataSet>your_destination_BQ_dataset</bqDataSet><bqTable>your_destination_BQ_table</bqTable><streaming>true</streaming><zone>us-west1-a</zone><schema>{"fields":[{"description":null,"fields":null,"mode":"REQUIRED","name":"Student_Name","type":"STRING","ETag":null}],"ETag":null}</schema></params>"

The same parameters, formatted for readability:

<params>
    <workingBucket>gs://your_bucket</workingBucket>
    <maxNumWorkers>1</maxNumWorkers>
    <diskSizeGb>250</diskSizeGb>
    <machineType>n1-standard-1</machineType>
    <keyFile>C:\KeyFiles\YourFile.json</keyFile>
    <accountEmail>your_account@developer.gserviceaccount.com</accountEmail>
    <projectId>your_project_id</projectId>
    <pipelineName>your_pipeline_name</pipelineName>
    <pubSubTopic>your_pub_topic</pubSubTopic>
    <bqDataSet>your_destination_BQ_dataset</bqDataSet>
    <bqTable>your_destination_BQ_table</bqTable>
    <streaming>true</streaming>
    <zone>us-west1-a</zone>
    <schema>{"fields":[{"description":null,"fields":null,"mode":"REQUIRED","name":"Student_Name","type":"STRING","ETag":null}],"ETag":null}</schema>
</params>
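
To illustrate how a single --params value like the one above could be consumed, here is a minimal Java sketch that reads the child elements of the params root into a map using only the JDK's built-in XML parser. The class name ParamsXml and the method parse are hypothetical and are not taken from this repository; how the pipeline itself reads the value is not shown in this README.

```java
import java.io.StringReader;
import java.util.LinkedHashMap;
import java.util.Map;
import javax.xml.parsers.DocumentBuilder;
import javax.xml.parsers.DocumentBuilderFactory;
import org.w3c.dom.Document;
import org.w3c.dom.Element;
import org.w3c.dom.Node;
import org.w3c.dom.NodeList;
import org.xml.sax.InputSource;

// Hypothetical helper for illustration only; not part of the repository.
final class ParamsXml {

    // Returns each direct child element of <params> as a name -> text entry,
    // preserving the order in which the elements appear.
    static Map<String, String> parse(String xml) throws Exception {
        DocumentBuilder builder = DocumentBuilderFactory.newInstance().newDocumentBuilder();
        Document doc = builder.parse(new InputSource(new StringReader(xml)));
        Element root = doc.getDocumentElement();
        if (!"params".equals(root.getTagName())) {
            throw new IllegalArgumentException(
                    "expected <params> root, got <" + root.getTagName() + ">");
        }
        Map<String, String> values = new LinkedHashMap<>();
        NodeList children = root.getChildNodes();
        for (int i = 0; i < children.getLength(); i++) {
            Node child = children.item(i);
            // Skip whitespace text nodes between elements.
            if (child.getNodeType() == Node.ELEMENT_NODE) {
                values.put(child.getNodeName(), child.getTextContent().trim());
            }
        }
        return values;
    }

    public static void main(String[] args) throws Exception {
        // A shortened version of the parameters shown above.
        String xml = "<params>"
                + "<projectId>your_project_id</projectId>"
                + "<maxNumWorkers>1</maxNumWorkers>"
                + "<streaming>true</streaming>"
                + "<schema>{\"fields\":[{\"mode\":\"REQUIRED\","
                + "\"name\":\"Student_Name\",\"type\":\"STRING\"}]}</schema>"
                + "</params>";
        Map<String, String> params = parse(xml);

        // Values arrive as strings; convert the typed ones explicitly.
        int maxNumWorkers = Integer.parseInt(params.get("maxNumWorkers"));
        boolean streaming = Boolean.parseBoolean(params.get("streaming"));

        System.out.println(params.get("projectId") + " " + maxNumWorkers + " " + streaming);
        System.out.println(params.get("schema"));
    }
}
```

In this sketch the schema element is kept as a raw JSON string. Turning it into a BigQuery table schema would need a JSON library, which is left out here to keep the example dependency-free.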

This work is supported by the National Institutes of Health's National Center for Advancing Translational Sciences, Grant Number U24TR002306. This work is solely the responsibility of the creators and does not necessarily represent the official views of the National Institutes of Health.