Chicago/open-data-etl-utility-kit

Name: open-data-etl-utility-kit

Owner: City of Chicago

Description: Use Pentaho's open source data integration tool (Kettle) to create Extract-Transform-Load (ETL) processes to update a Socrata open data portal. Documentation is available at http://open-data-etl-utility-kit.readthedocs.io/en/stable

Created: 2014-09-12 02:28:22.0

Updated: 2018-01-05 19:44:39.0

Pushed: 2017-01-30 03:51:21.0

Homepage:

Size: 17520

Language: Shell

GitHub Committers

UserMost Recent Commit# Commits
Ben Welsh2014-10-16 22:21:13.01
Forest Gregg2014-09-25 20:08:01.02
Tim Wisniewski2015-10-30 13:07:32.01
Josh Kalov2017-01-12 01:32:38.05
Tom Schenk Jr2017-01-30 03:51:20.064
Jonathan Levy2016-10-06 15:36:00.025
Jef Waltman2015-12-08 19:24:47.011

Other Committers

UserEmailMost Recent Commit# Commits

README

ETL Utilities for an Open Data Program

This toolkit provides several utilities and framework to help governments deploy automated ETLs using the open-source Pentaho data integration (Kettle) software.

Namely, this toolkit will allow:

The ETL framework is organized so each function can be modified in one file that is used by all ETLs. This provides for easier maintenance, upgrading, and modification over hundreds of ETLs.

Features
Requirements

The requirements for the recommended configuration require the following pieces of software:

Kettle Compatibility

This framework has only been tested using Kettle 4.3.0 and Kettle 4.4.0. It is possible that this framework is fully compatible with Kettle 5.x, but has not been tested. If you would like to contribute, please see the issue page.

Errors / Bugs

Experiencing issues with the included files? Report it on our issue tracker


This work is supported by the National Institutes of Health's National Center for Advancing Translational Sciences, Grant Number U24TR002306. This work is solely the responsibility of the creators and does not necessarily represent the official views of the National Institutes of Health.