Name: open-data-etl-utility-kit
Owner: City of Chicago
Description: Use Pentaho's open source data integration tool (Kettle) to create Extract-Transform-Load (ETL) processes to update a Socrata open data portal. Documentation is available at http://open-data-etl-utility-kit.readthedocs.io/en/stable
Created: 2014-09-12 02:28:22.0
Updated: 2018-01-05 19:44:39.0
Pushed: 2017-01-30 03:51:21.0
Size: 17520
Language: Shell
GitHub Committers
User | Most Recent Commit | # Commits |
---|---|---|
Ben Welsh | 2014-10-16 22:21:13.0 | 1 |
Forest Gregg | 2014-09-25 20:08:01.0 | 2 |
Tim Wisniewski | 2015-10-30 13:07:32.0 | 1 |
Josh Kalov | 2017-01-12 01:32:38.0 | 5 |
Tom Schenk Jr | 2017-01-30 03:51:20.0 | 64 |
Jonathan Levy | 2016-10-06 15:36:00.0 | 25 |
Jef Waltman | 2015-12-08 19:24:47.0 | 11 |
Other Committers
User | Most Recent Commit | # Commits |
---|
This toolkit provides several utilities and framework to help governments deploy automated ETLs using the open-source Pentaho data integration (Kettle) software.
Namely, this toolkit will allow:
The ETL framework is organized so each function can be modified in one file that is used by all ETLs. This provides for easier maintenance, upgrading, and modification over hundreds of ETLs.
The requirements for the recommended configuration require the following pieces of software:
This framework has only been tested using Kettle 4.3.0 and Kettle 4.4.0. It is possible that this framework is fully compatible with Kettle 5.x, but has not been tested. If you would like to contribute, please see the issue page.
Experiencing issues with the included files? Report it on our issue tracker