UTHSCSA-CIRD/addressfix

Name: addressfix

Owner: Clinical Informatics Research Division

Description: Wrapper for a couple of address parsing/standardizing libraries for the purpose of cleaning up addresses in CSV files.

Created: 2017-08-12 19:30:42.0

Updated: 2017-10-17 04:53:37.0

Pushed: 2017-09-02 03:48:08.0

Homepage: null

Size: 37

Language: Python

GitHub Committers

UserMost Recent Commit# Commits

Other Committers

UserEmailMost Recent Commit# Commits

README

addressfix

Wrapper for a couple of address parsing/standardizing libraries for the purpose of cleaning up addresses in CSV files.

Instructions:

Setup:

pip install -r requirements.txt

Run with default settings (won't work right for this input file):

python addressfix.py -f test_address_input.csv

Run keeping columns 0,2,3,4 in the output and using column 4 as the address:

python addressfix.py -a 4 -k 0,2,3,4 -f test_address_input.csv

Get help

python addressfix.py -h

This work is supported by the National Institutes of Health's National Center for Advancing Translational Sciences, Grant Number U24TR002306. This work is solely the responsibility of the creators and does not necessarily represent the official views of the National Institutes of Health.