openelections/tracker

Name: tracker

Owner: The OpenElections Project

Description: Progress tracker for OpenElections work

Created: 2017-07-31 00:29:52.0

Updated: 2018-04-07 09:25:42.0

Pushed: 2018-04-07 09:42:13.0

Homepage: null

Size: 82

Language: Python

GitHub Committers

UserMost Recent Commit# Commits

Other Committers

UserEmailMost Recent Commit# Commits

README

Tracker

A command-line tool to check progress of data collection by The OpenElections Project.

The openelex-tracker command generates various files and reports.

The goal is to measure our progress toward generating four file types in each state:

Other elections, including office-specific primaries, may also be included where available.

See the Technical Background section below for more details.

Metrics on Github activity, repo creation, etc. are also fair game…

Install
clone git@github.com:openelections/tracker.git
racker/
 install

This step is only necessary in order to publish world-readable reports to the openelections-tracker S3 bucket.

.aws/credentials
nelex]
access_key_id = <ACCESS_KEY_ID>
secret_access_key = <SECRET_ACCESS_KEY>
Use
st available actions
elex-tracker --help

nerate a CSV of repos
elex-tracker --repos-report
elex-tracker --issues-report

nerate a CSV of repos and publish a world-readable file to S3
elex-tracker --repos-report --publish
elex-tracker --issues-report --publish
Contribute

To contribute code:

Setup
path/to/tracker
 "PYTHONPATH=$(pwd)" > .env
install pipenv
Run tests
 test
Technical Background

OpenElections has dozens of data-oriented Github repositories containing minimally processed data from source agencies. In some cases, these data files may be hand-keyed.

This “raw” data is stored in its pre-processed form on Github in repositories that follow the below naming conventions:

openelections-[source|data]-[state-postal]

For example:

The -sources- repos contain data files in their most raw form, such as unconverted image pdfs.

The -data- repos contain data in a minimally processed form, such as data files which have been extracted from PDFs but reflect the same field names, data values etc. as the original PDFs.

The files in -data- repos serve as the raw input for the OpenEletions data standardiztion pipeline.

The -sources- and -data- repos organize election result files into year-based directories. These files follow a naming convention based on OpenCivic identifiers.

TK - examples of file names…


This work is supported by the National Institutes of Health's National Center for Advancing Translational Sciences, Grant Number U24TR002306. This work is solely the responsibility of the creators and does not necessarily represent the official views of the National Institutes of Health.