bayeshack2016/cfsf-datasci_dot-hazmat

Name: cfsf-datasci_dot-hazmat

Owner: Bayes Hack 2016

Description: #DOT Data Science Working Group @ Code for SF (CFA) : App for DoT Prompt 1 (Hazmat Incidents)

Created: 2016-04-23 21:26:08.0

Updated: 2016-10-22 18:03:04.0

Pushed: 2016-10-25 05:55:43.0

Homepage: http://bayeshack2016.github.io/cfsf-datasci_dot-hazmat/website

Size: 1221

Language: JavaScript

GitHub Committers

UserMost Recent Commit# Commits

Other Committers

UserEmailMost Recent Commit# Commits

README

Data Science Working Group @ Code for SF (CFA) :

Applications and Model Building for DoT Prompt 1 (Hazmat Incidents)

Synopsis:

In order to address the Department of Transportation prompt, the team has developed two tools which assist in detecting and predicting traffic related incidents.

The first tool is a model to predict the number of incidents that involved hazardous materials for a month in a particular state. The model was developed taking into account various publicly available data sources such as records on land development applications, employment data in energy related sectors, oil prices, and more.

Data sources used:

A Random Forest model was built using features from employment data from BLS and Oil and gas statistics data to predict number of Hamzat incidences at the state level. Further details can be found in the markdown document which describes the model selecion and building process:

The model projection and actual data by state can be accessed and visualized through an interactive front-end tool.

The second tool monitors the number of fatalities as well as the number of reported incidences involving hazardous materials and flags breakouts in trends using an anomaly detection algorithm. This can be used in real time for early detection of accumulations of incidents and accidents to trigger a fast investigation on the underlying problem.

Outlook:

Both tools have proven basic capabilities and can be further enhanced through a more thorough development. For instance, predictive capability of the model can be improved through taking into account further detailed and more granular data sources as they become available. This project has identified features and areas that can be used to inform possible preventative measures that the Department of Transportation can take to prevent Hazmat related incidents.

Slides:

deck here

Contributors:

This work is supported by the National Institutes of Health's National Center for Advancing Translational Sciences, Grant Number U24TR002306. This work is solely the responsibility of the creators and does not necessarily represent the official views of the National Institutes of Health.