Data Science for Social Good

Login: dssg

Company: null

Location: Chicago, IL

email: datascience@uchicago.edu

Blog: http://dssg.uchicago.edu

Members

  1. Adam Fishman
  2. Ahmad Qamar
  3. Allen Lin
  4. Andrea Fernández Conde
  5. Andrea Navarrete
  6. Andres
  7. Brandon T. Willard
  8. Christopher Brown
  9. Dean Malmgren
  10. Edward Su
  11. Evan Misshula
  12. Forest Gregg
  13. Gene Leynes
  14. Hunter Owens
  15. Isaac Hollander McCreery
  16. Joe Walsh
  17. Jordan T Bates
  18. Joshua Gary Mausolf
  19. Juan-Pablo Velez
  20. Kayla Jacobs
  21. Kris Sankaran
  22. Lisa Nash
  23. Manojit Nandi
  24. Matt Bauman
  25. Matt Gee
  26. Michael Castelle
  27. Michelangelo D'Agostino
  28. Miguel Perez
  29. Mike Stringer
  30. Miles Watkins
  31. Min Xu
  32. Nathan Leiby
  33. Nick Mader
  34. Paul Meinshausen
  35. Pedro Saleiro
  36. Rayid Ghani
  37. redshiftzero
  38. Roberto Sánchez Ávalos
  39. Sam Adhikari
  40. Sarah Adelaide
  41. Sumedh Joshi
  42. Tom Plagge
  43. Tom Schenk Jr
  44. Vidhur Vohra
  45. Walter Dempsey
  46. Zahra Ashktorab
  47. null
  48. null
  49. null
  50. null
  51. null
  52. null
  53. null
  54. null
  55. null
  56. null
  57. null
  58. null
  59. null
  60. null
  61. null

Repositories

411-on-311
Exploratory analysis and predictive models of how Chicago's neighborhoods interact with the City's 311 service requests.
acs2pgsql
Download American Community Survey data and put it into a Postgres database
ADB_tools
Tools for administering Salesforce Alumni Database. Written in Python.
aequitas
Bias and Fairness Audit Toolkit
aequitas-public
null
after-hours
null
architect
Plan, design and build train and test matrices
argcmdr
Thin argparse wrapper for quick, clear and easy declaration of hierarchical console command interfaces
audition
Choosing the best classifier models
babies-public
This is the publicly available version of the babies repo, containing code used during our project with the Illinois Department of Human Services to predict and reduce adverse births in Illinois.
benchmark
Repository for code that will be given to Benchmark Analytics
bikeshare
Statistical models and webapp for predicting when bikeshare stations will be empty or full.
catwalk
Training, testing, and evaluating machine learning classifier models
census-communities-usa
Mapping and analyzing local business data from the Census Bureau.
cincinnati
DSaPP project with the City of Cincinnati. Building upon the DSSG15 project
cincinnati2015-public
Predicting blight in Cincinnati
cincinnati_ems_public
null
ci-public
Public face of the Conservation International project
collate
Aggregation SQL Query Builder
cookiecutter
null
cta-otp
OpenTripPlanner tool and transit mobility maps for Chicago
cta-sim
Big data simulation of Chicago's public transportation to improve transit planning and reduce bus crowding
data-challenges
A repository of real-world data challenges faced by organizations used for project-based learning
data-portal-treemap
Chicago Data Portal (data.cityofchicago.org) tree map
data-science-101
Methods, tools, tips, and tricks for anyone interested in getting started doing data science for the social good.
data-sci-fellows
The sexy landing page that will make everyone want to apply for the fellowship.
deploybot
A series of Chef Recipes to deploy the DSSG stack.
dickens
Common Python descriptors
diogenes
Searching for an honest classifier
dirtyduck
Triage's guided tour
DSaPP_RA_Project
This repository includes an exercise for aspiring DSaPP volunteers and research assistants to complete
dssg2017-text_analysis
Text Analysis Tutorial for DSSG 2017 Conference
dssg-hospitalization
Patient Hospitalization Prediction Ptoject
dssg-manual
This repository contains the Eric & Wendy Schmidt Data Science for Social Good Fellowship Manual
dssg-public-hmda
null
dssg-training-workshop-2015
Main site for DSSG Training 2015
EDF
Analysis of energy efficiency loan data for the Environmental Defense Fund.
education-college-public
A 2015 DSSG project assisting school networks to increase the proportion of their alumni who graduate college
education-highschool-public
DSSG 2015 project focused on using data science methods to help partner public school districts improve their respective high school graduation rates and outcomes.
eights
Data Science template with focus on prewritten workflows
eis
null
energywise
An energy analytics tool to make commercial building more energy efficient
envfile
POSIX-compatible script like env to run a program in an environment modified by values set in a file
experiment-designer
Design triage experiments on the web
fellows-dict
A dictionary to describe DSSG's fellows and how awesome they are
givinggraph
An API tool to help understand the relationships between non-profits, for-profits, and the causes they support.
growth-curves
Statistical models of children's growth curves that predict which kids are at risk of obesity.
healthleads-public
The public repo for the 2014 DSSG Health Leads project
hitchhikers-guide
The Hitchhiker's Guide to Data Science for Social Good
hiv-retention-public
null
homelessness-public
null
hylas
Webapp for visualizing ML'd data
identifiability
null
il-dmr
Illinois EPA Discharge Monitoring Reports download and import
innovation-ecosystems
Understanding city innovation hotspots using the Census CitySDK
inspections
null
install-cli
Bash library for guided installation & bootstrapping
johnson-county-ddj-public
null
land-bank
Analytics tool to help the Cook County Land Bank acquire vacant and abandoned properties strategically.
learning
What fellows are learning about data problems and tools
LID-bills
This repository grabs state legislative documents from Legiscan for use in DSSG's Legislative Influence Detector
lorax
Speaks for the trees by providing individual feature importances from random forests.
machine_learning_legislation
Automatically identify earmarks in congressional spending bills
marketplace
null
match.edu
Predictive models to identify high-achieving high school students who are likely to undermatch - attend 2-year rather than 4-year colleges, or not go to college at all.
memphis-public
Public repository for the DSSG Memphis project
metta-data
Train Matrix and Test Matrix Storage
mexico-public
Public facing Mexico repository
milwaukee_public
milwaukee_public
MLforPublicPolicy
Class resources for CAPP 30254 (Machine Learning for Public Policy)
MS2Postgres
A tool to move data from SQL Server to PostgreSQL in an environment with limited harddrive space.
nfp
Impact evaluation of the Nurse-Family Partnership nonprofit
obscuritext
Transform text to be unreadable but still somewhat useful
openenergysaver
null
panopticon
The command center at the DSSG office. http://en.wikipedia.org/wiki/Panopticon
pgdedupe
A simple command line interface to the datamade/dedupe library.
philonous
A markup language for describing Machine Learning results and tools to use it
plenario
RESTful API for geospatial and time aggregation across multiple open datasets.
plenario-sf
Early fork of Plenar.io with custom functions written for SF and Chicago use cases.
police-eis
DSaPP police early intervention system: using machine learning to predict adverse incidents
policy_diffusion
Tracing policy ideas from think tanks and lobbyists through state legislative bills
predicting_student_enrollment_public
Statistical models and analysis of student enrollment in Chicago Public Schools
project_template
A template for a sample DSSG project.
publicsafety
Exploratory analysis and spatial correlation tests of whether jail inmate releases are associated with crime spikes in Chicago
Random_Forest_Imputer
Automatic missing value imputation using random forests
rcra
null
repo-scraper
Search for potential passwords/data leaks in a folder or git repo
results-schema
Store results of modeling runs in a relational database
sanergy-public
null
sedesol-public
null
signalled-timeout
Timeout library for generic interruption of main thread by an exception after a configurable duration.
sklearn_tutorial
Short tutorial on some pipeline issues
streetlights-crime
Statistical models to find whether Chicago street light outages are associated with increased crime
student-early-warning
Using machine learning to predict high school dropouts
stupid-csv-tricks
Code for doing slightly atypical things with CSVs
syracuse_public
null
timechop
generate time splits for temporal validation
triage
General Purpose Risk Modeling and Prediction Toolkit for Policy and Social Good Problems
tweedr
A machine learning API to analyze tweets during disasters.
tyra
Prediction model evaluation
UPSG
A set of tools and conventions to help data scientists share code
ushine-learning
An API that uses machine learning to help the Ushahidi nonprofit do smarter crisis crowdsourcing.
weather2pgsql
Download NOAA weather for a user-specified US state
where-we-work
visualizing Chicago employment data
wikienergy
Git repo for Wiki Energy project

This work is supported by the National Institutes of Health's National Center for Advancing Translational Sciences, Grant Number U24TR002306. This work is solely the responsibility of the creators and does not necessarily represent the official views of the National Institutes of Health.