uwescience/ds4ad

Name: ds4ad

Owner: UW eScience Institute

Description: Data Science for Administrative Data

Created: 2018-04-06 16:00:04.0

Updated: 2018-04-26 09:11:34.0

Pushed: 2018-04-26 09:11:32.0

Homepage: https://uwescience.github.io/ds4ad/

Size: 1553

Language: HTML

GitHub Committers

UserMost Recent Commit# Commits

Other Committers

UserEmailMost Recent Commit# Commits

README

Data science for administrative datasets

This course is somewhat based on parts of the Software Carpentry curriculum.

Setup:
Day 1: Foundations

9 - 10: Introduction (Ariel)

10 - 12: Foundations of programming in Python (Jose)

Noon - 1 PM lunch

1 - 4 : git and GitHub (Bryna)

Homework day 1:

Set up your own GitHub project

Code Challenges

print(outer('helium'))

print(fence('name', '*'))
e*

Bring a use-case

Tomorrow, tell us about a data use-case that you have in mind for your work:

  1. What is the data?
  2. What are some questions you would like to answer with these data?
  3. How is the data currently stored?
Day 2: Data munging

9 - noon : Introducing Pandas (Bryna)

Noon - 1 PM: lunch

1 - 4 PM: Manipulating data with Pandas (Ariel)

Homework day 1:

Set up your own GitHub project

Day 3: Data analysis and data visualization

9 - noon : Computations and statistics with Pandas DataFrames (Ariel)

Noon - 1 PM: lunch

Afternoon:

1 PM - 2:30 Visualizing data with Matplotlib (Jose)

2:30 - 4 Next steps – where do we go from here? (Bryna + Jose (+ Ariel))


This work is supported by the National Institutes of Health's National Center for Advancing Translational Sciences, Grant Number U24TR002306. This work is solely the responsibility of the creators and does not necessarily represent the official views of the National Institutes of Health.