ropensci/user2016-tutorial

Name: user2016-tutorial

Owner: rOpenSci

Description: null

Created: 2016-04-13 02:31:58.0

Updated: 2017-11-11 08:34:07.0

Pushed: 2016-12-22 23:16:03.0

Homepage: null

Size: 51293

Language: R

GitHub Committers

UserMost Recent Commit# Commits

Other Committers

UserEmailMost Recent Commit# Commits

README

Extracting data from the web APIs and beyond

Instructors: Karthik Ram, Garrett Grolemund and Scott Chamberlain

No matter what your domain of interest or expertise, the internet is a treasure trove of useful data that comes in many shapes, forms, and sizes, from beautifully documented fast APIs to data that need to be scraped from deep inside of 1990s html pages. In this 3 hour tutorial you will learn how to programmatically read in various types of web data from experts in the field (Founders of the rOpenSci project and the training lead of RStudio). By the end of the tutorial you will have a basic idea of how to wrap an R package around a standard API, extract common non-standard data formats, and scrape data into tidy data frames from web pages.

Background Knowledge
Familiarity with base R and ability to write functions.

Requirements

R with latest versions of httr, rvest, and curl. It would also be helpful to have a recent release of R and RStudio

Target Audience

Any R user with an interest in retrieving data from the web.

Website for materials: All slides from the workshop are available as a PDF in this repo.


This work is supported by the National Institutes of Health's National Center for Advancing Translational Sciences, Grant Number U24TR002306. This work is solely the responsibility of the creators and does not necessarily represent the official views of the National Institutes of Health.