CD2H gitForager

ropensci/pubchunks

Name: pubchunks

Owner: rOpenSci

Description: Get chunks of XML format scholarly articles

Created: 2018-01-12 01:35:35.0

Updated: 2018-01-15 17:45:54.0

Pushed: 2018-01-12 01:35:53.0

Homepage:

Size: 3574

Language: R

GitHub Committers

User	Most Recent Commit	# Commits

Other Committers

User	Email	Most Recent Commit	# Commits

README

pubchunks

Get chunks of XML articles

… still working through whether this will be a package or not and how it would work …

Installation

Development version from GitHub

t latest fulltext
ools::install_github("ropensci/fulltext")
ools::install_github("ropensci/pubchunks")

Load library

ary('pubchunks')

First, get some full text

Using fulltext

ary(fulltext)
 ft_search(query = 'ecology', from = 'crossref')

Then extract chunks

 ft_get(c('10.7554/eLife.03032', '10.7554/eLife.32763'), from = "elife")
% ft_collect() %>% ft_chunks("publisher") %>% ft_tabularize()

Get multiple fields at once

% ft_collect() %>% ft_chunks(c("doi","publisher")) %>% ft_tabularize()

Use dplyr to data munge

ary("dplyr")
%
_collect() %>% 
_chunks(c("doi", "publisher", "permissions")) %>%
_tabularize() %>%
elife %>%
lect(-permissions.license, -permissions.license_url)

Contributors

Scott Chamberlain http://github.com/sckott