ropensci/pubchunks

Name: pubchunks

Owner: rOpenSci

Description: Get chunks of XML format scholarly articles

Created: 2018-01-12 01:35:35.0

Updated: 2018-01-15 17:45:54.0

Pushed: 2018-01-12 01:35:53.0

Homepage:

Size: 3574

Language: R

GitHub Committers

UserMost Recent Commit# Commits

Other Committers

UserEmailMost Recent Commit# Commits

README

pubchunks

Project Status: Concept ? Minimal or no implementation has been done yet, or the repository is only intended to be a limited example, demo, or proof-of-concept. Build Status

Get chunks of XML articles

… still working through whether this will be a package or not and how it would work …

Installation

Development version from GitHub

t latest fulltext
ools::install_github("ropensci/fulltext")
ools::install_github("ropensci/pubchunks")

Load library

ary('pubchunks')
First, get some full text

Using fulltext

ary(fulltext)
 ft_search(query = 'ecology', from = 'crossref')

Then extract chunks

 ft_get(c('10.7554/eLife.03032', '10.7554/eLife.32763'), from = "elife")
% ft_collect() %>% ft_chunks("publisher") %>% ft_tabularize()

Get multiple fields at once

% ft_collect() %>% ft_chunks(c("doi","publisher")) %>% ft_tabularize()

Use dplyr to data munge

ary("dplyr")
%
_collect() %>% 
_chunks(c("doi", "publisher", "permissions")) %>%
_tabularize() %>%
elife %>%
lect(-permissions.license, -permissions.license_url)
Contributors
Meta

rofooter


This work is supported by the National Institutes of Health's National Center for Advancing Translational Sciences, Grant Number U24TR002306. This work is solely the responsibility of the creators and does not necessarily represent the official views of the National Institutes of Health.