The ContentMine

Login: ContentMine

Company: null

Location: UK

email: team@contentmine.org

Blog: http://contentmine.org

Members

  1. Emanuil Tolev
  2. Erin C. McKiernan
  3. Graham Steel
  4. Jenny Molloy
  5. Richard Smith-Unna
  6. Ross Mounce
  7. Stefan Kasberger
  8. null
  9. null
  10. null
  11. null

Repositories

2015-11-07-mozfest15
null
2015-11-15-OpenCon15
ContentMine event at OpenCon2015 Brussels
2015-12-10-lifesciences
Information for participants of the life sciences workshop.
2015-12-11-dnadigest
Information for participants of the DNAdigest hackday.
2016-03-10-hackathon
10-11th March 2016 ContentMine hackathon - mining for life sciences
admin
Catchup meetings and tasks
AdvisoryBoard
null
ami
null
amidemos
null
ami-deprecated
A generic tool for searching text and graphical objects, customisable by users.
api-demo
Containing code for use-cases and demonstrations of the ContentMine API.
assets
For storing assets like logos
CambridgeChemistryWorkshopSep2015
null
cam_daily
Cambridge daily runner
canary
Canary is a UI to the contentmine tools getpapers, quickscrape, norma, and ami.
canary-perch
ES Academic paper fact extraction - backend for canary
Chicago-20141114
ContentMine workshop in Chicago (US), November 14th 2014
cmapi
The ContentMine API
cmbot
An autonomous bot for scraping the academic literature
cm-crawlerd
ContentMine crawler daemon - this finds the latest articles in journals we mine, and stores them in our scraping queue
cm-pom
Parent POM for ContentMine Java/MVN stack
cm-scraperd
ContentMine scraper daemon - this collects metadata, and files for articles in the queue, and stores them in our fact-extraction queue
CMServices
Web services layer for ContentMine text and data mining tools and utilities
cm-ucl
A repository to openly track progress on table extraction.
cm-uclii
Data and progress tracking for table extraction and semantically guided content enhancement
contentmine-app
The ContentMine ecosystem as a standalone app for OSX, Windows and Linux.
contentmine.github.io
ContentMine installation instructions website
contentmine-gui
GUI for executing ContentMine commands - browser SPA for running locally on user's machine.
contentmine.org
The static site
contentmine_virt
Virtual environment for running all of contentmine's software easily.
ContentMineWeb
Repository for tracking bugs in the new CM website
cproject
ArgProcessor and files for basic CMDirectories. Often subclassed. Needs to be separate from euclid and norma
crossref
repo for scraping metadata from Crossref.
CTree
null
datastreams
Output datastream plugins for ContentMine
diagramanalyzer
Library to build diagram primitives and diagrams from vectors (and possibly pixels)
dictionaries
Dictionaries for use with `ami` , including some management software
ebi_workshop_20141006
ContentMine workshop at EBI, October 6th 2014
EBI_workshop_20150330
null
elpub17-workshop
ContentMining Workshp for ELPUB 17 Conference at Limassol, Cyprus
euclid
ContentMine Fork of the WWMM Euclid Package
executables
repo for executables (so as not to bloat projects)
force2015_workshop
Materials for the Force2015 ContentMine Workshop 3pm-6pm Sunday 11th January
FutureTDM
Materials of FutureTDM project
getpapers
Get metadata, fulltexts or fulltext URLs of papers matching a search query
grobid
tools and codes to run grobid
gui
Client-side GUI for managing commandline submission
html
ContentMine Fork of the WWMM html Package
ijsem
Computational results of PLUTo ami-phylo analysis of trees from Int. J. Syst. Evol. Microbiol.
imageanalysis
ContentMine Fork of the WWMM imageanalysis Package
JISC-Workshop-1Dec2014
Workshop resources for one day workshop at JISC on 1 Dec 2014
journal-scrapers
Journal scraper definitions for the ContentMine framework
karoo
CLI tool for canary
mailmap
null
meta
A repository in which to file and fix meta issues (issues affecting more than one ContentMine repo or project)
neuro
Neurophysiology, especially voltage traces
nhtml
NHTML is a normalization of scholarly documents from {PDF, HTML, XML, SVG, PNG} into a single semantic format
node-journalTOCs
Node.js client for the JournalTOCs API
norma
Convert XML/SVG/PDF into normalised, sectioned, scholarly HTML
old_site
The contentmine site, which (currently) includes the API
openArticles
A collection of OpenAccess (CC-BY) and Public Domain articles for use in workshops and development
package-managers
Collection of scripts / config files for building packages of the ContentMine software
pdf2svg
ContentMine Fork of the WWMM pdf2svg Package
phylotree
A repository for ami-phylotree development
pyCMine
Python scripts for downstream analyses of content mine extracted facts, mostly comming from pyCProject
pyCProject
Provides basic function to read a ContentMine CProject and CTrees into python datastructures.
quickscrape
A scraping command line tool for the modern web
releases
Release packages for ContentMine projects
sage_contentmine.org
sage wordpress theme for contentmine.org
SciDataCon2014
Workshop resources for half-day workshop at SciDataCon New Delhi on 2 Nov 2014
scraperJSON
The scraperJSON standard for defining web scrapers as JSON objects
scripts
Shell and Python scripts for utility activities
selectorgadget
Go go CSS / DOM inspection.
svg
ContentMine Fork of the WWMM svg Package
svg2xml
ContentMine Fork of the WWMM svg2xml Package
svghtml
Combined SVG and HTML repos and building functionality
thresher
Headless scraperJSON scraping for Node.js
tilburg
Extraction of data from Vector-based Funnel Plots in the scholarly literature
visualizations
null
vms
ContentMine virtual machines
vt-open-data-week
Virginia Tech workshop
wellcome-2015-files
VM contents for the Wellcome Trust 2015 Workshop
WellcomeTrust-April2015
Workshop resources for two day workshop at the Wellcome Trust 13-14 April 2015
wikifactmine-api
The WikifactMine API Endpoint
wlic-2017
World Library and Information Congress 2017 ContentMine Workshop
workshop-resources
This repository contains material helping you to set up a ContentMine workshop. It also includes tutorials for learning the ContentMine tools on your own.
workshops
General materials for workshops
zen-upload
NodeJS client for uploading to Zenodo

This work is supported by the National Institutes of Health's National Center for Advancing Translational Sciences, Grant Number U24TR002306. This work is solely the responsibility of the creators and does not necessarily represent the official views of the National Institutes of Health.