Name: elpc_bakken
Owner: datamade
Description: Bakken well files PDF extraction
Created: 2015-06-24 15:50:48.0
Updated: 2017-01-05 16:59:42.0
Pushed: 2015-08-11 17:27:37.0
Homepage: null
Size: 1788913
Language: HTML
This Makefile extracts text from PDFs, OCRs the images in those PDFs, and extracts data from the results.
To install the dependencies, run apt-get install tesseract-ocr ocrfeeder poppler-utils
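Before running the Makefile it can help to confirm that the tools are actually on the PATH. The following is a minimal sketch, not part of the repository: missing_tools is a hypothetical helper, and the binary names in the usage comment (tesseract, pdftotext, pdfimages, ocrfeeder) are the commands those packages normally provide, which is an assumption about what the Makefile calls.

```shell
#!/bin/sh
# Hypothetical helper (not from the repository): print every tool name
# passed as an argument that cannot be found on the PATH.
missing_tools() {
    for tool in "$@"; do
        # command -v succeeds only when the tool resolves to something runnable
        command -v "$tool" >/dev/null 2>&1 || echo "$tool"
    done
}

# Usage (assumed binary names for the packages listed above):
#   missing_tools tesseract pdftotext pdfimages ocrfeeder
```

If the call prints nothing, every listed tool was found; any name it prints still needs to be installed.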
Put the PDFs in the pdf directory, then run make
To parallelize the work, pass the -j option to make. For example, make -j 8 will use 8 processes.
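Rather than hard-coding 8, the job count can be matched to the machine. This is a sketch under assumptions: job_count is a hypothetical helper, nproc comes from GNU coreutils, and the getconf fallback is assumed to be available on systems without nproc.

```shell
#!/bin/sh
# Hypothetical helper (not from the repository): print how many parallel
# jobs to hand to make -j, based on the number of available processors.
job_count() {
    if command -v nproc >/dev/null 2>&1; then
        nproc
    else
        # Assumed fallback: ask getconf, and use 1 if that fails too
        getconf _NPROCESSORS_ONLN 2>/dev/null || echo 1
    fi
}

# Usage, from the directory that contains the Makefile:
#   make -j "$(job_count)"
```

OCR is CPU-bound, so one job per processor is a reasonable starting point; a lower number leaves headroom on a shared machine.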