sul-dlss/discovery-indexing

Name: discovery-indexing

Owner: Stanford University Digital Library

Description: Repo for central stories related to a number of different code bases associated with automated indexing along with the auditing script for monitoring automated indexing

Created: 2016-07-12 21:09:45.0

Updated: 2016-10-31 14:04:03.0

Pushed: 2017-01-20 19:41:49.0

Homepage:

Size: 2330

Language: Ruby

GitHub Committers

UserMost Recent Commit# Commits

Other Committers

UserEmailMost Recent Commit# Commits

README

Discovery Indexing Audit Script

How to run the script:
Minimum required inputs:
  1. ARGO_URL_DEFAULT - URL to Argo Solr index, eg. http://solr.stanford.edu/solr/prod
  2. SW_URL_DEFAULT - URL to Searchworks Solr index, eg. http://solr.stanford.edu/solr/sw_prod
  3. PF_URL_DEFAULT - URL to purl-fetcher deployment, eg. http://purl-fetcher.stanford.edu
  4. SW_TGT_DEFAULT - Searchworks target, eg. searchworks-stage

Note: All URLs should not include a terminating slash

Optional inputs:
  1. RPT_TYPE_DEFAULT - Which report to run - if none provided, Collections Summary is run as the default
  2. Everything Released Summary - Summary of all collections and items released to the specified Searchworks target
  3. Collections Summary - Summary of all collections released to the specified Searchworks target
  4. Collection-specific Summary - Summary of a specific collection released to the specified Searchworks target
  5. Individual Items Summary - Not implemented yet, but will be a summary of all items released to Searchworks target not as part of a collection
  6. COLL_DRUID_DEFAULT - Collection druid without the druid prefix - required for running reports on a specific collection, e.g. aa111bb2222

In order to run the audit manually, go onto the server to the top-level directory for discovery-indexing and run ./bin/perform_audit. If you get a permissions denied message, just chmod a+x bin/perform_audit

So, for a collection-specific summary report, at the prompts, you need to use the following for the inputs: Report Type: “Collection-specific Summary” Collection Druid: Value of the specific collection druid


This work is supported by the National Institutes of Health's National Center for Advancing Translational Sciences, Grant Number U24TR002306. This work is solely the responsibility of the creators and does not necessarily represent the official views of the National Institutes of Health.