TransparencyToolkit/EntityExtractor

Name: EntityExtractor

Owner: Transparency Toolkit

Description: Extracts entities and terms matching certain patterns.

Created: 2014-02-23 07:42:46.0

Updated: 2015-07-02 00:42:28.0

Pushed: 2015-04-20 22:32:55.0

Homepage: null

Size: 188

Language: Ruby


README

This is a tool for extracting entities and terms that match certain patterns from text fields in JSON files.

Currently it supports:

It used to support (and will soon support again):

To use any methods:

TermExtractor.new(File.read("input/file.json"), ["fields", "to", "extract", "from"], "name_of_output_field")
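To make the roles of the three arguments concrete, here is a minimal standalone sketch. It does not use EntityExtractor itself. It assumes the input file is a JSON array of objects, and `annotate_items`, the sample field names, and the capital-letter pattern are all hypothetical stand-ins chosen for illustration.

```ruby
require "json"

# Hypothetical stand-in for an extractor (not EntityExtractor's code):
# joins the named fields of each item into one string, passes that string
# to the block, and stores the block's result under output_field.
def annotate_items(json_text, fields, output_field)
  items = JSON.parse(json_text) # assumption: an array of objects
  items.map do |item|
    text = fields.map { |field| item[field].to_s }.join(" ")
    item.merge(output_field => yield(text))
  end
end

input = '[{"title": "Memo on ACME", "body": "ACME and XYZ met."}]'

# Toy pattern: runs of two or more capital letters.
result = annotate_items(input, ["title", "body"], "caps_terms") do |text|
  text.scan(/\b[A-Z]{2,}\b/).uniq
end

puts JSON.pretty_generate(result)
```

Each output item keeps its original fields and gains one new field holding the matches, which is the general idea behind the output field name passed to the constructor.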

Then, for set terms:

extractSetTerms(File.read("list/of/terms.json"), ["field", "with", "extraction", "terms"], "if it is case sensitive")

Input JSONs are of the form:

{
  "...m to map matches to": {
    "Search Terms": ["Array", "of", "terms"],
    "Case Sensitive?": "Yes"
  }
}
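A term list in this shape can be generated with Ruby's standard `json` library instead of being written by hand. The sketch below is illustrative only: `build_term_list` is a hypothetical helper, the labels and terms are made up, and the case-sensitivity value is passed through unchanged because `"Yes"` is the only value shown here.

```ruby
require "json"

# Hypothetical helper (not part of EntityExtractor): builds a term list in
# the shape shown above from a hash of { label => [terms] }.
def build_term_list(terms_by_label, case_sensitive = "Yes")
  terms_by_label.each_with_object({}) do |(label, terms), list|
    list[label] = {
      "Search Terms" => terms,
      "Case Sensitive?" => case_sensitive
    }
  end
end

term_list = build_term_list({
  "Example Agency" => ["Example Agency", "EA"],
  "Example Program" => ["EXAMPLEPROG"]
})

term_list_json = JSON.pretty_generate(term_list)
puts term_list_json
```

The resulting string could then be written to a file and read back with `File.read`, as in the calls above.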

Alternatively, a simple array of terms to look for also works.
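Since a term list may be either the keyed form or a bare array, anything that consumes one has to handle both shapes. The following sketch shows one way that could look. It is not EntityExtractor's implementation: `normalize_term_list` and `find_terms` are hypothetical names, a bare array is assumed to map each term to itself, any "Case Sensitive?" value other than "Yes" is assumed to mean case-insensitive, and matching is plain substring matching.

```ruby
require "json"

# Hypothetical helper: converts either accepted shape into
# { label => { terms: [...], case_sensitive: true/false } }.
def normalize_term_list(json_text, default_case_sensitive = false)
  parsed = JSON.parse(json_text)
  if parsed.is_a?(Array)
    # Assumption: in the bare-array form each term acts as its own label.
    parsed.each_with_object({}) do |term, out|
      out[term] = { terms: [term], case_sensitive: default_case_sensitive }
    end
  else
    parsed.each_with_object({}) do |(label, spec), out|
      out[label] = {
        terms: spec["Search Terms"],
        # Assumption: only "Yes" means case sensitive.
        case_sensitive: spec["Case Sensitive?"] == "Yes"
      }
    end
  end
end

# Hypothetical helper: returns the labels whose terms occur in the text.
def find_terms(text, term_list)
  matched = term_list.select do |_label, spec|
    flags = spec[:case_sensitive] ? 0 : Regexp::IGNORECASE
    spec[:terms].any? do |term|
      text.match?(Regexp.new(Regexp.escape(term), flags))
    end
  end
  matched.keys
end

keyed = '{"Example Program": {"Search Terms": ["EXAMPLEPROG"], "Case Sensitive?": "Yes"}}'
plain = '["surveillance", "intercept"]'
text = "Notes on the EXAMPLEPROG Surveillance pilot"

keyed_matches = find_terms(text, normalize_term_list(keyed))
plain_matches = find_terms(text, normalize_term_list(plain))
```

Normalizing up front keeps the matching code free of shape checks; a real implementation would probably also want word-boundary handling, which this sketch leaves out.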

The following options are available for case sensitivity:

Output Methods-



This work is supported by the National Institutes of Health's National Center for Advancing Translational Sciences, Grant Number U24TR002306. This work is solely the responsibility of the creators and does not necessarily represent the official views of the National Institutes of Health.