Name: EntityExtractor
Owner: Transparency Toolkit
Description: Extracts entities and terms matching certain patterns.
Created: 2014-02-23 07:42:46.0
Updated: 2015-07-02 00:42:28.0
Pushed: 2015-04-20 22:32:55.0
Homepage: null
Size: 188
Language: Ruby
This is a tool for extracting entities and terms from text fields in JSON files.
Currently it supports:
It used to support (and will soon support again):
To use any of the methods, first create a TermExtractor:
    t = TermExtractor.new(File.read("input/file.json"), ["fields", "to", "extract", "from"], "name_of_output_field")
Then, for set terms:
    t.extractSetTerms(File.read("list/of/terms.json"), ["field", "with", "extraction", "terms"], "if it is case sensitive")
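The data flow described above (read JSON, look at the chosen fields, write matches to an output field) can be sketched without the library. This is only an illustration, not EntityExtractor's implementation: `tag_records` is a hypothetical helper, the input is assumed to be a JSON array of objects, and case-insensitive substring matching is an assumed simplification.

```ruby
require "json"

# Hypothetical helper, not part of EntityExtractor: for each record, collect
# the terms that occur in the chosen fields and store them in output_field.
def tag_records(input_json, fields, output_field, terms)
  records = JSON.parse(input_json) # assumed shape: an array of objects
  records.map do |record|
    # Join the text of the requested fields; missing fields become "".
    text = fields.map { |f| record[f].to_s }.join(" ").downcase
    matches = terms.select { |term| text.include?(term.downcase) }
    record.merge(output_field => matches)
  end
end

input = JSON.generate([
  { "title" => "Signals analyst", "body" => "Worked with SIGINT tools" },
  { "title" => "Cook", "body" => "Prepared meals" }
])

tagged = tag_records(input, ["title", "body"], "extracted_terms", ["sigint", "analyst"])
puts JSON.pretty_generate(tagged)
```

Each record keeps its original fields and gains one extra field holding the matched terms, which mirrors the role of "name_of_output_field" in the constructor call.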
Input JSONs are of the form:
    {
      "Term to map matches to": {
        "Search Terms": ["Array", "of", "terms"],
        "Case Sensitive?": "Yes"
      }
    }
Alternatively, a simple array of terms to look for also works.
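As a rough sketch of how the two term-list shapes could be handled uniformly, the snippet below converts either one into a single structure and then matches text against it. `normalize_term_list` and `matching_labels` are hypothetical names, not library methods. Two assumptions are made: only "Yes" counts as case sensitive, and terms in the plain-array form are matched case-insensitively.

```ruby
require "json"

# Hypothetical normalizer: turn either term-list shape into
# { label => { terms: [...], case_sensitive: true/false } }.
def normalize_term_list(term_json)
  parsed = JSON.parse(term_json)
  if parsed.is_a?(Array)
    # Simple array form: each term is its own label.
    # Case-insensitive matching is an assumed default.
    parsed.to_h { |term| [term, { terms: [term], case_sensitive: false }] }
  else
    parsed.to_h do |label, spec|
      # Assumption: any value other than "Yes" means case-insensitive.
      [label, { terms: spec["Search Terms"],
                case_sensitive: spec["Case Sensitive?"] == "Yes" }]
    end
  end
end

# Return the labels that have at least one search term occurring in text.
def matching_labels(text, term_list)
  hits = term_list.select do |_label, spec|
    spec[:terms].any? do |term|
      spec[:case_sensitive] ? text.include?(term) : text.downcase.include?(term.downcase)
    end
  end
  hits.keys
end

hash_form = JSON.generate({
  "NSA" => { "Search Terms" => ["NSA", "National Security Agency"],
             "Case Sensitive?" => "Yes" }
})
list = normalize_term_list(hash_form)

p matching_labels("Contractor for the National Security Agency", list)
p matching_labels("a bonsai tree", list)
```

The second call shows why case sensitivity matters for short acronyms: "bonsai" contains "nsa" in lowercase, so a case-insensitive search would report a false match.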
The following options are available for case sensitivity:
Output Methods-