NCATS-Tangerine/pinto

Name: pinto

Owner: NCATS Data Translator Project - Tangerine Team

Description: Wikidata Semantic Query API

Forked from: SuLab/garbanzo

Created: 2017-05-11 12:31:55.0

Updated: 2017-05-11 12:32:20.0

Pushed: 2017-05-11 14:50:45.0

Homepage: null

Size: 41

Language: Python

GitHub Committers

UserMost Recent Commit# Commits

Other Committers

UserEmailMost Recent Commit# Commits

README

pinto

Context

Pinto is an NCATS Translator Knowledge Beacon.

It broadcasts the availability of concepts to the federated Translator knowledgebase.

It wraps a chem2bio2rdf data service.

The underlying service is the RENCI Stars Blazegraph instance.

Pinto Challenges
  1. Pinto is a fork of Garbanzo, a knowledge beacon wrapping Wikidata. While Wikidata has significant ontological structure, especially around the notion of concepts, chem2bio2rdf is an earlier linked data effort and has very little structure representing conceptual abstractions. So for the beacon to tell a client what concepts it is able to serve, it must query a number of different structures, and those structures must be addressed independently in the beacon implementation.

  2. Similarly related to this low level of structure, the data is highly denormalized. So, to know where the string “Aspirin” occurs, there's not one rdfs:label that's referenced from lots of places. Neither are all strings predicates of an rdfs:label. Many are targets of an arbitrary identifier. Again, to answer where that string occurs requires manually addressing a number of locations in the data selected by a relatively manual, curatorial process.

Usage
install -r requirements.txt
on app.py

Then go to the Swagger endpoint.


This work is supported by the National Institutes of Health's National Center for Advancing Translational Sciences, Grant Number U24TR002306. This work is solely the responsibility of the creators and does not necessarily represent the official views of the National Institutes of Health.