bio-tools/Tool-Information-Standard

Name: Tool-Information-Standard

Owner: bio.tools

Description: An information standard for the description of bioinformatics software tools.

Created: 2018-05-15 10:27:27.0

Updated: 2018-05-15 13:23:10.0

Pushed: 2018-05-15 13:23:08.0

Homepage: null

Size: 1224

Language: null

GitHub Committers

UserMost Recent Commit# Commits

Other Committers

UserEmailMost Recent Commit# Commits

README

Tool Information Standard

An information standard for the description of bioinformatics software tools.

The standard described below represents a “respectable beta” and will be implemented in https://bio.tools. Please join the discussion on GitHub <https://github.com/bio-tools/biotoolsSchema/issues/77>_

biotoolsSchema <https://github.com/bio-tools/biotoolsSchema/>_ provides the foundation for an information standard for the description of tools that is being adopted by https://bio.tools. The standard has two components:

Adherence to some of guidelines is verified by automated QC checks within bio.tools, whereas others must be verified by manual inspection.

The standard provides a basis for monitoring of content and labelling of bio.tools entries in various ways:

If and how to use labels is currently under discussion <https://github.com/bio-tools/biotoolsSchema/issues/77>_.

The standard will be applied to bio.tool as follows:

.. image:: images/information_standard.png

Why is a standard needed at all?

There are several reasons why a community-defined standard for tool information is helpful:

Description of attributes

Attributes defined as mandatory in the latest stable schema (biotools-2.0.0 <https://github.com/bio-tools/biotoolsSchema/tree/master/versions/biotools-2.0.0>_) are those in the “OKAY” tier plus “Scientific operation” from the “GOOD” tier.

General attributes ^^^^^^^^^^^^^^^^^^ .. csv-table:: :header: “Attribute”, “Description”, “Format”, “element” :widths: 15, 75, 10, 10

Name“, “Canonical software name assigned by the software developer or service provider”, “Text”, “<name>” “Description“, “Short and concise textual description of the software function”, “Text”, “<description>” “Homepage“, “Homepage of the software, or some URL that best serves this purpose”, “URL”, “<homepage>” “Unique ID“, “Unique ID of the tool that is assigned upon registration of the software in bio.tools”, “Text (URL-safe version of tool name)“, “<toolID>” “Tool type“, “The type of application software: a discrete software entity can have more than one type.“, “enum (from biotoolsSchema, see below)“, “<toolType>” “Scientific topics“, “General scientific domain the software serves or other general category, e.g. 'Proteomics'“, “Term and / or URI of EDAM Topic <http://edamontology.org/topic_0004> concept(s) (1)“, “<topic>” “Publications“, “Publications about the software”, “DOI, PMID or PMCID”, “<publication>” “Scientific operations“, “The basic operation(s) performed by the software, e.g. 'Multiple sequence alignment'“, “Term and / or URI of EDAM Operation <http://edamontology.org/operation_0004> concept(s)“, “<function><operation>” “Operating system“, “The operating system supported by a downloadable software package.“, “enum (from biotoolsSchema)“, “<labels><OperatingSystem>” “Language“, “Name of programming language the software source code was written in, e.g. 'C'.“, “<language>” “License“, “Software or data usage license”, “enum (from biotoolsSchema)“, “<labels><license>” “Type of input & output data“, “Type of primary input / output data (if any), e.g. 'Protein sequences'“, “Term and / or URI of EDAM Data <http://edamontology.org/data_0006> concept(s)“, “<function><input>/<output><data>” “Supported data formats“, “Allowed format(s) of primary inputs/outputs, e.g. 'FASTA'“, “Term and / or URI of EDAM Format <http://edamontology.org/format_1915> concept(s)“, “<function><input>/<output><format>

"**Scientific benchmark**", "Scientific benchmarking results.", "Link", "``<link><type>Scientific benchmark</type>``"
"**Technical monitoring**", "Technical monitoring results.", "Link", "``<link><type>Technical monitoring</type>``"

“Documentation” group ^^^^^^^^^^^^^^^^^^^^^ .. csv-table:: :header: “Attribute”, “Description”, “Format”, “element” :widths: 15, 75, 10, 10

General documentation“, “General documentation”, “URL”, “<documentation><type>General</type>” “Manual“, “Information on how to use the software.“, “URL”, “<documentation><type>Manual</type>” “API documentation“, “Human-readable API documentation.“, “URL”, “<documentation><type>API documentation</type>” “API specification“, “File providing an API specification for the software, e.g. Swagger/OpenAPI, WSDL or RAML file.“, “URL”, “<download><type>API specification</type>

“Code availability” group ^^^^^^^^^^^^^^^^^^^^^^^^^
.. csv-table:: :header: “Attribute”, “Description”, “Format”, “element” :widths: 15, 75, 10, 10

Repository“, “Link to repository where source code, data and other files may be downloaded”, “URL”, “<link><type>Repository</type>” “Source code“, “Software source code.“, “URL”, “<download><type>Source code</type>” “Source package“, “Source package (of various types) for the software.“, “URL”, “<download><type>Source package</type>

“Accessibility” group ^^^^^^^^^^^^^^^^^^^^^ .. csv-table:: :header: “Attribute”, “Description”, “Format”, “element” :widths: 15, 75, 10, 10

Terms of use“, “Rules that one must agree to abide by in order to use a service.“, “URL”, “<link><type>Terms of use</type>” “Accessibility“, “Whether the software is freely available for use.“, “enum (from biotoolsSchema)“, “<labels><Accessibility>” “Cost“, “Monetary cost of acquiring the software.“, “enum (from biotoolsSchema)“, “<labels><Cost>

“Support” group ^^^^^^^^^^^^^^^^^ .. csv-table:: :header: “Attribute”, “Description”, “Format”, “element” :widths: 15, 75, 10, 10

Helpdesk“, “Helpdesk providing support in using the software.“, “URL”, “<link><type>Helpdesk</type>” “Issue tracker“, “Link to tracker for software issues, bug reports, feature requests etc.“, “URL”, “<link><type>Issue tracker</type>” “Mailing list“, “Link to mailing list for software announcements, discussions, support etc.“, “URL”, “<link><type>Mailing list</type>” “Contact person“, “Primary contact, e.g. a person, helpdesk or mailing list”, “Name, email, URL and/or ORCID iD”, “<credit><typeRole>Primary contact</typeRole>

“Downloads” group ^^^^^^^^^^^^^^^^^ .. csv-table:: :header: “Attribute”, “Description”, “Format”, “element” :widths: 15, 75, 10, 10

Biological data“, “Biological data, or a web page on a database portal where such data may be downloaded.“, “URL”, “<download><type>Biological data</type>” “Binaries“, “Binaries for the software.“, “URL”, “<download><type>Binaries</type>” “Binary package“, “Binary package for the software.“, “URL”, “<download><type>Binary package</type>” “Container file“, “Container file including the software.“, “URL”, “<download><type>Container file</type>” “CWL file“, “Common Workflow Language (CWL) file for the software.“, “URL”, “<download><type>CWL file</type>” “Ontology“, “A file containing an ontology, controlled vocabulary, terminology etc.“, “URL”, “<download><type>Ontology</type>” “VM image“, “Virtual machine (VM) image for the software.“, “URL”, “<download><type>VM image</type>” “Tool wrapper (galaxy)“, “Galaxy tool configuration file (wrapper) for the software.“, “URL”, “<download><type>Tool wrapper (galaxy)</type>” “Tool wrapper (taverna)“, “Taverna configuration file for the software.“, “URL”, “<download><type>Tool wrapper (taverna)</type>” “Tool wrapper (other)“, “Workbench configuration file (other than taverna, galaxy or CWL wrapper) for the software.“, “URL”, “<download><type>Tool wrapper (other)</type>

.. note:: (1) EDAM is a simple ontology of well established, familiar concepts that are prevalent within bioinformatics, including types of data and data identifiers, data formats, operations and topics. EDAM provides a set of terms with synonyms and definitions - organised into an intuitive hierarchy for convenient use. You can find EDAM on GitHub <https://github.com/edamontology/edamontology>_.


This work is supported by the National Institutes of Health's National Center for Advancing Translational Sciences, Grant Number U24TR002306. This work is solely the responsibility of the creators and does not necessarily represent the official views of the National Institutes of Health.