FASTGenomics/FASTGenomics_Data_Package_Format

Name: FASTGenomics_Data_Package_Format

Owner: FASTGenomics

Description: Description of the FASTGenomics Data Package Format

Created: 2017-10-19 13:02:32.0

Updated: 2017-10-24 13:36:31.0

Pushed: 2017-12-22 10:09:42.0

Size: 12

README

The FASTGenomics Data Package Format

Introduction

Single-cell RNA-seq datasets typically consist of several data tables, all of which are required to understand an experiment. The FASTGenomics ecosystem for single-cell RNA-seq analyses provides functionality to make these analyses as easy and convenient as possible. To enable data analysis with FASTGenomics, a dataset must be provided in a defined format, which is detailed below. Briefly, a dataset consists of files containing expression data and metadata describing the cells, the genes, and the experimental conditions. To reduce disk space, all of these files are bundled into one ZIP file. In addition to the package description below, the example folder in this repository contains correctly structured, but not zipped, files that illustrate what a FASTGenomics Data Package should look like. Furthermore, an R-based step-by-step tutorial explains how to create a FASTGenomics Data Package.
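The bundling step could be sketched in Python as follows. This is only an illustration, not part of the FASTGenomics tooling: the function name `bundle_package` is hypothetical, and the sketch assumes that the package files (including manifest.yml) sit at the top level of the archive, which this README does not confirm.

```python
import zipfile
from pathlib import Path


def bundle_package(src_dir, zip_path):
    """Zip every file below src_dir into zip_path and return the archived names.

    Assumption (not confirmed by the format description): files are stored
    with paths relative to src_dir, so manifest.yml ends up at the archive root.
    """
    src = Path(src_dir)
    # manifest.yml is the one file name this README mentions explicitly.
    if not (src / "manifest.yml").is_file():
        raise FileNotFoundError("manifest.yml not found in " + str(src))

    out = Path(zip_path).resolve()
    names = []
    with zipfile.ZipFile(out, "w", compression=zipfile.ZIP_DEFLATED) as zf:
        for path in sorted(src.rglob("*")):
            # Skip directories and, if it lives inside src_dir, the archive itself.
            if path.is_file() and path.resolve() != out:
                arcname = path.relative_to(src).as_posix()
                zf.write(path, arcname)
                names.append(arcname)
    return names
```

Calling `bundle_package("example", "my_dataset.zip")` on a local copy of the example folder would produce a single ZIP file. The output name here is a placeholder.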

Components of a FASTGenomics Data Package

The following table gives an overview of files that have to be included in a FASTGenomics dataset package:

Data Package Description

The data package description is supplied via the manifest.yml file, which has the following structure: