broadinstitute/htslib

Name: htslib

Owner: Broad Institute

Description: C library for high-throughput sequencing data formats. This is the repository for any necessary gamgee-specific changes.

Created: 2014-03-16 17:29:08.0

Updated: 2014-09-03 14:27:23.0

Pushed: 2014-11-10 16:05:37.0

Homepage:

Size: 5537

Language: C

GitHub Committers

UserMost Recent Commit# Commits

Other Committers

UserEmailMost Recent Commit# Commits

README

HTSlib is an experimental implementation of a unified C library for accessing common file formats, such as SAM and VCF, used for high-throughput sequencing data. The library is contained in the htslib subdirectory. Each command-line tool is implemented in one C source code file in the toplevel directory. HTSlib only depends on zlib. It is known to be compatible with gcc, g++ and clang.

HTSlib implements a generalized BAM index. The command-line tools bamidx and tabix write the old index by default, but can also write a new index with file extension .csi (coordinate-sorted index). The HTSlib file reader first looks for the new index and then for the old if the new index is absent.

HTSlib is unfinished. It has not been tested on large-scale real data. Some useful APIs are missing.


This work is supported by the National Institutes of Health's National Center for Advancing Translational Sciences, Grant Number U24TR002306. This work is solely the responsibility of the creators and does not necessarily represent the official views of the National Institutes of Health.