wtsi-hgi/tarchecksum

Name: tarchecksum

Owner: Wellcome Trust Sanger Institute - Human Genetics Informatics

Description: null

Created: 2014-11-12 10:20:03.0

Updated: 2016-10-26 10:19:25.0

Pushed: 2016-10-26 10:19:24.0

Homepage: null

Size: 74

Language: Python

README

tarchecksum

This module computes checksums for all the files in a tar archive and compares them against the checksums of the original files. If the md5 sums do not match, an error message is reported. It does not check whether the directory and the tar contain identical sets of files; it only reads the members of the tar and checks each one against the corresponding original file. The archive members are checksummed by streaming the list of members and their metadata from the tar, and computing the md5 checksum of each member after extracting it.
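The approach described above can be sketched roughly as follows. This is a minimal illustration, not the code in tarcheck.py: the function names `md5_of_stream` and `compare_tar_with_dir` and the `CHUNK_SIZE` constant are hypothetical, the sketch is written for Python 3, and it assumes that member names in the tar are paths relative to the directory passed in (a real archive may carry a leading directory component that has to be stripped first).

```python
import hashlib
import os
import tarfile

# Assumption: 1 MiB chunks; any fixed size keeps memory use bounded.
CHUNK_SIZE = 1024 * 1024


def md5_of_stream(fileobj):
    """Return the hex md5 digest of a binary file-like object, read in chunks."""
    digest = hashlib.md5()
    for chunk in iter(lambda: fileobj.read(CHUNK_SIZE), b""):
        digest.update(chunk)
    return digest.hexdigest()


def compare_tar_with_dir(tar_path, dir_path):
    """Return a list of (member_name, reason) for tar members that do not match.

    Only regular files found in the tar are checked. Files that exist in
    dir_path but not in the tar are never looked at.
    """
    mismatches = []
    # "r|*" reads the archive as a forward-only stream and detects the
    # compression (e.g. bz2) automatically.
    with tarfile.open(tar_path, mode="r|*") as tar:
        for member in tar:
            if not member.isfile():
                continue  # skip directories, links and special files
            # Assumption: member names are relative to dir_path.
            original = os.path.join(dir_path, member.name)
            if not os.path.isfile(original):
                mismatches.append((member.name, "missing from directory"))
                continue
            # In stream mode the member must be read before moving on.
            tar_md5 = md5_of_stream(tar.extractfile(member))
            with open(original, "rb") as fh:
                dir_md5 = md5_of_stream(fh)
            if tar_md5 != dir_md5:
                mismatches.append((member.name, "md5 mismatch"))
    return mismatches
```

Calling `compare_tar_with_dir("/path/to/archive/archive.tar.bz2", "/path/to/the/archived/dir")` would return an empty list when every file in the tar matches its original. Reading both the tar member and the original file in fixed-size chunks means that memory use does not grow with file size.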

Example:

$ python2.7 tarcheck.py --tar_path /path/to/archive/archive.tar.bz2 --dir /path/to/the/archived/dir

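where:

--tar_path is the path to the tar archive to check
--dir is the path to the directory that was archived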

Note: tarcheck checks what is in the tar against the corresponding files in the dir. It does not check whether everything in the dir has been archived; it only checksums the files in the tar and compares them with the checksums of the files in the dir given as an argument.

It uses less than 100 MB of memory to run.


This work is supported by the National Institutes of Health's National Center for Advancing Translational Sciences, Grant Number U24TR002306. This work is solely the responsibility of the creators and does not necessarily represent the official views of the National Institutes of Health.