cyverse-de/heuristomancer

Name: heuristomancer

Owner: CyVerse Discovery Environment

Description: null

Created: 2016-08-29 19:28:36.0

Updated: 2016-08-29 21:56:45.0

Pushed: 2017-11-08 20:45:15.0

Homepage: null

Size: 7533

Language: Clojure

GitHub Committers

UserMost Recent Commit# Commits

Other Committers

UserEmailMost Recent Commit# Commits

README

heuristomancer

A Clojure library for attempting to guess file types.

Heuristomancer uses EBNF files defining context-free grammars with instaparse and a provided sample size (default 1000 bytes). EBNF grammars can be found in resources/ebnf/ and are indexed by resources/grammars.clj for use by Heuristomancer.

TL;DR
Command-Line Usage
splay the help text.
 -jar /path/to/heuristomancer-0.1.0-SNAPSHOT-standalone.jar -h

st the file types recognized by heuristomancer.
 -jar /path/to/heuristomancer-0.1.0-SNAPSHOT-standalone.jar -l

entify the types of some files.
 -jar /path/to/heuristomancer-0.1.0-SNAPSHOT-standalone.jar file1 file2

e a different sample size when identifying files.
 -jar /path/to/heuristomancer-0.1.0-SNAPSHOT-standalone.jar -s 2000 file
Programmatic Usage
 'heuristomancer.core)

dentify a file using the default sample size (1000 characters).
ntify "/path/to/file")

dentify a file with a specified sample size (in number of characters).
ntify "/path/to/file" sample-size)

dentify a sample that's already loaded into memory.
ntify-sample sample)
License

http://iplantcollaborative.org/sites/default/files/iPLANT-LICENSE.txt


This work is supported by the National Institutes of Health's National Center for Advancing Translational Sciences, Grant Number U24TR002306. This work is solely the responsibility of the creators and does not necessarily represent the official views of the National Institutes of Health.