Ed Summers
Login:
edsu
Company:
University of Maryland
Location:
Silver Spring, MD
Bio:
I'm a software developer at @umd-mith and study archives on & of the web in the UMD iSchool.
Blog:
http://inkdroid.org
Blog:
http://inkdroid.org
Member of
- Archives and Linked Data
- code4lib
- DCIC-UMD
- Documenting the Now
- Maryland Institute for Technology in the Humanities
- RDFLib
- null
- null
Repositories
-
1600daily
-
generate rss for White House 1600Daily website (ugh)
-
accessyyz-2015
-
earls results for Access 2015 http://accessconference.ca/
-
agrippa
-
A test repo for teaching humanists about Git & GitHub
-
alllivesmatter
-
Experiments with #AllLivesMatter hashtag in Ferguson related tweets.
-
alternative-internet
-
A collection of interesting new networks and tech aiming at decentralisation (in some form).
-
alto-words
-
simplistic calculation of the ratio of dictionary words to all words in a METS Alto OCR file
-
american-archive-kaldi
-
This repo houses open-source models for Kaldi speech-to-text software that have been trained on public media content.
-
americanarchivist
-
metadata extractor for The American Archivist
-
annotator-okfn
-
Inline annotation for the web in pure Javascript. Select text, images, or (nearly) anything else, and add your notes.
-
annotator-store
-
A backend store for the Annotator
-
anon
-
tweet about anonymous Wikipedia edits from particular IP address ranges
-
antiharassment-policy
-
Code4lib anti-harassment policy drafting space
-
aoir2016
-
URLs Tweeted during AoIR 2106 http://aoir.org/aoir2016/
-
aotycmp
-
hack to see what well reviewed albums-of-the-year are available on Spotify and Rdio
-
apostrophe
-
"But I got the crystal ball", he said, and held it to the light.
-
appdeps
-
Simple commandline tool to check and/or wait for application dependencies.
-
appraisal-talk
-
slides for a talk at SIGCIS
-
archivesspace
-
The ArchivesSpace archives management tool
-
at-you
-
analyze the tweets that are directed at deray & Nettaaaaaaaa
-
awesome-iiif
-
Awesome IIIF-related resources
-
bagger-js
-
Experiment doing BagIt processing in a static web application
-
bagit
-
null
-
bagit-python3
-
Create BagIt style packages of digital content in Python 3.2+
-
bagit-ruby
-
Ruby Library and Command Line tools for bagit
-
bagitspec
-
The BagIt File Packaging Format
-
bagweb
-
mirror a website, put it in a bag
-
baltimore-riots-viewer
-
random #BaltimoreRiot tweets with media content
-
baltimore-uprising-viewer
-
random #BaltimoreUprising tweets with media content
-
beat
-
little experiment to look at links in LC bibliographic data
-
bell
-
Alexander Graham Bell Family Papers Metadata
-
bisac
-
top level BISAC subject vocabulary
-
bitcoin.org
-
Bitcoin.org website
-
bootstrap
-
CSS toolkit from Twitter
-
botnet-retweets
-
Exploring retweets by Twitter bot-nets.
-
bots-seeds-people
-
slides for a paper I'm working on
-
bullshit-detector
-
Building a better bullshit detector for social media based on fact checking sites.
-
c4l15-urls
-
a snapshot of urls mentioned during code4lib 2015
-
c4l18-keynote-statement
-
Code4Lib Community Statement in Support of Chris Bourg
-
cablegate-social-graph
-
tries to display wikileaks cablegate cable messages as a graph
-
Capstone---Generative-Poetry
-
null
-
chi2016
-
urls shared during CHI 2016
-
chroma
-
display searches of Chronicling America as they happen
-
chrome-extension-example
-
null
-
chronam-widget
-
view on NDNP content using just HTML/JavaScript and the Chronicling America API
-
ckanext-datajson
-
for POD /data
-
ckanext-storage
-
CKAN storage extension.
-
collecting_events
-
Analysis of different approach for collecting Twitter data for events.
-
congresseditors
-
the code that runs the @congresseditors twitter bot
-
congresseditors-archive
-
a snapshot of the @congressedtiors twitter archive
-
congressedits-archive
-
a snapshot of the @congressedits twitter archive
-
congressedits-slides
-
slides for DC Hack & Tell
-
congress-legislators
-
Members of the United States Congress, 1789-Present, in YAML, as well as committees and presidents.
-
conn4
-
a Connect Four demo written in PHP and JavaScript.
-
cooperhewitt-collection
-
Cooper-Hewitt's Collection Database
-
cpdata
-
FAO Country Profile data
-
creepy-polaroid
-
display an image for where you are using HTML, JavaScript and Google
-
cscw2016
-
URLS shared during CSCW 2016
-
cscw2016-topicmodeling
-
null
-
cscw-pandoc
-
Turn your Pandoc Markdown into a CSCW PDF
-
csvw
-
Documents produced by the CSV on the Web Working Group
-
csvw-template
-
document the semantics of your csv file
-
curio
-
An experiment in static site archives.
-
d3muckabout
-
null
-
dat
-
real-time replication and versioning for data sets.
-
databib-metadata
-
example html/metadata examples for databib
-
data-gov-uk-harvester
-
tiny little project to harvest rdfa metadata from data.gov.uk
-
datarescue-dc
-
null
-
datastory
-
null
-
datausa-tutorials
-
Holds tutorials for how to build things with the dataUSA API and embedded visualizations.
-
dchud-notebooks
-
null
-
decentweb
-
null
-
dedoop
-
recursively deduplicate a directory and write its contents to a new directory while remembering the old paths
-
deepzoom.py
-
Python Deep Zoom Tools
-
denten.github.io
-
null
-
deplorable-slides
-
https://edsu.github.io/deplorable-slides/
-
dev8d-linked-data
-
some experiments with linked data available from the dev8d conference
-
dewey-crawler
-
simplistic crawler and serializer for linked data at dewey.info
-
dflat
-
an implementation of the dflat and redd specifications from CDL for versioning of digital objects
-
dh2015
-
urls tweeted during #dh2015
-
diffengine_diffs
-
A repository of edits to the Washington Post
-
diffengine-slides
-
slidedeck about diffengine
-
disasterview
-
an experiment using Python to explore the DPLA API and its images of natural disasters
-
django-pagination
-
A set of utilities for creating robust pagination tools throughout a django application.
-
django-sugar
-
Curated collection of all the sweet Django helpers/utilities developers create, and sometimes recreate too often.
-
django-tastypie
-
Creating delicious APIs for Django apps since 2010. v1.0.0-beta
-
dlfforum
-
dlfforum 2015 urls
-
dlfforum-2016
-
URLs shared during DLF Forum 2016
-
dnflow
-
experimental repo, playing with dn workflow options
-
docker-open-oni
-
Run open-oni (chronam) in Docker
-
docker_shortimer
-
Docker environment for https://github.com/code4lib/shortimer
-
docnow-bricolage
-
Slides for a presentation about DocNow at AERI 2016
-
docnow.github.io
-
docnow.io website
-
docnow-slides-2017
-
null
-
docnow-vis
-
a few slides for my data vis class presentation
-
dpla-map
-
a simple pure html/javascript DPLA/GoogleMap mashup
-
dpla-platform
-
The DPLA Platform
-
dpub-annotation
-
null
-
dumbwaiter
-
Workflow for processing open data released by the New York Public Library's What's on the Menu? project
-
dynamodb
-
null
-
ead-finder
-
use Google to find public EAD XML documents
-
eadlinks
-
informal survey of linking from archival finding aids
-
earls
-
display urls being tweeted with an event hashtag
-
easyxdm-test
-
simple test of easyXDM
-
echochamber
-
download/visualize the connections between the followers of a given Twitter user
-
editbot
-
a bot that will tweet edits to wikipedia articles as they happen
-
ema
-
Enhancing Music Notation Addressability
-
emailz
-
turn mboxen into rdf, and visualize w/ d3
-
empirical-cloud
-
a little demo visualization of owl:sameAs links in billion triple challenge data
-
ENGL-668K-Data-Stories
-
Materials for the Data Stories module in Introduction to Digital Studies at the University of Maryland (Fall 2015).
-
ephemeral-activist-culture
-
My slides for Preserving Ephemeral Activist Culture. November 7, 2015 at Temple University.
-
etudier
-
Extract a citation network from Google Scholar
-
europeana-crawler
-
a simple crawler of the RDFa in Europeana
-
extlinks
-
utility to download and parse wikipedia external links
-
fakepremis
-
fake premis event twitter bot
-
fastcat
-
navigate wikipedia categories quickly in a local redis instance
-
fbarc
-
A commandline tool and Python library for archiving data from Facebook using the Graph API.
-
fbopen
-
An open API server, data import tools, and sample apps to help small businesses search for opportunities to work with the U.S. government.
-
ferguson-201408
-
A view of the most retweeted Ferguson tweets sent Aug 9-10.
-
ferguson-lothian
-
slides for Alexis Lothian's DCC106 class
-
ferguson-shilton
-
slides for my talk to Katie Shilton's class
-
ferguson-slides
-
Slides to describe MITH's work with archiving Ferguson Twitter data.
-
ferguson-tweet-viewer
-
randomly display ferguson tweets
-
ferguson-urls
-
A summary report and dataset documenting URLs in tweets mentioning "ferguson".
-
ffmprovisr
-
null
-
fido
-
Format Identification for Digital Objects (FIDO) is a Python command-line tool to identify the file formats of digital objects. It is designed for simple integration into automated work-flows.
-
flickr-commons-metadata
-
null
-
fondz
-
fondz is a tool for auto-generating an "archival description" from a bag or series of bags.
-
freddiegray-tweet-viewer
-
display random tweets w/ images from #FreddieGray
-
friendly
-
null
-
geonames-localsolr
-
A little project to help bootstrap a local-solr instance with geonames data.
-
ginger
-
null
-
google-count
-
hack to count google hits
-
google-the-poem
-
An epic poem generated using Google auto-complete
-
gpx-geojson
-
null
-
h
-
The Internet, peer reviewed.
-
hathitime
-
demo of hathitrust research center API to report term usage over time
-
hathitrust-api
-
Python wrappers for the HathiTrust APIs.
-
highscores
-
Displays retro arcade game highscores for original cataloging performed today using OCLC's Worldcat Live API.
-
hocr-tools
-
Tools for manipulating and evaluating the hOCR format for representing multi-lingual OCR results by embedding them into HTML.
-
htmldiff
-
Diffs arbitrary HTML inline
-
htrc-feature-reader
-
Tools for working with HTRC Feature Extraction files
-
HTRCMARC2RDF
-
null
-
hubot-scripts
-
DEPRECATED, see https://github.com/github/hubot-scripts/issues/1113 for details - optional scripts for hubot, opt in via hubot-scripts.json
-
ici
-
Edit Wikipedia Pages Near You
-
id
-
LCSH SKOS webapp
-
identify-by-color
-
null
-
iftheygunnedmedown
-
null
-
ifttt
-
Flask web app providing an IFTTT Channel Protocol API for featured content on Wikimedia wikis
-
iiif
-
Python library for IIIF
-
iiif.io
-
IIIF Website, contexts, and source of specifications documents
-
iipc16
-
URLs shared during the IIPC 2016 Conference
-
iipcGA15
-
URLs tweeted during IIPC 2015
-
imls-cdx
-
working files, data, notebooks for museum group at Archives Unleashed DC
-
incu4
-
null
-
inkdroid-apache
-
my config files for apache
-
inkdroid.org
-
My website
-
inkdroid-proxy
-
my node.js proxy server
-
internetannouncementboard
-
null
-
introspect
-
examine what you look at on Wikipedia using your Chrome history database
-
IRC-js
-
The best IRC library for node.js
-
janky
-
oh I dunno
-
jekyll-pandoc
-
Jekyll Pandoc markdown converter as Ruby gem
-
jekyll-wikidata
-
A Jekyll plugin for Wikidata.
-
journos
-
simple example of looking for journalists in twitter stream
-
jschannel
-
A JavaScript library which implements fancy IPC semantics on top of postMessage.
-
jschannel-test
-
a simple test of jschannel for highlighting
-
json2xml
-
simplistic json -> xml converter
-
json-intro
-
Short, gentle introductions to JSON for the aspiring programmer.
-
kasabi-archive
-
little utility for downloading kasabi datasets and uploading to Internet Archive
-
lakeland-iiif
-
null
-
lakeland-images
-
null
-
lastcloud
-
imperfect html/javascript hack to look up musicians you like on soundcloud
-
lastfm-tools
-
Some Python CLI tools for talking to the Last.fm API
-
lastweet
-
Update Twitter & Mastodon with your LastFM history.
-
launchpad
-
A django based system that provides a stable URL for every item in the library's catalog. Various discovery services will link to these URLs. The page for each item will in turn link out to various other resources that provide methods for accessing the content of the items.
-
lcco
-
Converts a textual representation of the Library of Congress Classification Outline into SKOS/RDF and makes it available on the Web in a hierarchical viewer.
-
lc-findingaids
-
null
-
lcsh-index
-
a simple example of putting lcsh into an solr index
-
lcsh-subset
-
create a subset view of LCSH
-
libweb
-
extract library homepage urls from LIBWEB
-
linkypedia
-
a web based tool to monitor how your website content is used in wikipedia
-
lldvis
-
LLD Visualiser
-
lmb
-
null
-
loc
-
Library of Congress Residency 2017-2018
-
lochief
-
A linked-data version of kochief
-
lod-graph
-
A protovis visualization of the linked open data cloud.
-
macaulay-mix
-
play random sounds from the Macaulay Library
-
maintainers-urls
-
urls shared during The Maintainers 2016
-
marac-slides
-
slides for my MARAC talk naturally
-
marc2bibframe
-
null
-
marc-detrans
-
Perl de-transliteration engine for converting romanized text in bibliographic data to native scripts.
-
marc-spec
-
MARC spec as string
-
marc-subjectmap
-
perl framework for translating subject headings in MARC data
-
marvin
-
Marvin is a character for your home that enhances your life. Marvin is software that runs on a beaglebone along with a cape that contains a number of added sensors (available upon request).
-
Mastodon.py
-
Python wrapper for the Mastodon ( https://github.com/Gargron/mastodon/ ) API.
-
mediator
-
Look at Medium through Twitter
-
mediatypes
-
A project that harvests media type information from the IANA registry, and publishes information as linked data using the Google App Engine.
-
medium-archive
-
A snapshot of my medium export.
-
menus-vocab
-
null
-
metaHathi
-
Scalatra app for importing downloaded Hathi metadata into a running instance of OpenRefine
-
metatweet
-
A bot for monitoring the structure of JSON in tweets from the sample stream.
-
metaweb
-
get metadata for a web page
-
microdata
-
python library for extracting html microdata
-
microdata_schemaorg_example
-
Step by step example of applying Microdata and Schema.org vocabularies to a digital collections site.
-
mincomp
-
a GO::DH working group on minimal computing
-
mirador
-
An open-source, web-based 'multi-up' viewer that supports zoom-pan-rotate functionality, ability to display/compare simple images, and images with annotations.
-
mirador-test
-
null
-
mith-chat
-
MITH chat slides
-
mla16
-
urls shared during the Modern Languages Association 2016 Conference
-
moma-collection
-
The Museum of Modern Art (MoMA) collection data
-
muldicat
-
tool to generate SKOS for the Multilingual Dictionary of Cataloging Terms and Concepts
-
multiverse
-
A JavaScript library for writing generative text in HTML.
-
mundaneum
-
null
-
namaste
-
Python port of the Namaste Perl module, "which implements the Namaste (Name as Text) convention for containing a data element completely within the content of a file, using as filename an approximation of the value preceded by a numeric tag."
-
NativeImaging
-
Experimental PIL-like interface for basic functionality using platform native libraries such as GraphicsMagick
-
ndfnz-2015
-
urls tweeted during National Digital Forum 2015 in New Zealand
-
neveragaindottech.github.io
-
Source files for the neveragain.tech site
-
nla-slides
-
slides for my talk at the National Library of New Zealand
-
nlnz-slides
-
null
-
node
-
evented I/O for v8 javascript
-
node-unshorten
-
URL unshortener for Node.js
-
nyaraka
-
Extract data from Omeka to the filesystem.
-
NYTdiff
-
Code for the twitter bot nyt_diff
-
nytimestream
-
NYTimes Newswire API as a stream using node.js
-
oai2pairtree
-
command line utility to dump records in an oai-pmh repository as xml in a pairtree
-
oai2xmpp
-
oai-pmh -> xmpp
-
ocropy
-
minimalist wrapper around ocropus for generating hOCR documents from images
-
ohh
-
share content, have fun, make friends
-
omeka_tweet
-
Tweets when new items are added to an Omeka instance.
-
open-oni
-
Fork of chronam, under heavy development and not yet ready for production
-
openseadragon
-
Fork of OpenSeadragon (from its mercurial repository). Provides a smooth Zoomable User Interface for HTML/Javascript
-
openseadragon-djatoka-demo
-
A simple demonstration of using OpenSeadragon with a Djatoka Image Server
-
opensearch
-
A python opensearch client
-
opinions
-
watch SCOTUS opinions for URLs
-
overview
-
? Start here for current projects, how to get involved, and joining community calls, a resource for new and veteran members
-
pairtree
-
Python Pairtree implementation
-
pandoc-templates
-
Templates for pandoc
-
paperbot
-
Twitter bot for Chronicling America
-
papvc-topicmodel
-
null
-
parisreview
-
visualize the graph of Paris Review interviews, and their links to Wikipedia
-
pda2015
-
URLs tweeted during Personal Digital Archiving 2015
-
peaceworks
-
slides for PeaceWorks presentation
-
pillbox-data-process
-
Pillbox for Developers data processing code
-
pinhole
-
null
-
pizzagate
-
None.
-
pjscrape
-
A web-scraping framework written in Javascript, using PhantomJS and jQuery
-
presimental
-
sentiment analysis of Obama and Romney tweets
-
ptree
-
minimal PairTree implementation
-
public-apis
-
A collective list of public JSON APIs for use in web development.
-
pydnz
-
null
-
py-flarchive
-
py-flarchive is a very simple Python library for archiving the metadata for the Flickr photos belonging to a user.
-
pymarc
-
process MARC records from Python
-
python-oauth2
-
A fully tested, abstract interface to creating OAuth clients and servers.
-
python-sitemap
-
Python library for parsing & generating sitemaps
-
QueryPic
-
null
-
ragdollkit
-
null
-
rage14
-
d3 visualization of the httpRange-14 discussion on various w3 mailing lists
-
rdflib-microdata
-
an rdflib plugin to parse html5 microdata
-
rdio2spotify
-
null
-
react-boilerplate
-
:fire: A highly scalable, offline-first foundation with the best developer experience and a focus on performance and best practices.
-
react-redux-starter
-
a boilerplate react-redux app just for me
-
readsaa
-
makeshift archive of readsaa tweets
-
redis
-
Redis key-value store
-
request
-
Simplified HTTP request client.
-
requests
-
Python HTTP Requests for Humans.
-
requests-html
-
Pythonic HTML Parsing for Humans?
-
resaw_eu
-
URLs mentioned at http://resaw.eu/ in 2015
-
restful-bag-server
-
Draft of proposed structure for serving BagIt repositories RESTfully
-
resync-simulator
-
ResourceSync Change Simulator
-
robotstxt
-
robots.txt parser coffeescript
-
s3-bucket-listing
-
Create nice directory listings for s3 buckets with javascript and HTML.
-
s3_loader
-
Watch for local files to appear and move them into S3
-
saa-glossary
-
structured data scraped from A Glossary of Archival and Records Terminology
-
scalanvas
-
Scala library for creating Shared Canvas manifests
-
scotus-bookmarks
-
null
-
seedlists
-
experiment to see what web archive seed lists would look like as yaml
-
semantictweet
-
A simple Sinatra application that provides a FOAF semantic web feed of your twitter friends and followers. Forked from sinatra-template.
-
shortpipe
-
unshorten a stream of urls from the command line in parallel
-
sigma.js
-
an open-source lightweight JavaScript graph drawing library
-
simpleopendata
-
simple guidelines for publishing open data in useful formats
-
skosd
-
turn a SKOS vocabulary into something less, that's more useful - less is more, etc.
-
skosdict
-
turn a SKOS concept scheme into a simple JSON dictionary
-
skos_wikidata
-
match a SKOS concept scheme to Wikidata from the command line
-
social-feed-manager
-
manage rules and streams from social data sources, starting with twitter.
-
Socket.IO
-
Sockets for the rest of us
-
solrpy
-
Automatically exported from code.google.com/p/solrpy
-
south-test
-
just a throw away demo app
-
spn
-
Playing around with SavePageNow (meta)data.
-
sru-ruby
-
ruby client for the Search/Retrieval by URL protocol
-
storycorps-meta
-
collect public storycorps metadata and save as json-ld
-
styles
-
Official repository for Citation Style Language (CSL) citation styles.
-
subjects-here
-
An HTML5 experiment that uses OCLC's mapFast to lookup subjects for your current location.
-
summoner
-
work with the Serial Solutions Summon API from Python
-
talks
-
talks i'm giving or have given
-
teju-instagram
-
small dataset of Teju Cole Instagram metadata for analysis
-
testudo
-
UMD Schedule of Classes as Data
-
thisisacoup
-
#thisisacoup tweet ids
-
tinyarchive
-
Software behind tracker.tinyarchive.org - Warning: Very hacky code
-
tosdrbot
-
A Twitter Bot for monitoring changes to ToS Documents
-
toxic-bags
-
a collection of BagIt test data
-
translators
-
Zotero Translators
-
travis-magic
-
simple test of python-magic on travis
-
ttw16
-
URLs shared during Theorizing the Web 2016
-
twarc
-
A command line tool (and Python library) for archiving Twitter JSON
-
tweepy
-
Twitter for Python!
-
twit
-
Twitter API Client for node (REST & Streaming API)
-
twitterator
-
iterator functions for twitter api
-
twitter-export-image-fill
-
A script to download (backup locally) all the images accompanying your tweets
-
twtxt
-
null
-
umd-r-study
-
null
-
under-surveillance
-
slides for a talk at CHNM about the Documenting the Now project
-
upchuck
-
People who congratulated Trump on winning in Twitter
-
versioning-metrics
-
little utility to compare approaches to version control
-
videogrep
-
automatic video supercuts with python
-
vine-tweets-slides
-
null
-
vogon
-
You know, Vogon poetry courtesy of Google Suggest.
-
voyage
-
display a stream of circulation activity for a Voyager ILS
-
warc
-
warc library for golang
-
warcpy
-
Python library for reading and writing warc files
-
warc-twarc
-
Save a Twitter search and get the JSON data for the tweets.
-
wastebookbot
-
dumb bot that tweets markov text from the Waste Book
-
webarchives
-
see if a URL is available in a web archive somewhere on the web
-
webarchives-intro
-
intro to web archives slides
-
web-platform-tests
-
Test suites for Web-platform specs ? including WHATWG, W3C, and others
-
webpresmed
-
The Web as a Preservation Medium (slidedeck)
-
whiskers.js
-
Whiskers templating library for JavaScript
-
whrss
-
scrape White House Blog to generate RSS until it starts working again
-
wikichanges
-
a NodeJS library for monitoring changes on Wikipedia sites
-
wikicites
-
get a stream of recent citations from wikipedia
-
wikidata-bots
-
wikidata editing stats for bots
-
wikidata-client
-
Wikidata API client.
-
wikidata_suggest
-
a CLI suggestion tool for Wikidata entities
-
wikieds
-
A command line script to summarize the editors for a given Wikipedia article (in Markdown)
-
wikifileformats
-
playground for wikipedia file formats experiments
-
wikigeo
-
JavaScript library for getting geojson from the Wikipedia API
-
wikilinks
-
null
-
wikipedia-irc
-
Try to spot new trends based on Wikipedia live edit spikes
-
wikippoc
-
Wikipedia / LC Prints & Photographs Citation Tool
-
wikipulse
-
a gauge widget to display wikipedia activity
-
wikistream
-
displays edit activity on wikipedia
-
wikitrends
-
see most viewed wikipedia articles
-
wikitweets
-
see tweets that reference wikipedia articles
-
wordpressure
-
realtime view of new items posted to WordPress sites
-
worksvenn
-
generate a Venn diagram for LibraryThing, OCLC and OpenLibrary FRBRization services
-
wplinks
-
utility to get a list of Wikipedia articles that point at a particular website
-
www-wikipedia
-
Simple Perl client for grabbing content out of Wikipedia
-
zhang-webarchiving
-
Notes for my talk about Web Archiving to Jane Zhang's Digital Curation class.
Commits To
Repository | Most Recent Commit | # Commits |