Ed Summers

Login: edsu

Company: University of Maryland

Location: Silver Spring, MD

Bio: I'm a software developer at @umd-mith and study archives on & of the web in the UMD iSchool.

Blog: http://inkdroid.org

Blog: http://inkdroid.org

Member of

  1. Archives and Linked Data
  2. code4lib
  3. DCIC-UMD
  4. Documenting the Now
  5. Maryland Institute for Technology in the Humanities
  6. RDFLib
  7. null
  8. null

Repositories

1600daily
generate rss for White House 1600Daily website (ugh)
accessyyz-2015
earls results for Access 2015 http://accessconference.ca/
agrippa
A test repo for teaching humanists about Git & GitHub
alllivesmatter
Experiments with #AllLivesMatter hashtag in Ferguson related tweets.
alternative-internet
A collection of interesting new networks and tech aiming at decentralisation (in some form).
alto-words
simplistic calculation of the ratio of dictionary words to all words in a METS Alto OCR file
american-archive-kaldi
This repo houses open-source models for Kaldi speech-to-text software that have been trained on public media content.
americanarchivist
metadata extractor for The American Archivist
annotator-okfn
Inline annotation for the web in pure Javascript. Select text, images, or (nearly) anything else, and add your notes.
annotator-store
A backend store for the Annotator
anon
tweet about anonymous Wikipedia edits from particular IP address ranges
antiharassment-policy
Code4lib anti-harassment policy drafting space
aoir2016
URLs Tweeted during AoIR 2106 http://aoir.org/aoir2016/
aotycmp
hack to see what well reviewed albums-of-the-year are available on Spotify and Rdio
apostrophe
"But I got the crystal ball", he said, and held it to the light.
appdeps
Simple commandline tool to check and/or wait for application dependencies.
appraisal-talk
slides for a talk at SIGCIS
archivesspace
The ArchivesSpace archives management tool
at-you
analyze the tweets that are directed at deray & Nettaaaaaaaa
awesome-iiif
Awesome IIIF-related resources
bagger-js
Experiment doing BagIt processing in a static web application
bagit
null
bagit-python3
Create BagIt style packages of digital content in Python 3.2+
bagit-ruby
Ruby Library and Command Line tools for bagit
bagitspec
The BagIt File Packaging Format
bagweb
mirror a website, put it in a bag
baltimore-riots-viewer
random #BaltimoreRiot tweets with media content
baltimore-uprising-viewer
random #BaltimoreUprising tweets with media content
beat
little experiment to look at links in LC bibliographic data
bell
Alexander Graham Bell Family Papers Metadata
bisac
top level BISAC subject vocabulary
bitcoin.org
Bitcoin.org website
bootstrap
CSS toolkit from Twitter
botnet-retweets
Exploring retweets by Twitter bot-nets.
bots-seeds-people
slides for a paper I'm working on
bullshit-detector
Building a better bullshit detector for social media based on fact checking sites.
c4l15-urls
a snapshot of urls mentioned during code4lib 2015
c4l18-keynote-statement
Code4Lib Community Statement in Support of Chris Bourg
cablegate-social-graph
tries to display wikileaks cablegate cable messages as a graph
Capstone---Generative-Poetry
null
chi2016
urls shared during CHI 2016
chroma
display searches of Chronicling America as they happen
chrome-extension-example
null
chronam-widget
view on NDNP content using just HTML/JavaScript and the Chronicling America API
ckanext-datajson
for POD /data
ckanext-storage
CKAN storage extension.
collecting_events
Analysis of different approach for collecting Twitter data for events.
congresseditors
the code that runs the @congresseditors twitter bot
congresseditors-archive
a snapshot of the @congressedtiors twitter archive
congressedits-archive
a snapshot of the @congressedits twitter archive
congressedits-slides
slides for DC Hack & Tell
congress-legislators
Members of the United States Congress, 1789-Present, in YAML, as well as committees and presidents.
conn4
a Connect Four demo written in PHP and JavaScript.
cooperhewitt-collection
Cooper-Hewitt's Collection Database
cpdata
FAO Country Profile data
creepy-polaroid
display an image for where you are using HTML, JavaScript and Google
cscw2016
URLS shared during CSCW 2016
cscw2016-topicmodeling
null
cscw-pandoc
Turn your Pandoc Markdown into a CSCW PDF
csvw
Documents produced by the CSV on the Web Working Group
csvw-template
document the semantics of your csv file
curio
An experiment in static site archives.
d3muckabout
null
dat
real-time replication and versioning for data sets.
databib-metadata
example html/metadata examples for databib
data-gov-uk-harvester
tiny little project to harvest rdfa metadata from data.gov.uk
datarescue-dc
null
datastory
null
datausa-tutorials
Holds tutorials for how to build things with the dataUSA API and embedded visualizations.
dchud-notebooks
null
decentweb
null
dedoop
recursively deduplicate a directory and write its contents to a new directory while remembering the old paths
deepzoom.py
Python Deep Zoom Tools
denten.github.io
null
deplorable-slides
https://edsu.github.io/deplorable-slides/
dev8d-linked-data
some experiments with linked data available from the dev8d conference
dewey-crawler
simplistic crawler and serializer for linked data at dewey.info
dflat
an implementation of the dflat and redd specifications from CDL for versioning of digital objects
dh2015
urls tweeted during #dh2015
diffengine_diffs
A repository of edits to the Washington Post
diffengine-slides
slidedeck about diffengine
disasterview
an experiment using Python to explore the DPLA API and its images of natural disasters
django-pagination
A set of utilities for creating robust pagination tools throughout a django application.
django-sugar
Curated collection of all the sweet Django helpers/utilities developers create, and sometimes recreate too often.
django-tastypie
Creating delicious APIs for Django apps since 2010. v1.0.0-beta
dlfforum
dlfforum 2015 urls
dlfforum-2016
URLs shared during DLF Forum 2016
dnflow
experimental repo, playing with dn workflow options
docker-open-oni
Run open-oni (chronam) in Docker
docker_shortimer
Docker environment for https://github.com/code4lib/shortimer
docnow-bricolage
Slides for a presentation about DocNow at AERI 2016
docnow.github.io
docnow.io website
docnow-slides-2017
null
docnow-vis
a few slides for my data vis class presentation
dpla-map
a simple pure html/javascript DPLA/GoogleMap mashup
dpla-platform
The DPLA Platform
dpub-annotation
null
dumbwaiter
Workflow for processing open data released by the New York Public Library's What's on the Menu? project
dynamodb
null
ead-finder
use Google to find public EAD XML documents
eadlinks
informal survey of linking from archival finding aids
earls
display urls being tweeted with an event hashtag
easyxdm-test
simple test of easyXDM
echochamber
download/visualize the connections between the followers of a given Twitter user
editbot
a bot that will tweet edits to wikipedia articles as they happen
ema
Enhancing Music Notation Addressability
emailz
turn mboxen into rdf, and visualize w/ d3
empirical-cloud
a little demo visualization of owl:sameAs links in billion triple challenge data
ENGL-668K-Data-Stories
Materials for the Data Stories module in Introduction to Digital Studies at the University of Maryland (Fall 2015).
ephemeral-activist-culture
My slides for Preserving Ephemeral Activist Culture. November 7, 2015 at Temple University.
etudier
Extract a citation network from Google Scholar
europeana-crawler
a simple crawler of the RDFa in Europeana
extlinks
utility to download and parse wikipedia external links
fakepremis
fake premis event twitter bot
fastcat
navigate wikipedia categories quickly in a local redis instance
fbarc
A commandline tool and Python library for archiving data from Facebook using the Graph API.
fbopen
An open API server, data import tools, and sample apps to help small businesses search for opportunities to work with the U.S. government.
ferguson-201408
A view of the most retweeted Ferguson tweets sent Aug 9-10.
ferguson-lothian
slides for Alexis Lothian's DCC106 class
ferguson-shilton
slides for my talk to Katie Shilton's class
ferguson-slides
Slides to describe MITH's work with archiving Ferguson Twitter data.
ferguson-tweet-viewer
randomly display ferguson tweets
ferguson-urls
A summary report and dataset documenting URLs in tweets mentioning "ferguson".
ffmprovisr
null
fido
Format Identification for Digital Objects (FIDO) is a Python command-line tool to identify the file formats of digital objects. It is designed for simple integration into automated work-flows.
flickr-commons-metadata
null
fondz
fondz is a tool for auto-generating an "archival description" from a bag or series of bags.
freddiegray-tweet-viewer
display random tweets w/ images from #FreddieGray
friendly
null
geonames-localsolr
A little project to help bootstrap a local-solr instance with geonames data.
ginger
null
google-count
hack to count google hits
google-the-poem
An epic poem generated using Google auto-complete
gpx-geojson
null
h
The Internet, peer reviewed.
hathitime
demo of hathitrust research center API to report term usage over time
hathitrust-api
Python wrappers for the HathiTrust APIs.
highscores
Displays retro arcade game highscores for original cataloging performed today using OCLC's Worldcat Live API.
hocr-tools
Tools for manipulating and evaluating the hOCR format for representing multi-lingual OCR results by embedding them into HTML.
htmldiff
Diffs arbitrary HTML inline
htrc-feature-reader
Tools for working with HTRC Feature Extraction files
HTRCMARC2RDF
null
hubot-scripts
DEPRECATED, see https://github.com/github/hubot-scripts/issues/1113 for details - optional scripts for hubot, opt in via hubot-scripts.json
ici
Edit Wikipedia Pages Near You
id
LCSH SKOS webapp
identify-by-color
null
iftheygunnedmedown
null
ifttt
Flask web app providing an IFTTT Channel Protocol API for featured content on Wikimedia wikis
iiif
Python library for IIIF
iiif.io
IIIF Website, contexts, and source of specifications documents
iipc16
URLs shared during the IIPC 2016 Conference
iipcGA15
URLs tweeted during IIPC 2015
imls-cdx
working files, data, notebooks for museum group at Archives Unleashed DC
incu4
null
inkdroid-apache
my config files for apache
inkdroid.org
My website
inkdroid-proxy
my node.js proxy server
internetannouncementboard
null
introspect
examine what you look at on Wikipedia using your Chrome history database
IRC-js
The best IRC library for node.js
janky
oh I dunno
jekyll-pandoc
Jekyll Pandoc markdown converter as Ruby gem
jekyll-wikidata
A Jekyll plugin for Wikidata.
journos
simple example of looking for journalists in twitter stream
jschannel
A JavaScript library which implements fancy IPC semantics on top of postMessage.
jschannel-test
a simple test of jschannel for highlighting
json2xml
simplistic json -> xml converter
json-intro
Short, gentle introductions to JSON for the aspiring programmer.
kasabi-archive
little utility for downloading kasabi datasets and uploading to Internet Archive
lakeland-iiif
null
lakeland-images
null
lastcloud
imperfect html/javascript hack to look up musicians you like on soundcloud
lastfm-tools
Some Python CLI tools for talking to the Last.fm API
lastweet
Update Twitter & Mastodon with your LastFM history.
launchpad
A django based system that provides a stable URL for every item in the library's catalog. Various discovery services will link to these URLs. The page for each item will in turn link out to various other resources that provide methods for accessing the content of the items.
lcco
Converts a textual representation of the Library of Congress Classification Outline into SKOS/RDF and makes it available on the Web in a hierarchical viewer.
lc-findingaids
null
lcsh-index
a simple example of putting lcsh into an solr index
lcsh-subset
create a subset view of LCSH
libweb
extract library homepage urls from LIBWEB
linkypedia
a web based tool to monitor how your website content is used in wikipedia
lldvis
LLD Visualiser
lmb
null
loc
Library of Congress Residency 2017-2018
lochief
A linked-data version of kochief
lod-graph
A protovis visualization of the linked open data cloud.
macaulay-mix
play random sounds from the Macaulay Library
maintainers-urls
urls shared during The Maintainers 2016
marac-slides
slides for my MARAC talk naturally
marc2bibframe
null
marc-detrans
Perl de-transliteration engine for converting romanized text in bibliographic data to native scripts.
marc-spec
MARC spec as string
marc-subjectmap
perl framework for translating subject headings in MARC data
marvin
Marvin is a character for your home that enhances your life. Marvin is software that runs on a beaglebone along with a cape that contains a number of added sensors (available upon request).
Mastodon.py
Python wrapper for the Mastodon ( https://github.com/Gargron/mastodon/ ) API.
mediator
Look at Medium through Twitter
mediatypes
A project that harvests media type information from the IANA registry, and publishes information as linked data using the Google App Engine.
medium-archive
A snapshot of my medium export.
menus-vocab
null
metaHathi
Scalatra app for importing downloaded Hathi metadata into a running instance of OpenRefine
metatweet
A bot for monitoring the structure of JSON in tweets from the sample stream.
metaweb
get metadata for a web page
microdata
python library for extracting html microdata
microdata_schemaorg_example
Step by step example of applying Microdata and Schema.org vocabularies to a digital collections site.
mincomp
a GO::DH working group on minimal computing
mirador
An open-source, web-based 'multi-up' viewer that supports zoom-pan-rotate functionality, ability to display/compare simple images, and images with annotations.
mirador-test
null
mith-chat
MITH chat slides
mla16
urls shared during the Modern Languages Association 2016 Conference
moma-collection
The Museum of Modern Art (MoMA) collection data
muldicat
tool to generate SKOS for the Multilingual Dictionary of Cataloging Terms and Concepts
multiverse
A JavaScript library for writing generative text in HTML.
mundaneum
null
namaste
Python port of the Namaste Perl module, "which implements the Namaste (Name as Text) convention for containing a data element completely within the content of a file, using as filename an approximation of the value preceded by a numeric tag."
NativeImaging
Experimental PIL-like interface for basic functionality using platform native libraries such as GraphicsMagick
ndfnz-2015
urls tweeted during National Digital Forum 2015 in New Zealand
neveragaindottech.github.io
Source files for the neveragain.tech site
nla-slides
slides for my talk at the National Library of New Zealand
nlnz-slides
null
node
evented I/O for v8 javascript
node-unshorten
URL unshortener for Node.js
nyaraka
Extract data from Omeka to the filesystem.
NYTdiff
Code for the twitter bot nyt_diff
nytimestream
NYTimes Newswire API as a stream using node.js
oai2pairtree
command line utility to dump records in an oai-pmh repository as xml in a pairtree
oai2xmpp
oai-pmh -> xmpp
ocropy
minimalist wrapper around ocropus for generating hOCR documents from images
ohh
share content, have fun, make friends
omeka_tweet
Tweets when new items are added to an Omeka instance.
open-oni
Fork of chronam, under heavy development and not yet ready for production
openseadragon
Fork of OpenSeadragon (from its mercurial repository). Provides a smooth Zoomable User Interface for HTML/Javascript
openseadragon-djatoka-demo
A simple demonstration of using OpenSeadragon with a Djatoka Image Server
opensearch
A python opensearch client
opinions
watch SCOTUS opinions for URLs
overview
? Start here for current projects, how to get involved, and joining community calls, a resource for new and veteran members
pairtree
Python Pairtree implementation
pandoc-templates
Templates for pandoc
paperbot
Twitter bot for Chronicling America
papvc-topicmodel
null
parisreview
visualize the graph of Paris Review interviews, and their links to Wikipedia
pda2015
URLs tweeted during Personal Digital Archiving 2015
peaceworks
slides for PeaceWorks presentation
pillbox-data-process
Pillbox for Developers data processing code
pinhole
null
pizzagate
None.
pjscrape
A web-scraping framework written in Javascript, using PhantomJS and jQuery
presimental
sentiment analysis of Obama and Romney tweets
ptree
minimal PairTree implementation
public-apis
A collective list of public JSON APIs for use in web development.
pydnz
null
py-flarchive
py-flarchive is a very simple Python library for archiving the metadata for the Flickr photos belonging to a user.
pymarc
process MARC records from Python
python-oauth2
A fully tested, abstract interface to creating OAuth clients and servers.
python-sitemap
Python library for parsing & generating sitemaps
QueryPic
null
ragdollkit
null
rage14
d3 visualization of the httpRange-14 discussion on various w3 mailing lists
rdflib-microdata
an rdflib plugin to parse html5 microdata
rdio2spotify
null
react-boilerplate
:fire: A highly scalable, offline-first foundation with the best developer experience and a focus on performance and best practices.
react-redux-starter
a boilerplate react-redux app just for me
readsaa
makeshift archive of readsaa tweets
redis
Redis key-value store
request
Simplified HTTP request client.
requests
Python HTTP Requests for Humans.
requests-html
Pythonic HTML Parsing for Humans?
resaw_eu
URLs mentioned at http://resaw.eu/ in 2015
restful-bag-server
Draft of proposed structure for serving BagIt repositories RESTfully
resync-simulator
ResourceSync Change Simulator
robotstxt
robots.txt parser coffeescript
s3-bucket-listing
Create nice directory listings for s3 buckets with javascript and HTML.
s3_loader
Watch for local files to appear and move them into S3
saa-glossary
structured data scraped from A Glossary of Archival and Records Terminology
scalanvas
Scala library for creating Shared Canvas manifests
scotus-bookmarks
null
seedlists
experiment to see what web archive seed lists would look like as yaml
semantictweet
A simple Sinatra application that provides a FOAF semantic web feed of your twitter friends and followers. Forked from sinatra-template.
shortpipe
unshorten a stream of urls from the command line in parallel
sigma.js
an open-source lightweight JavaScript graph drawing library
simpleopendata
simple guidelines for publishing open data in useful formats
skosd
turn a SKOS vocabulary into something less, that's more useful - less is more, etc.
skosdict
turn a SKOS concept scheme into a simple JSON dictionary
skos_wikidata
match a SKOS concept scheme to Wikidata from the command line
social-feed-manager
manage rules and streams from social data sources, starting with twitter.
Socket.IO
Sockets for the rest of us
solrpy
Automatically exported from code.google.com/p/solrpy
south-test
just a throw away demo app
spn
Playing around with SavePageNow (meta)data.
sru-ruby
ruby client for the Search/Retrieval by URL protocol
storycorps-meta
collect public storycorps metadata and save as json-ld
styles
Official repository for Citation Style Language (CSL) citation styles.
subjects-here
An HTML5 experiment that uses OCLC's mapFast to lookup subjects for your current location.
summoner
work with the Serial Solutions Summon API from Python
talks
talks i'm giving or have given
teju-instagram
small dataset of Teju Cole Instagram metadata for analysis
testudo
UMD Schedule of Classes as Data
thisisacoup
#thisisacoup tweet ids
tinyarchive
Software behind tracker.tinyarchive.org - Warning: Very hacky code
tosdrbot
A Twitter Bot for monitoring changes to ToS Documents
toxic-bags
a collection of BagIt test data
translators
Zotero Translators
travis-magic
simple test of python-magic on travis
ttw16
URLs shared during Theorizing the Web 2016
twarc
A command line tool (and Python library) for archiving Twitter JSON
tweepy
Twitter for Python!
twit
Twitter API Client for node (REST & Streaming API)
twitterator
iterator functions for twitter api
twitter-export-image-fill
A script to download (backup locally) all the images accompanying your tweets
twtxt
null
umd-r-study
null
under-surveillance
slides for a talk at CHNM about the Documenting the Now project
upchuck
People who congratulated Trump on winning in Twitter
versioning-metrics
little utility to compare approaches to version control
videogrep
automatic video supercuts with python
vine-tweets-slides
null
vogon
You know, Vogon poetry courtesy of Google Suggest.
voyage
display a stream of circulation activity for a Voyager ILS
warc
warc library for golang
warcpy
Python library for reading and writing warc files
warc-twarc
Save a Twitter search and get the JSON data for the tweets.
wastebookbot
dumb bot that tweets markov text from the Waste Book
webarchives
see if a URL is available in a web archive somewhere on the web
webarchives-intro
intro to web archives slides
web-platform-tests
Test suites for Web-platform specs ? including WHATWG, W3C, and others
webpresmed
The Web as a Preservation Medium (slidedeck)
whiskers.js
Whiskers templating library for JavaScript
whrss
scrape White House Blog to generate RSS until it starts working again
wikichanges
a NodeJS library for monitoring changes on Wikipedia sites
wikicites
get a stream of recent citations from wikipedia
wikidata-bots
wikidata editing stats for bots
wikidata-client
Wikidata API client.
wikidata_suggest
a CLI suggestion tool for Wikidata entities
wikieds
A command line script to summarize the editors for a given Wikipedia article (in Markdown)
wikifileformats
playground for wikipedia file formats experiments
wikigeo
JavaScript library for getting geojson from the Wikipedia API
wikilinks
null
wikipedia-irc
Try to spot new trends based on Wikipedia live edit spikes
wikippoc
Wikipedia / LC Prints & Photographs Citation Tool
wikipulse
a gauge widget to display wikipedia activity
wikistream
displays edit activity on wikipedia
wikitrends
see most viewed wikipedia articles
wikitweets
see tweets that reference wikipedia articles
wordpressure
realtime view of new items posted to WordPress sites
worksvenn
generate a Venn diagram for LibraryThing, OCLC and OpenLibrary FRBRization services
wplinks
utility to get a list of Wikipedia articles that point at a particular website
www-wikipedia
Simple Perl client for grabbing content out of Wikipedia
zhang-webarchiving
Notes for my talk about Web Archiving to Jane Zhang's Digital Curation class.

Commits To

RepositoryMost Recent Commit# Commits


This work is supported by the National Institutes of Health's National Center for Advancing Translational Sciences, Grant Number U24TR002306. This work is solely the responsibility of the creators and does not necessarily represent the official views of the National Institutes of Health.