Installing the Flux Capacitor: Search at the Internet Archive
The Internet Archive’s Wayback Machine has collections that range in size from billions of archived web pages, to millions of scanned books, down to 10,000 quite-popular Grateful Dead concert recordings. See how Elasticsearch glues everything together and hear some lessons learned along the way.
Greg Lindahl is currently working for the Internet Archive, adding search to the Wayback Machine web archive, and the Archive's book collection. Previously, he was CTO/Founder at blekko, a web-scale search engine that was purchased by IBM Watson in March, 2015.