Elasticsearch in Anger: Stories from the GitHub Search Clusters

Over the past two years, GitHub's source code search product has grown from a small research project into a very large index containing nearly 4 billion documents. This ever-changing and continuously growing data set has presented us with some interesting scaling problems. This talk will cover how we have tackled them, from monitoring and alerting to application changes, growing clusters, and tuning Lucene parameters.

Tim Pease

For the past several years, Tim has been working at GitHub to make all things searchable. He has overseen GitHub's transition from a single Solr instance to running one of the larger Elasticsearch installations in production. When not indexing data, Tim enjoys cycling around his hometown of Boulder, Colorado.