Elasticsearch in Anger: Stories from the GitHub Search Clusters

Over the past two years GitHub's source code search product has grown from a small research project into a very large index containing nearly 4 billion documents. This is an ever changing and continuously growing data set that has presented us with some interesting scaling problems. This talk will cover how they have tackled these scaling problems - from monitoring and alerting, application changes, growing clusters, and tuning Lucene parameters.

Tim Pease