This Week in Elasticsearch - May 28, 2014
Welcome to This Week in Elasticsearch. In this roundup, we try to inform you about the latest and greatest changes in Elasticsearch. We cover what happened in the GitHub repositories, as well as many Elasticsearch events happening worldwide, and give you a small peek into the future of the project.
Elasticsearch core
- Field data: Global ordinals cause
ClassCastExceptions
if used with a bounded fielddata cache (#6260, master and 1.2) - Percolator API: Fix assertion in percolation with nested docs (#6263, master, 1.2 and 1.1)
- More Like This Query: replaced
exclude
withinclude
to avoid double negation (#6248, master and 1.2) - Lucene: Entirely cut over to
TopDocs#merge
for merging shard docs in the reduce phase (#6197, master and 1.3) - Percolate API: Support for execute significant terms and geo hash grid aggregations (#6037, master and 1.3)
- Percolate API: Add
num_of_shards
statistic to percolate context (#6123, master and 1.3) - Aggregations:
date_histogram
aggregation breaks on date fields with multiple formats (#6239, master and 1.3) - More Like This Query: allow for both
like_text
anddocs/ids
to be specified (#6246, master and 1.3) - Highlighting: Highlight fields in request order (#6178, master and 1.3)
- Aggregations: Fix
ReverseNestedAggregator
to compute the parent document correctly (#6278, master, 1.3 and 1.2) - Nested: Make sure queries/filters/aggs get a
FixedBitSet
when they expect one (#6279, master, 1.3, 1.2 and 1.1) - Guava: Filter cache size limit not honored 32GB or over (#6268, master, 1.3, 1.2 and 1.1)
- Aggregations: Added
top_hits
aggregation that keeps track of the most relevant document being aggregated per bucket (#6124, master and 1.3) - Startup: Improve error when
mlockall
fails (#6288, master and 1.3) - Query API: Fix
MatchQueryParser
not parsing fuzzy_transpositions (#6300, master, 1.3, 1.2 and 1.1) - Startup: Reset locale to
en_US.UTF-8
in bin/elasticsearch (#6047, master and 1.3) - Highlighting: Plain highlighter to use analyzer defined on a document level when available (#5497, master and 1.3)
- Serialization: Add support for Byte to the
XContentBuilder
(#6127, master and 1.3) - Mustache: Ensure internal scope extrators are always operating on a Map (6138#, master, 1.3, 1.2 and 1.1)
Today's #Elasticsearch #bbuzz hackathon. Thanks @elasticberlin @berlinonlinenet @syseleven @immobilienscout for all! pic.twitter.com/tLbmixz3cs
— Leslie Hawthorn (@lhawthorn)
May 28, 2014
Elasticsearch Ecosystem
Here’s some more information about what is happening in the ecosystem we are maintaining around the ELK stack – that’s Elasticsearch plus Logstash and Kibana – including plugin and driver releases.
- In case you missed it, we’ve had several releases in the past week: Elasticsearch 1.2.0 and 1.1.2, Elasticsearch for Apache Hadoop 2.0 GA, and Kibana 3.1 last week.
- Speaking of Kibana (and Elasticsearch!), you may be interested in reading about the recent hackathon we hosted with the MIT Data Analytics Club. Cool hacks, plus making data visualization simple and beautiful for those without a comp sci background.
- If you missed it last week, you can check out the video of our latest webcast, The ELK Stack in a DevOps Environment, presented by Kurt Hurtado. (Registration required.)
- The lastest release of the community created Scala client for Elasticsearch, Elastic4s 1.2.0.0 is out!
- Zachary Tong created an awesome how-to last week about creating a negative connotation filter for Elasticsearch.
- Gareth Phillips shared his experiences building threat intelligence dashboards with live feeds from Elasticsearch.
- You may be interested to learn how Quizlet and OpenCrowd are using Elasticsearch to power education & high finance.
- Poul Foged shared an article on replication and scaling in Elasticsearch. (i dansk)
Binh Ly Introduces Kibana at last week’s Chicago Meetup
Slides & Videos
We’re got a bunch of slide & video love from Berlin Buzzwords and beyond!
Where to find Us
We’d love to feature all the great Elasticsearch, Logstash and Kibana presentations and meetups happening worldwide in this section. If you’re speaking or hosting a meetup, let our Community Manager, Leslie Hawthorn, know!
Austria
The Elasticsearch Vienna Meetup Group has scheduled their first meetup for Thursday, June 12th. Please join us at 7 PM to hear from Alexander Reelsen on What’s New in Elasticsearch.
Bulgaria
Honza Kral will discuss how to Explore Your Data using Elasticsearch at the Bulgarian Web Summit 2014. The conference takes place on May 31st in Sofia.
France
The Seventh Elasticsearch France User Group meeting is coming up on June 10th in Paris. Doors open at 7:30 PM.
The Netherlands
We’ve got not one but two awesome conferences going on in Amsterdam the week of June 16th:
- DevOps Days Amsterdam: Join us June 19-21st for an Elasticsearch workshop, a little on DevOps for Happier Humans and all the news you can use at the Elasticsearch table. Swing by to say hello to Leslie Hawthorn and Kurt Hurtado! Lee Hinmann will also join us for the workshop. Yay Lee!
- GOTO Amsterdam: Our CTO, Shay Banon, will treat the crowd to a deep dive on Elasticsearch’s APIs, plus we’ll have lots of knowledge and swag on offer at our booth. See you there!
Norway
- Martijn Laarman will be speaking on from text to full-text search at the NDC Oslo 2014 conference. The show runs from June 2-6th.
- Mark Harwood will cover Big Data on Elasticsearch at the ComperioFrokost on June 11th.
Poland
The Torun JUG will get together on May 28th at 6 PM to talk about Lucene and all of her friends, with a spotlight on Elasticsearch.
Romania
Costin Leau will speak at Topconf Bucharest 2014 on Big data real time search and analytics. Topconf Bucharest runs from June 10-13th and Costin will speak at 3:20 PM on June 12th.
Russia
Honza Kral will speak at PyCon Ru. The schedule is still being finalized, but mark your calendars for June 2nd and 3rd. If you’re heading to PyCon Ru, make sure to say hello to Honza!
New Zealand
The Code Lounge user group in Auckland will get together on June 7th to talk Getting Control of Your Logs. Lots of Logstash love on offer.
Switzerland
- Alexander Reelsen will be speaking at Mimacom Days Zurich on June 4th. He will talk about continuous improvement of your data using the ELK stack at 9:45 AM, directly after the conference welcome remarks.
- Alexander Reelsen and Britta Weber will be speaking at the Zurich Elasticsearch Meetup on June 7th. Alex will discuss What’s new in Elasticsearch and Britta will cover the Significant Terms Aggregation. Doors open at 7 PM.
United Kingdom
- The London Elasticsearch User Group has already announced that there next meeting will take place on June 4th. RSVP now to save your place – this one fills up very quickly!
- David Pilato will take the stage at DevoxxUK to discuss Advanced Search for Your Legacy Application DevoxxUK takes place June 12-13th in London, and David will speak on June 12th at 4 PM in Room 1.
- Elasticsearch will be out in force at London Technology Week. Join us June 17-19th to celebrate all that is fashionable in the UK tech scene, especially at the Networking Bar. Even cooler, on June 17th, Mark Harwood will be telling you all about how to Make sense of your (big) data and analyze in real-time like you have never done before! Mark’s talk is at 4 PM.
United States
- This week’s event highlight: Costin Leau will speak at Hadoop Summit North America on Real-time Analytics and Anomalies Detection using Elasticsearch, Hadoop and Storm. The conference runs from June 3-5th in San Jose, California. Costin will take the stage at 4:35 PM on June 3rd.
- The Bay Area Search Meetup will welcome our CTO, Shay Banon, on June 3rd. We’re still sorting the final details, but register now. All of our Bay Area meetups fill lightning fast!
- The Elasticsearch New York City Meetup group will get together on June 4th at 6:30 PM. Agenda is still being finalized, but you’ll hear from our core developers during the meetup. Many thanks to ThoughtWorks for hosting us!
- The San Francisco Elasticsearch Meetup will focus on all things logging and DevOps goodness. Join Kurt Hurtado and Gaurav Gupta to learn all about how to get more love from your logs.
- The Philadelphia Elasticsearch Meetup group will be getting together for their second meetup on June 26th. Details are still in the works, but sign up now so you know where to go for ELK treats on the 26th.
Where to Find You
Our Community Manager, Leslie Hawthorn, is hard at work to help folks create more Elasticsearch meetup groups and to help meetup organizers find more speakers. If you are interested in either effort, take a moment to let her know.
Oh yeah, we’re also hiring. If you’d like us to find you for employment purposes, just drop us a note.  We care more about your skill set and passion for Elasticearch, Kibana and Logstash than where you rest your head.
Trainings
If you are interested in Elasticsearch training we have courses taught by our core developers coming up in:
- Zurich – June 5, 2014 (core Elasticsearch training)
- San Francisco – June 6, 2014 (ELK workshop)
- Amsterdam – June 27, 2014 (ELK workshop)
- Seoul – July 10, 2014 (core Elasticsearch training)
- Tokyo – July 14, 2014 (core Elasticsearch training)