This Week in Elasticsearch - May 28, 2014
Welcome to This Week in Elasticsearch. In this roundup, we try to inform you about the latest and greatest changes in Elasticsearch. We cover what happened in the GitHub repositories, as well as many Elasticsearch events happening worldwide, and give you a small peek into the future of the project.
- Field data: Global ordinals cause
ClassCastExceptionsif used with a bounded fielddata cache (#6260, master and 1.2)
- Percolator API: Fix assertion in percolation with nested docs (#6263, master, 1.2 and 1.1)
- More Like This Query: replaced
includeto avoid double negation (#6248, master and 1.2)
- Lucene: Entirely cut over to
TopDocs#mergefor merging shard docs in the reduce phase (#6197, master and 1.3)
- Percolate API: Support for execute significant terms and geo hash grid aggregations (#6037, master and 1.3)
- Percolate API: Add
num_of_shardsstatistic to percolate context (#6123, master and 1.3)
date_histogramaggregation breaks on date fields with multiple formats (#6239, master and 1.3)
- More Like This Query: allow for both
docs/idsto be specified (#6246, master and 1.3)
- Highlighting: Highlight fields in request order (#6178, master and 1.3)
- Aggregations: Fix
ReverseNestedAggregatorto compute the parent document correctly (#6278, master, 1.3 and 1.2)
- Nested: Make sure queries/filters/aggs get a
FixedBitSetwhen they expect one (#6279, master, 1.3, 1.2 and 1.1)
- Guava: Filter cache size limit not honored 32GB or over (#6268, master, 1.3, 1.2 and 1.1)
- Aggregations: Added
top_hitsaggregation that keeps track of the most relevant document being aggregated per bucket (#6124, master and 1.3)
- Startup: Improve error when
mlockallfails (#6288, master and 1.3)
- Query API: Fix
MatchQueryParsernot parsing fuzzy_transpositions (#6300, master, 1.3, 1.2 and 1.1)
- Startup: Reset locale to
en_US.UTF-8in bin/elasticsearch (#6047, master and 1.3)
- Highlighting: Plain highlighter to use analyzer defined on a document level when available (#5497, master and 1.3)
- Serialization: Add support for Byte to the
XContentBuilder(#6127, master and 1.3)
- Mustache: Ensure internal scope extrators are always operating on a Map (6138#, master, 1.3, 1.2 and 1.1)
Today's #Elasticsearch #bbuzz hackathon. Thanks @elasticberlin @berlinonlinenet @syseleven @immobilienscout for all! pic.twitter.com/tLbmixz3cs
— Leslie Hawthorn (@lhawthorn) May 28, 2014
Here’s some more information about what is happening in the ecosystem we are maintaining around the ELK stack – that’s Elasticsearch plus Logstash and Kibana – including plugin and driver releases.
- In case you missed it, we’ve had several releases in the past week: Elasticsearch 1.2.0 and 1.1.2, Elasticsearch for Apache Hadoop 2.0 GA, and Kibana 3.1 last week.
- Speaking of Kibana (and Elasticsearch!), you may be interested in reading about the recent hackathon we hosted with the MIT Data Analytics Club. Cool hacks, plus making data visualization simple and beautiful for those without a comp sci background.
- If you missed it last week, you can check out the video of our latest webcast, The ELK Stack in a DevOps Environment, presented by Kurt Hurtado. (Registration required.)
- The lastest release of the community created Scala client for Elasticsearch, Elastic4s 126.96.36.199 is out!
- Zachary Tong created an awesome how-to last week about creating a negative connotation filter for Elasticsearch.
- Gareth Phillips shared his experiences building threat intelligence dashboards with live feeds from Elasticsearch.
- You may be interested to learn how Quizlet and OpenCrowd are using Elasticsearch to power education & high finance.
- Poul Foged shared an article on replication and scaling in Elasticsearch. (i dansk)
Slides & Videos
We’re got a bunch of slide & video love from Berlin Buzzwords and beyond!
Where to find Us
We’d love to feature all the great Elasticsearch, Logstash and Kibana presentations and meetups happening worldwide in this section. If you’re speaking or hosting a meetup, let our Community Manager, Leslie Hawthorn, know!
The Seventh Elasticsearch France User Group meeting is coming up on June 10th in Paris. Doors open at 7:30 PM.
We’ve got not one but two awesome conferences going on in Amsterdam the week of June 16th:
- DevOps Days Amsterdam: Join us June 19-21st for an Elasticsearch workshop, a little on DevOps for Happier Humans and all the news you can use at the Elasticsearch table. Swing by to say hello to Leslie Hawthorn and Kurt Hurtado! Lee Hinmann will also join us for the workshop. Yay Lee!
- GOTO Amsterdam: Our CTO, Shay Banon, will treat the crowd to a deep dive on Elasticsearch’s APIs, plus we’ll have lots of knowledge and swag on offer at our booth. See you there!
- Martijn Laarman will be speaking on from text to full-text search at the NDC Oslo 2014 conference. The show runs from June 2-6th.
- Mark Harwood will cover Big Data on Elasticsearch at the ComperioFrokost on June 11th.
The Torun JUG will get together on May 28th at 6 PM to talk about Lucene and all of her friends, with a spotlight on Elasticsearch.
The Code Lounge user group in Auckland will get together on June 7th to talk Getting Control of Your Logs. Lots of Logstash love on offer.
- Alexander Reelsen will be speaking at Mimacom Days Zurich on June 4th. He will talk about continuous improvement of your data using the ELK stack at 9:45 AM, directly after the conference welcome remarks.
- Alexander Reelsen and Britta Weber will be speaking at the Zurich Elasticsearch Meetup on June 7th. Alex will discuss What’s new in Elasticsearch and Britta will cover the Significant Terms Aggregation. Doors open at 7 PM.
- The London Elasticsearch User Group has already announced that there next meeting will take place on June 4th. RSVP now to save your place – this one fills up very quickly!
- David Pilato will take the stage at DevoxxUK to discuss Advanced Search for Your Legacy Application DevoxxUK takes place June 12-13th in London, and David will speak on June 12th at 4 PM in Room 1.
- Elasticsearch will be out in force at London Technology Week. Join us June 17-19th to celebrate all that is fashionable in the UK tech scene, especially at the Networking Bar. Even cooler, on June 17th, Mark Harwood will be telling you all about how to Make sense of your (big) data and analyze in real-time like you have never done before! Mark’s talk is at 4 PM.
- This week’s event highlight: Costin Leau will speak at Hadoop Summit North America on Real-time Analytics and Anomalies Detection using Elasticsearch, Hadoop and Storm. The conference runs from June 3-5th in San Jose, California. Costin will take the stage at 4:35 PM on June 3rd.
- The Bay Area Search Meetup will welcome our CTO, Shay Banon, on June 3rd. We’re still sorting the final details, but register now. All of our Bay Area meetups fill lightning fast!
- The Elasticsearch New York City Meetup group will get together on June 4th at 6:30 PM. Agenda is still being finalized, but you’ll hear from our core developers during the meetup. Many thanks to ThoughtWorks for hosting us!
- The San Francisco Elasticsearch Meetup will focus on all things logging and DevOps goodness. Join Kurt Hurtado and Gaurav Gupta to learn all about how to get more love from your logs.
- The Philadelphia Elasticsearch Meetup group will be getting together for their second meetup on June 26th. Details are still in the works, but sign up now so you know where to go for ELK treats on the 26th.
Where to Find You
Our Community Manager, Leslie Hawthorn, is hard at work to help folks create more Elasticsearch meetup groups and to help meetup organizers find more speakers. If you are interested in either effort, take a moment to let her know.
Oh yeah, we’re alsoÂ hiring. If you’d like us to find you for employment purposes, justÂ drop us a note. Â We care more about your skill set and passion for Elasticearch, Kibana and Logstash than where you rest your head.
If you are interested in Elasticsearch training we have courses taught by our core developers coming up in: