This Week in Elasticsearch - May 28, 2014

Welcome to This Week in Elasticsearch. In this roundup, we try to inform you about the latest and greatest changes in Elasticsearch. We cover what happened in the GitHub repositories, as well as many Elasticsearch events happening worldwide, and give you a small peek into the future of the project.

Elasticsearch core

  • Field data: Global ordinals cause ClassCastExceptions if used with a bounded fielddata cache (#6260, master and 1.2)
  • Percolator API: Fix assertion in percolation with nested docs (#6263, master, 1.2 and 1.1)
  • More Like This Query: replaced exclude with include to avoid double negation (#6248, master and 1.2)
  • Lucene: Entirely cut over to TopDocs#merge for merging shard docs in the reduce phase (#6197, master and 1.3)
  • Percolate API: Support for executing significant terms and geo hash grid aggregations (#6037, master and 1.3)
  • Percolate API: Add num_of_shards statistic to percolate context (#6123, master and 1.3)
  • Aggregations: date_histogram aggregation breaks on date fields with multiple formats (#6239, master and 1.3)
  • More Like This Query: allow for both like_text and docs/ids to be specified (#6246, master and 1.3)
  • Highlighting: Highlight fields in request order (#6178, master and 1.3)
  • Aggregations: Fix ReverseNestedAggregator to compute the parent document correctly (#6278, master, 1.3 and 1.2)
  • Nested: Make sure queries/filters/aggs get a FixedBitSet when they expect one (#6279, master, 1.3, 1.2 and 1.1)
  • Guava: Filter cache size limit not honored for 32GB or over (#6268, master, 1.3, 1.2 and 1.1)
  • Aggregations: Added top_hits aggregation that keeps track of the most relevant document being aggregated per bucket (#6124, master and 1.3)
  • Startup: Improve error when mlockall fails (#6288, master and 1.3)
  • Query API: Fix MatchQueryParser not parsing fuzzy_transpositions (#6300, master, 1.3, 1.2 and 1.1)
  • Startup: Reset locale to en_US.UTF-8 in bin/elasticsearch (#6047, master and 1.3)
  • Highlighting: Plain highlighter to use analyzer defined on a document level when available (#5497, master and 1.3)
  • Serialization: Add support for Byte to the XContentBuilder (#6127, master and 1.3)
  • Mustache: Ensure internal scope extractors are always operating on a Map (#6138, master, 1.3, 1.2 and 1.1)
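
To give a feel for the new top_hits aggregation from the list above, here is a minimal sketch of a search request body that groups documents with a terms aggregation and keeps the most relevant document per bucket. The field name `tags`, the aggregation names `by_group` and `best_docs`, and the helper function are hypothetical and only serve the illustration; the sketch builds the JSON body and does not talk to a cluster.

```python
import json


def build_top_hits_request(group_field, hits_per_bucket=1):
    """Build a search body that buckets documents by `group_field` and keeps
    the best-scoring hits of each bucket via a top_hits sub-aggregation.

    The aggregation names ("by_group", "best_docs") are made up for this sketch.
    """
    return {
        "query": {"match_all": {}},
        "aggs": {
            "by_group": {
                # One bucket per distinct value of the grouping field.
                "terms": {"field": group_field},
                "aggs": {
                    # Keep the most relevant document(s) inside each bucket.
                    "best_docs": {"top_hits": {"size": hits_per_bucket}}
                },
            }
        },
    }


if __name__ == "__main__":
    # "tags" is an assumed field name; replace it with a field from your mapping.
    print(json.dumps(build_top_hits_request("tags"), indent=2))
```

The printed body could then be sent to the search endpoint of an index with whichever client you prefer. Since top_hits is listed for master and 1.3, it should not be expected to work on earlier releases.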

Elasticsearch Ecosystem

Here’s some more information about what is happening in the ecosystem we are maintaining around the ELK stack – that’s Elasticsearch plus Logstash and Kibana – including plugin and driver releases.

Binh Ly Introduces Kibana at last week’s Chicago Meetup

Slides & Videos

We’ve got a bunch of slide & video love from Berlin Buzzwords and beyond!

Where to find Us

We’d love to feature all the great Elasticsearch, Logstash and Kibana presentations and meetups happening worldwide in this section. If you’re speaking or hosting a meetup, let our Community Manager, Leslie Hawthorn, know!

Austria

The Elasticsearch Vienna Meetup Group has scheduled their first meetup for Thursday, June 12th. Please join us at 7 PM to hear from Alexander Reelsen on What’s New in Elasticsearch.

Bulgaria

Honza Kral will discuss how to Explore Your Data using Elasticsearch at the Bulgarian Web Summit 2014. The conference takes place on May 31st in Sofia.

France

The Seventh Elasticsearch France User Group meeting is coming up on June 10th in Paris. Doors open at 7:30 PM.

The Netherlands

We’ve got not one but two awesome conferences going on in Amsterdam the week of June 16th:

  • DevOps Days Amsterdam: Join us June 19-21st for an Elasticsearch workshop, a little on DevOps for Happier Humans and all the news you can use at the Elasticsearch table. Swing by to say hello to Leslie Hawthorn and Kurt Hurtado! Lee Hinman will also join us for the workshop. Yay Lee!
  • GOTO Amsterdam: Our CTO, Shay Banon, will treat the crowd to a deep dive on Elasticsearch’s APIs, plus we’ll have lots of knowledge and swag on offer at our booth. See you there!

Norway

Poland

The Torun JUG will get together on May 28th at 6 PM to talk about Lucene and all of her friends, with a spotlight on Elasticsearch.

Romania

Costin Leau will speak at Topconf Bucharest 2014 on Big data real time search and analytics. Topconf Bucharest runs from June 10-13th and Costin will speak at 3:20 PM on June 12th.

Russia

Honza Kral will speak at PyCon Ru. The schedule is still being finalized, but mark your calendars for June 2nd and 3rd. If you’re heading to PyCon Ru, make sure to say hello to Honza!

New Zealand

The Code Lounge user group in Auckland will get together on June 7th to talk Getting Control of Your Logs. Lots of Logstash love on offer.

Switzerland

  • Alexander Reelsen will be speaking at Mimacom Days Zurich on June 4th. He will talk about continuous improvement of your data using the ELK stack at 9:45 AM, directly after the conference welcome remarks.
  • Alexander Reelsen and Britta Weber will be speaking at the Zurich Elasticsearch Meetup on June 7th. Alex will discuss What’s new in Elasticsearch and Britta will cover the Significant Terms Aggregation. Doors open at 7 PM.

United Kingdom

  • The London Elasticsearch User Group has already announced that their next meeting will take place on June 4th. RSVP now to save your place – this one fills up very quickly!
  • David Pilato will take the stage at DevoxxUK to discuss Advanced Search for Your Legacy Application. DevoxxUK takes place June 12-13th in London, and David will speak on June 12th at 4 PM in Room 1.
  • Elasticsearch will be out in force at London Technology Week. Join us June 17-19th to celebrate all that is fashionable in the UK tech scene, especially at the Networking Bar. Even cooler, on June 17th, Mark Harwood will be telling you all about how to Make sense of your (big) data and analyze in real-time like you have never done before! Mark’s talk is at 4 PM.

United States

  • This week’s event highlight: Costin Leau will speak at Hadoop Summit North America on Real-time Analytics and Anomalies Detection using Elasticsearch, Hadoop and Storm. The conference runs from June 3-5th in San Jose, California. Costin will take the stage at 4:35 PM on June 3rd.
  • The Bay Area Search Meetup will welcome our CTO, Shay Banon, on June 3rd. We’re still sorting the final details, but register now. All of our Bay Area meetups fill lightning fast!
  • The Elasticsearch New York City Meetup group will get together on June 4th at 6:30 PM. Agenda is still being finalized, but you’ll hear from our core developers during the meetup. Many thanks to ThoughtWorks for hosting us!
  • The San Francisco Elasticsearch Meetup will focus on all things logging and DevOps goodness. Join Kurt Hurtado and Gaurav Gupta to learn all about how to get more love from your logs.
  • The Philadelphia Elasticsearch Meetup group will be getting together for their second meetup on June 26th. Details are still in the works, but sign up now so you know where to go for ELK treats on the 26th.

Where to Find You

Our Community Manager, Leslie Hawthorn, is hard at work to help folks create more Elasticsearch meetup groups and to help meetup organizers find more speakers. If you are interested in either effort, take a moment to let her know.

Oh yeah, we’re also hiring. If you’d like us to find you for employment purposes, just drop us a note. We care more about your skill set and passion for Elasticsearch, Kibana and Logstash than where you rest your head.

Trainings

If you are interested in Elasticsearch training we have courses taught by our core developers coming up in:

  • Zurich – June 5, 2014 (core Elasticsearch training)
  • San Francisco – June 6, 2014 (ELK workshop)
  • Amsterdam – June 27, 2014 (ELK workshop)
  • Seoul – July 10, 2014 (core Elasticsearch training)
  • Tokyo – July 14, 2014 (core Elasticsearch training)