Elasticsearch for Apache Hadoop 2.1.0.rc1


See issues on GitHub

Release Notes

New features

  • Add latest Spark 1.3.1 data source filters #461

Enhancements

  • Spark SQL 1.3 support - esDF function incorrectly maps column names #451
  • Adding possibility to backup indexes into hdfs authorized by kerberos. #450
  • case sensitive columns in Pig #380
  • Upgrade to Storm 0.9.5 #475
  • Upgrade to Hive 1.2 #457

Bug fixes

  • ISO8601 dates with timezone offset considered invalid #458
  • Fix json field extraction with mix of nested objects. #456 (issue: #455)
  • JSON schema inference corrupt with Elasticsearch Spark #441

Docs

  • Fix package name for JavaEsSpark in example. #472
  • add section on SSL / PKI support #468
  • Document the beta status #465
  • Hadoop Configuration setMapOutputValueClass method doesn't exists #454
  • Add Spark write RDD example #453
  • Fix copy paste errors #445
  • Use javadoc of Cascading 2.6 #438
  • Update socks proxy configuration in configuration.adoc #419

Feedback

  • Issue With Elasticsearch Hadoop Config, Please help me #474
  • Not able to insert data into ES using elasticsearch-hadoop-2.1.0.Beta4.jar #473
  • full body search from Spark #466
  • Reading BulkResponse of BulkRequestBuilder in spark? #463
  • Behavior of elasticsearch-hadoop/spark when es nodes are restarted #462
  • is JOIN operation possible in ElasticSearch using a Presto Connector ? #459
  • Adding Nodes in ES-Yarn #443
  • Version compatibility detection wrong in 2.1.0.BUILD-SNAPSHOT (Tue Apr 28 00:37:19 EEST 2015) #440
  • ElasticSearch, Hive, and HBase #439
  • explicit client nodes vs preferred parallelism. Method to confirm what spark is actually using. #437
  • Jackson error in elasticsearch hadoop while loading data to elasticsearch #425