Elasticsearch for Apache Hadoop 2.1.3


See issues on GitHub

Release Notes

Enhancements

  • Hostname needs to be resolved to an IP#640
  • Improve creation of TrustManagers in SSL configuration#611
  • nested field extraction#605
  • Allow YARN specific parameters to be specified #520
  • Which column causes issue in MapperParsingException #395

Bug fixes

  • ES Hadoop does not retry on HTTP 429 #655
  • No data node with id found #563
  • When set "es.mapping.date.rich" to false, DataFrame schema not change to String or Long #672
  • ArrayWritable cannot be serialized #668
  • Fails to shutdown elasticsearch on YARN #658
  • RestService: missing index log message displays wrong setting #656
  • Hostname cannot be resolved if the uri schema is specified #652
  • multi index data frame causes OOM #634
  • Unable to get data from ElasticSearch in a variable via a PIG #632
  • Spark: saveToEs can evaluate the RDD twice#631
  • SimpleHttpConnectionManager problem with elasticsearch-hadoop-2.1.2.jar on Spark #618
  • es.mapping.exclude doesn't work for Hive #595
  • Parsing of argument fails #509

Docs

  • ElasticSearch+Spark in Java: error: package org.elasticsearch.spark.java.api does not exist #678
  • Fix typos #629
  • Update spark.adoc #612
  • Trouble writing to elasticsearch 2.0, spark 1.5#610
  • installation instruction does not work#609
  • [DOCS] Possible typo Apache Spark vs. Apache Storm#600
  • ES-HADOOP and ES V2?#597
  • [DOCS] Possible typo Apache Spark vs. Apache Storm #588

Reports

  • Error is thrown when multiple instances of the same es-hadoop library are deployed#685
  • Somehow elasticsearch-spark_2.10 depends on 2.11 version of scala-library#674
  • An error occurred while calling z:org.apache.spark.api.python.PythonRDD.newAPIHadoopRDD#670
  • Compressed snapshot for backing up#662
  • Caused by: java.io.IOException: org.elasticsearch.hadoop.rest.EsHadoopNoNodesLeftException: Connection error (check network and/or proxy settings)- all nodes failed; tried [[10.XXX.XXX.XX:9200]]#654
  • ClassNotFoundException EsPartition on spark_2.10-2.2.0-rc1#653
  • Compressed snapshot for backing up and restoring#646
  • how to use elasticsearch-spark to connect to elasticsearch server behind a proxy server#643
  • if nodeIp has '/' and the form /, just return . #641
  • ES hadoop problem finding the correct cluster nodes#636
  • IP Address badly parsed in org.elasticsearch.hadoop.serialization.dto.Node#630
  • Hadoop-Spark2Elasticsearch data ingestion problem: Elasticsearch index docs count is greater than Hive table rows count#628
  • Exception in thread "main" org.apache.spark.SparkException: Task not serializable#627
  • Resolve IP Address for spark.es.nodes param #623
  • Not able to process into Elasticsearch Found#622
  • SELECT * FROM tabletest WHERE col1 IN (0,10,5,27 )#615
  • ES 2.0 SSL problem#608
  • Hive loading data into ES error: org.elasticsearch.hadoop.rest.EsHadoopNoNodesLeftException: Connection error (check network and/or proxy settings)- all nodes failed#606
  • [SPARK] SparkContextFunctions.esRDD parameters #604 (issue: #592)
  • Exception org.elasticsearch.hadoop.rest.EsHadoopInvalidRequest with Spark 1.5.1, ES 2.0 and v2.2.0-beta1#603
  • a question with hive-es issue#601
  • Not specifying containers throws Exception#598
  • Too many requests?#594
  • Issue with Spark 1.5.1: es-hadoop "Connection error (check network and/or proxy settings)- all nodes failed"#591
  • java.lang.NoClassDefFoundError: org/apache/commons/httpclient/URIException#586
  • ClassNotFoundException: EsHadoopNoNodesLeftException#585
  • Unable to connect to my ElasticSearch server using HTTP Basic Auth#568