Elasticsearch for Apache Hadoop 2.1.3
See issues on GitHub
Release Notes
Enhancements
- Hostname needs to be resolved to an IP#640
- Improve creation of TrustManagers in SSL configuration#611
- nested field extraction#605
- Allow YARN specific parameters to be specified #520
- Which column causes issue in MapperParsingException #395
Bug fixes
- ES Hadoop does not retry on HTTP 429 #655
- No data node with id found #563
- When set "es.mapping.date.rich" to false, DataFrame schema not change to String or Long #672
- ArrayWritable cannot be serialized #668
- Fails to shutdown elasticsearch on YARN #658
- RestService: missing index log message displays wrong setting #656
- Hostname cannot be resolved if the uri schema is specified #652
- multi index data frame causes OOM #634
- Unable to get data from ElasticSearch in a variable via a PIG #632
- Spark: saveToEs can evaluate the RDD twice#631
- SimpleHttpConnectionManager problem with elasticsearch-hadoop-2.1.2.jar on Spark #618
- es.mapping.exclude doesn't work for Hive #595
- Parsing of argument fails #509
Docs
- ElasticSearch+Spark in Java: error: package org.elasticsearch.spark.java.api does not exist #678
- Fix typos #629
- Update spark.adoc #612
- Trouble writing to elasticsearch 2.0, spark 1.5#610
- installation instruction does not work#609
- [DOCS] Possible typo Apache Spark vs. Apache Storm#600
- ES-HADOOP and ES V2?#597
- [DOCS] Possible typo Apache Spark vs. Apache Storm #588
Reports
- Error is thrown when multiple instances of the same es-hadoop library are deployed#685
- Somehow elasticsearch-spark_2.10 depends on 2.11 version of scala-library#674
- An error occurred while calling z:org.apache.spark.api.python.PythonRDD.newAPIHadoopRDD#670
- Compressed snapshot for backing up#662
- Caused by: java.io.IOException: org.elasticsearch.hadoop.rest.EsHadoopNoNodesLeftException: Connection error (check network and/or proxy settings)- all nodes failed; tried [[10.XXX.XXX.XX:9200]]#654
- ClassNotFoundException EsPartition on spark_2.10-2.2.0-rc1#653
- Compressed snapshot for backing up and restoring#646
- how to use elasticsearch-spark to connect to elasticsearch server behind a proxy server#643
- if nodeIp has '/' and the form
/ , just return . #641 - ES hadoop problem finding the correct cluster nodes#636
- IP Address badly parsed in org.elasticsearch.hadoop.serialization.dto.Node#630
- Hadoop-Spark2Elasticsearch data ingestion problem: Elasticsearch index docs count is greater than Hive table rows count#628
- Exception in thread "main" org.apache.spark.SparkException: Task not serializable#627
- Resolve IP Address for spark.es.nodes param #623
- Not able to process into Elasticsearch Found#622
- SELECT * FROM tabletest WHERE col1 IN (0,10,5,27 )#615
- ES 2.0 SSL problem#608
- Hive loading data into ES error: org.elasticsearch.hadoop.rest.EsHadoopNoNodesLeftException: Connection error (check network and/or proxy settings)- all nodes failed#606
- [SPARK] SparkContextFunctions.esRDD parameters #604 (issue: #592)
- Exception org.elasticsearch.hadoop.rest.EsHadoopInvalidRequest with Spark 1.5.1, ES 2.0 and v2.2.0-beta1#603
- a question with hive-es issue#601
- Not specifying containers throws Exception#598
- Too many requests?#594
- Issue with Spark 1.5.1: es-hadoop "Connection error (check network and/or proxy settings)- all nodes failed"#591
- java.lang.NoClassDefFoundError: org/apache/commons/httpclient/URIException#586
- ClassNotFoundException: EsHadoopNoNodesLeftException#585
- Unable to connect to my ElasticSearch server using HTTP Basic Auth#568