Elasticsearch for Apache Hadoop 1.3.0 M3


See issues on GitHub

Release Notes

New features:

  • ssh tunnelling support for es-hadoop #166
  • Support MRv1 and MRv2/YARN through the same jar #154
  • Spark cannot determine task id #152
  • IP is not a supported type #144
  • Provide Hadoop counters with the ES stats #141
  • better integration with Cascading provider interfaces #137
  • prevent Hadoop from kill long running jobs #135
  • ESVariableResourceOutputFormat - ability to write to different mappings when needed #125
  • cascading counters in ESTap #87

Enhancements:

  • Split project features into separate modules #182
  • Add validation of queries against ES mapping #179
  • [repository] Allow multiple config locations #176
  • Introduce dedicated Es-Hadoop exceptions #164
  • reading an index and writing into another one #156
  • Per tap configuration for cascading #146
  • split library/deployment jar to separate cascading, hive, pig and raw mapreduce #86
  • Elasticsearch+Hadoop read and write #45
  • Not possible to query and create indexes in the same Job #26

Bug fixes:

  • Changing `es.resource` for writing during mapping phase #181
  • fix#169 MapJoin failed, Configuration and input path are inconsistent #173
  • [REGRESSION] Cascading Tap with host port in M2 #171
  • MapJoin failed, Configuration and input path are inconsistent #169
  • inconsistent num of docs written to ES #167
  • Hive dependency required when not needed #165
  • EsTap is not working #162
  • Reading double in hive #157
  • Cascading tap should persistent its settings if configured as a tap #155
  • NPE in HeartBeat.java in development snapshot #148
  • big mapping can cause UTFDataFormatException #147
  • es.port is not always used when set to non default port #138
  • regression: writing to elasticsearch failing with exception #133
  • Issue while joining two tables stored on Elasticsearch using HiveQL #180
  • Can't insert to ES - Malformed content, must start with an object #178
  • support dynamic index/type #175
  • unable to query elastic search repository from hive #172
  • pattern based indexes #70
  • Cascading tap needs to be able to write to multiple indexes? #51