Elasticsearch Now Certified On Cloudera Enterprise 5; Releases New Hadoop Connector
Los Altos, Calif. and Amsterdam, The Netherlands – 19 June 2014
Elasticsearch, Inc., the company on a mission to make data useful to businesses by delivering the world's most advanced search and analytics engine, today announced the 2.0 release of its Hadoop connector, Elasticsearch for Apache Hadoop, along with certification on Cloudera Enterprise 5. With Cloudera certification, Elasticsearch is now compatible with all Apache-based Hadoop distributions, including Hortonworks and MapR, helping businesses extract immediate insights regardless of where their hundreds of terabytes or even petabytes of data are stored.
Elasticsearch is the search and analytics engine behind the ELK stack, which also includes Logstash, a log management tool, and Kibana, a data visualization tool, to help businesses pull vital information from their data stores. When Elasticsearch is used in conjunction with Hadoop, organizations no longer need to run a batch process and wait hours to analyze their data – Elasticsearch for Apache Hadoop can pipe data to Elasticsearch for indexing as it's being generated, making it available for search and analysis in a matter of seconds. Kibana can also be used to explore massive amounts of data in Elasticsearch through easy-to-generate pie charts, bar graphs, scatter plots, histograms, and more.
How Businesses Leverage Elasticsearch and Hadoop
Elasticsearch is becoming a critical piece in pulling data from any environment and getting it into the hands of developers, engineering leads, CTOs, and CIOs who need insight into the moving parts of their business as changes happen. Customer examples include:
- Klout, which stores petabytes of its 400 million+ users' data in the Hadoop Distributed File System (HDFS) and connects it to Elasticsearch. Klout query results, used to build targeted marketing campaigns, are delivered in seconds rather than minutes.
- MutualMind, which enables customers like AT&T, Kraft, Nestle, and Starbucks to monitor their brands on social networks. After its Hadoop batches started taking 15+ minutes, MutualMind moved to Elasticsearch to power its real-time analytics, while utilizing Hadoop for statistical analysis.
- An international financial services firm that uses Elasticsearch to analyze its access logs in just minutes instead of having to wait hours to run MapReduce jobs. Because Elasticsearch provided insights so quickly on the firm's large amounts of data, the firm has been able to increase the window of data it can analyze from one hour to a full week.
Key Features of Elasticsearch for Apache Hadoop
- The ability to read and write data between Hadoop and Elasticsearch: Lets businesses get immediate, actionable insights by writing their data to Elasticsearch for real-time search and analysis. Complex jobs that would normally take minutes or hours to run in Hadoop can be handled quickly in Elasticsearch, and the results can be read right back into Hadoop.
- Native integration and support for popular Hadoop libraries: Lets users run queries natively on Hadoop through MapReduce, Hive, Pig, or Cascading APIs.
- Snapshot/Restore: Makes it easy to take a snapshot of data within Elasticsearch – perhaps a year's worth – and archive it in Hadoop. At any time, the snapshot can be restored back to Elasticsearch for additional analysis.
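As an illustration of the Snapshot/Restore feature described above, the following minimal sketch builds the REST requests that could register an HDFS-backed snapshot repository, take a snapshot, and restore it. It is not taken from the announcement: the repository type "hdfs" and its "uri" and "path" settings are assumed to be provided by the HDFS repository plugin, and the repository name, snapshot name, index names, and HDFS address are hypothetical. The sketch only constructs the requests and does not send them.

```python
import json


def register_hdfs_repository(repo, uri, path):
    """Build the request that registers a snapshot repository stored in HDFS.

    Assumption: the cluster has an HDFS repository plugin installed that
    accepts type "hdfs" with "uri" and "path" settings.
    """
    body = {"type": "hdfs", "settings": {"uri": uri, "path": path}}
    return ("PUT", f"/_snapshot/{repo}", json.dumps(body, sort_keys=True))


def create_snapshot(repo, snapshot, indices):
    """Build the request that snapshots the given indices into the repository."""
    body = {"indices": ",".join(indices)}
    path = f"/_snapshot/{repo}/{snapshot}?wait_for_completion=true"
    return ("PUT", path, json.dumps(body))


def restore_snapshot(repo, snapshot):
    """Build the request that restores a snapshot back into Elasticsearch."""
    return ("POST", f"/_snapshot/{repo}/{snapshot}/_restore", "")


if __name__ == "__main__":
    # Hypothetical names and addresses, for illustration only.
    requests_to_send = [
        register_hdfs_repository("hadoop_archive", "hdfs://namenode:8020", "/es/snapshots"),
        create_snapshot("hadoop_archive", "logs_2013", ["logs-2013"]),
        restore_snapshot("hadoop_archive", "logs_2013"),
    ]
    for method, path, body in requests_to_send:
        print(method, path, body)
```

Each tuple can be handed to any HTTP client pointed at an Elasticsearch node. Keeping request construction separate from sending makes the archive and restore steps easy to review before they touch a cluster.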
Supporting Quotes from Cloudera, Elasticsearch, and Klout
Steven Schuurman, Co-founder and CEO, Elasticsearch
"Hadoop was created to store and archive data at a massive scale, but businesses need to be able to ask, iterate, and extract actionable insights from this data – which is what we designed our products for. With today's certification from Cloudera, Elasticsearch now works with all Apache-based Hadoop distributions, and with it, solves the last mile of big data Hadoop deployments by getting big insights, fast."
Tim Stevens, Vice President of Business and Corporate Development at Cloudera
"Part of our mission at Cloudera is to support and promote an open architecture and allow customers to leverage their technology investments. Together, Cloudera and Elasticsearch provide businesses with a solution that allows them to get insight out of massive amounts of data."
Felipe Oliveria, Director of Engineering, Backend for Klout
"Elasticsearch has a very good integration with Hadoop. It allows us to export a Hive table to an index on Elasticsearch very easily. HBase is a great data store, and it allows random access to the data, which Elasticsearch is perfect for. Elasticsearch fits very nicely into our data pipeline."
Because Elasticsearch works across distributed, diverse environments, engineers can search, extract, clean up and analyze data whether it comes from log events, social media activity, support tickets, website analytics or product interactions. Thousands of businesses worldwide continue to adopt Elasticsearch to store, search and analyze any type of data in real time, including Bloomberg, Comcast, eBay, Facebook, GitHub, Mayo Clinic, McGraw-Hill, Netflix, The New York Times, Target, Verizon, WordPress and Yelp.
Upcoming Webinar: Real-time Analytics and Anomaly Detection using Elasticsearch and Apache Hadoop
On Wednesday, August 20, 2014 at 9:00am PT/12:00pm ET, Elasticsearch will host a webinar that goes over the features and benefits of Elasticsearch for Apache Hadoop, including a demonstration of how to use it as a platform to perform search and analytics, such as anomaly detection. To register, visit http://www.elasticsearch.org/webinars/elasticsearch-and-apache-hadoop.
About Elasticsearch, Inc.
Elasticsearch is on a mission to make massive amounts of data usable for businesses by delivering the world’s most advanced search and analytics engine. With a laser focus on achieving the best user experience imaginable, the Elasticsearch ELK stack – composed of Elasticsearch, Logstash, and Kibana – has become a rapidly growing open source solution in the market. The ELK stack is used by thousands of enterprises in virtually every industry today, and Elasticsearch, Inc. provides production support, development support, and training for the full stack.
Elasticsearch, Inc. was founded in 2012 by the people behind the Elasticsearch and Apache Lucene open source projects. Since its initial release, the ELK stack has been downloaded more than 10 million times. The company is backed by Benchmark Capital, Index Ventures, and NEA, and has headquarters in Amsterdam and Los Altos, California, with offices around the world.
To learn more, visit www.elasticsearch.com.