Apache Karaf Cookbook

By: Jamie Goodyear, Johan Edstrom, Achim Nierbeck, Heath J Kesler

Introduction


To continue building on data storage models that are more flexible than traditional RDBMS structures, we will look at Apache Hadoop. Hadoop was created by Doug Cutting and Mike Cafarella in 2005. Cutting, who was working at Yahoo! at the time, named it after his son's toy elephant. It was originally developed to support distribution for the Nutch search engine project.

Hadoop followed the ideas that Google published in its papers on the Google File System and MapReduce. With over a decade of use, Hadoop has grown into a very large and complex ecosystem, with a projected revenue of around $23 billion in 2016. Hadoop drives everything from repackaged distributions to full database implementations, analytics packages, and management solutions.

Hadoop has also started to change the way startups look at their data models, allowing new companies to make Big Data part of their overall strategy.

At the core of Hadoop is the Hadoop Distributed File System (HDFS). This mechanism...