To continue exploring data storage models that are less rigid than traditional RDBMS structures, we will look at Apache Hadoop. Hadoop was created by Doug Cutting and Mike Cafarella in 2005. Cutting, who joined Yahoo! the following year, named it after his son's toy elephant. It was originally developed to support distributed processing for the Nutch search engine project.
Hadoop builds on ideas that Google published in its papers on the Google File System (GFS) and MapReduce. After more than a decade of use, Hadoop has grown into a very large and complex ecosystem, with projected revenue of around $23 billion in 2016. It underpins everything from repackaged distributions to full database implementations, analytics packages, and management solutions.