Apache Hadoop is an open-source platform for developing and deploying Big Data applications. It originated in the Apache Nutch project and was developed extensively at Yahoo!, drawing on the MapReduce and Google File System papers published by Google. Over the past few years, Hadoop has become the flagship Big Data platform.
The following are the key components of a Hadoop cluster:
Hadoop Distributed File System (HDFS)
Yet Another Resource Negotiator (YARN)
Both HDFS and YARN are built on a set of libraries called Hadoop Common. Hadoop Common provides an abstraction over OS and filesystem operations so that Hadoop can be deployed on a variety of platforms. Now let's take a closer look at HDFS and YARN.