We now have a basic understanding of the Apache Hadoop architecture and its inner workings. In this chapter, we will dive deeper into its two major components, HDFS and MapReduce, and cover the following topics:
Essentials of the Hadoop Distributed File System (HDFS)
The read/write operational flow in HDFS
Exploring HDFS commands
Getting acquainted with MapReduce