Chapter 2. HDFS and MapReduce
We now have a basic understanding of the Apache Hadoop architecture and its inner workings. In this chapter, we will dive deeper into the two major components of Apache Hadoop: HDFS and MapReduce. We will cover the following topics:
- Essentials of the Hadoop Distributed File System
- The read/write operational flow in HDFS
- Exploring HDFS commands
- Getting acquainted with MapReduce