The hardware specification might vary according to the amount of data to be stored and the type of processing power required. It is recommended to use the following configurations:
1 to 4 TB hard disks
Two (8 to 24 core) processors, running at least 2 to 2.5 GHz
64 to 512 GB of memory
Bonded Gigabit Ethernet or 10 Gigabit Ethernets
Now, let's explain these hardware components in more detail:
CPU: The workload depends on this hardware component. It is recommended that we have a medium-clock-speed CPU with two slots for DataNodes. Why medium? This is because the high-end processor cost of a setup rises quickly, so we can have a comparatively cheaper CPU with more machines than use fewer machines with high-end processors. So, it is recommended to have 8 to 24 core processors with medium CPU cycle for less power consumption.
Power: This is also a component to consider when configuring a Hadoop cluster because power consumption tends to go up with high...