To configure a Hadoop cluster in fully-distributed mode, we need to configure all the master and slave machines. Although this mode differs from the pseudo-distributed mode, the configuration experience is similar. In this recipe, we will outline the steps to configure Hadoop in fully-distributed mode.
In this book, we propose to configure a Hadoop cluster with one master node and five slave nodes. The hostname of the master node is 1 and the hostnames of the slave nodes are slave1, slave2, slave3, slave4, and slave5.
Before getting started, we assume that Linux has been installed on all the cluster nodes. We should also validate password-less login to each slave node by running the following commands on the master node:
ssh hduser@slave1
ssh hduser@slave2
...
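Checking each slave by hand gets tedious with five nodes, so the same validation can be scripted. The following is a minimal sketch rather than part of the recipe itself: the function name check_passwordless_login is hypothetical, and the 5-second connection timeout is an arbitrary choice. It relies on the standard OpenSSH option BatchMode=yes, which makes ssh fail instead of prompting for a password, so a node without working key-based login is reported rather than blocking the loop.

```shell
#!/bin/sh
# Sketch: verify password-less SSH login from the master node to each slave.
# The function name is hypothetical; the user and hosts are passed as arguments.

check_passwordless_login() {
    user="$1"
    shift
    failed=0
    for host in "$@"; do
        # BatchMode=yes disables password prompts, so ssh exits non-zero
        # when key-based login is not set up for this host.
        # ConnectTimeout=5 is an assumed value; adjust it for your network.
        if ssh -o BatchMode=yes -o ConnectTimeout=5 "$user@$host" true 2>/dev/null; then
            echo "OK   $user@$host"
        else
            echo "FAIL $user@$host"
            failed=1
        fi
    done
    # Return 0 only if every host accepted the login.
    return "$failed"
}
```

On the master node, the function could be called as check_passwordless_login hduser slave1 slave2 slave3 slave4 slave5. Any line starting with FAIL points to a node whose SSH keys or connectivity need attention before continuing.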