It is important to preserve the state of the ResourceManager (RM) across a restart, so that running applications continue with minimal interruption. The RM persists application state in a state store and reloads it on restart. ApplicationMasters (AMs) and NodeManagers continuously poll the RM and re-register with it once it is available again, so that containers resume from the saved state.
For this recipe, you will again need a running cluster, and you should have completed the previous recipes to make sure that HDFS and YARN are working correctly.
1. Connect to the master1.cyrus.com master node and switch to the user hadoop.
2. Navigate to the directory /opt/cluster/hadoop/etc/hadoop.
3. Edit the yarn-site.xml configuration file to make the necessary changes as shown in the following steps.
4. Enable RM recovery by making changes as shown in the following screenshot:
5. Specify the state-store to be used for this, as shown in the following...
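
As a rough illustration of the two yarn-site.xml changes described in the steps above, a minimal fragment might look like the sketch below. The property names yarn.resourcemanager.recovery.enabled and yarn.resourcemanager.store.class are standard YARN settings. The store class value is an assumption chosen only for illustration: FileSystemRMStateStore is one of the implementations shipped with Hadoop, and this recipe may use a different one (for example, a ZooKeeper-based store).

```xml
<!-- Sketch only: add inside the existing <configuration> element of yarn-site.xml -->

<!-- Turn on RM state recovery -->
<property>
  <name>yarn.resourcemanager.recovery.enabled</name>
  <value>true</value>
</property>

<!-- Assumption: a file-system-backed state store; the recipe's actual choice may differ -->
<property>
  <name>yarn.resourcemanager.store.class</name>
  <value>org.apache.hadoop.yarn.server.resourcemanager.recovery.FileSystemRMStateStore</value>
</property>
```

Depending on the store chosen, further properties are usually needed, such as the location of the store. The RM must be restarted for the change to take effect.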