As the Hadoop infrastructure starts to grow, several new requirements will start to appear such as automating the creation of images, keeping the Hadoop cluster resilient to failure, and looking to the best outfit for the big data cluster networking configuration. In this chapter, we will discuss several topics that come up with Sahara, which offers in the latest stable releases more advanced functionalities that allow setting up a more customized Hadoop cluster within more possibilities and choices of configuration. This chapter will examine the following topics:
Discussing different plugins supported by Sahara in the current version
Creating images for different Sahara plugins using image tools out of the box
Checking the requirements and limitations for each plugin in Sahara
Learning what is an affinity group and how to use it in Sahara
Understanding data locality and how to use it in Sahara
Discussing different networking configurations...