We have visited the different clustering algorithms offered by Mahout. We have also discussed how to use these algorithms with different datasets. We saw how to evaluate and improve cluster qualities. Now, in this final chapter, we will evaluate how to create a production-ready clustering model.
In this chapter, we will pick up one real-world scenario and discuss the following points:
Preparing the dataset
Launching the Mahout job on the cluster
Performance tuning for the job