Rapid - Apache Mahout Clustering designs

Chapter 4. Understanding the Fuzzy K-means Algorithm Using Mahout

In the previous chapter, we discussed Canopy clustering and ran it using Mahout. In this chapter, we will discuss another clustering algorithm: Fuzzy K-means. Fuzzy clustering is also called soft clustering because a data point can belong to more than one cluster. Fuzzy K-means clustering is also known as Fuzzy C-means (FCM). We will discuss the following topics in this chapter:

  • Learning Fuzzy K-means clustering

  • Using Mahout to execute Fuzzy K-means clustering

  • Visualizing the Fuzzy K-means cluster using Mahout
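
To make the idea of soft clustering from the introduction concrete before going further, here is a minimal Python sketch of how a single point can belong to two clusters at once. It uses the standard Fuzzy C-means membership formula; it is not Mahout's implementation. The function name `fuzzy_memberships`, the sample centers, and the fuzziness value `m = 2` are assumptions chosen only for illustration.

```python
import math

def fuzzy_memberships(point, centers, m=2.0):
    """Return the degree of membership of `point` in each cluster center.

    Illustrative only: uses the textbook FCM formula
    u_i = 1 / sum_k (d_i / d_k) ** (2 / (m - 1)), with m > 1.
    """
    distances = [math.dist(point, c) for c in centers]

    # A point that coincides with a center belongs fully to that center
    # (shared equally if several centers coincide with it).
    if any(d == 0.0 for d in distances):
        hits = [1.0 if d == 0.0 else 0.0 for d in distances]
        total = sum(hits)
        return [h / total for h in hits]

    exponent = 2.0 / (m - 1.0)
    return [
        1.0 / sum((d_i / d_k) ** exponent for d_k in distances)
        for d_i in distances
    ]

# Hypothetical example: two cluster centers and one point between them.
centers = [(0.0, 0.0), (4.0, 0.0)]
memberships = fuzzy_memberships((1.0, 0.0), centers)
print(memberships)  # roughly 0.9 for the first center and 0.1 for the second
```

The memberships always sum to 1, and the point leans toward the nearer center without being assigned to it exclusively. A hard clustering algorithm such as K-means would instead place the point entirely in the first cluster.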