In this chapter, you learned about StreamingKMeans
clustering, which is used for streaming data. We discussed both steps involved in this algorithm—streaming and BallKMeans. We used Mahout Streaming K-means on the census1990 data. We also discussed the clusterQualitySummarizer
class. In the next chapter, we will discuss one more clustering algorithm implemented in Mahout—spectral clustering.
Rapid - Apache Mahout Clustering designs
Rapid - Apache Mahout Clustering designs
Overview of this book
Table of Contents (16 chapters)
Apache Mahout Clustering Designs
Credits
About the Author
About the Reviewers
www.PacktPub.com
Preface
Free Chapter
Understanding Clustering
Understanding K-means Clustering
Understanding Canopy Clustering
Understanding the Fuzzy K-means Algorithm Using Mahout
Understanding Model-based Clustering
Understanding Streaming K-means
Spectral Clustering
Improving Cluster Quality
Creating a Cluster Model for Production
Index
Customer Reviews