Book Image

Rapid - Apache Mahout Clustering designs

Book Image

Rapid - Apache Mahout Clustering designs

Overview of this book

Table of Contents (16 chapters)
Apache Mahout Clustering Designs
Credits
About the Author
About the Reviewers
www.PacktPub.com
Preface
Index

Clustering algorithms in Mahout


Mahout has the implementation of the following clustering algorithms (as of release 0.9):

  • K-means clustering: This is available as both single machine and map reduce way

  • Fuzzy K-means: This is available as both single machine and map reduce way

  • Streaming K-means: This is available as both single machine and map reduce way

  • Spectral clustering: This is available only as map reduce way

  • Canopy clustering: This is available as map reduce way

  • Latent Dirichlet Allocation for topic modeling: This is available as both single machine and map reduce way