Clustering is a method of unsupervised learning, and a common technique for statistical data analysis used in many fields, including machine learning, data mining, pattern recognition, image analysis and bioinformatics.

Combining partitional and hierarchical algorithms for robust and efficient data clustering with cohesion self-merging  Cheng-Ru Lin, Ming-Syan Chen
A Comparison of Document Clustering Techniques  Michael Steinbach, George Karypis, Vipin Kumar
A Density-Based Algorithm for Discovering Clusters in Large Spatial Databases with Noise  Martin Ester, Hans-Peter Kriegel, Jörg Sander, Xiaowei Xu - The authors introduce DBSCAN, a new density-based algorithm.
A k-means clustering algorithm  JA Hartigan, MA Wong
A Survey of Clustering Data Mining Techniques  Pavel Berkhin
A survey of Web clustering engines  Claudio Carpineto, Stanislaw Osiński, Giovanni Romano, Dawid Weiss
A Very Fast Method for Clustering Big Text Datasets  Frank Lin and William W. Cohen
Building High-level Features Using Large Scale Unsupervised Learning  Quoc V. Le, Marc
Carrot2 and Language Properties in Web Search Results Clustering   - A clustering engine
Clustering - What Both Theoreticians and Practitioners are Doing Wrong  Shai Ben-David
Clustering Related Stories  Jenny Finkel - How-to: real-world clustering in layman's terms.
The Elements of Statistical Learning: Data Mining, Inference, and Prediction  Trevor Hastie, Robert Tibshirani, Jerome Friedman
Web Document Clustering Using The Suffix Trees  Oren Zamir, Oren Etzioni - This paper describes the STC clustering algorithm. To satisfy the stringent requirements of the Web domain, we introduce an incremental, linear time (in the document collection size) algorithm called Suffix Tree Clustering (STC).
Yale Clustering - A Plugin for Advanced Clustering in Yale   - The YALE Clustering Plugin provides a framework and some basic functionality to enable advanced clustering in YALE.