Deep Dive into Clustering: k-Means and HDBSCAN: a detailed comparison

unsupervised learning
clustering
code
Introduction to HDBSCAN with examples and code from Daniel Capellupo, PhD
Published

February 25, 2021

Modified

June 14, 2024

Original Blog Post: Deep Dive into Clustering: k-Means and HDBSCAN: a detailed comparison
Authors: Daniel Capellupo, PhD
Published: 2021-02-10

My summary:

This post gives examples of different shaped clusters and looks at how k-Means handles these. It then compares how the HDBSCAN algorithm handles these different clustering scenarios. In general, HDBSCAN does quite well, even with the default hyperparameters.

There is also a Jupyter notebook available with all the code used for generating the examples in this blog post.

Here is the link again to their blog post for more details: Deep Dive into Clustering: k-Means and HDBSCAN: a detailed comparison