K-Medoids聚类（K-Medoids Clustering）

定义

K-Medoids（也称 PAM，Partitioning Around Medoids）是一种基于实际数据点作为簇代表的聚类算法。与 K-Means 使用簇内均值作为质心不同，K-Medoids 始终选择簇内的一个实际数据点作为中心点（Medoid），因此对噪声和异常值更加鲁棒。

K-Medoids 最小化簇内数据点到簇中心的曼哈顿距离或其他距离度量：

\min \sum_{i=1}^{K} \sum_{\mathbf{x} \in \mathcal{C}_i} D(\mathbf{x}, \mathbf{m}_i)

其中 $\mathbf{m}_i$ 为第 $i$ 个簇的实际中心点， $D(\cdot, \cdot)$ 为距离函数。

随机选择 $K$ 个数据点作为初始 Medoids
重复直至收敛：
- 分配步骤：将每个数据点分配到最近的 Medoid
- 交换步骤：尝试用非 Medoid 点替换当前 Medoid，若替换降低总代价则接受

Klonowski（2025）使用 K-Medoids 聚类方法对地月空间 SSA 帕累托最优架构进行分类：

K-Medoids 将数据集 $\mathcal{X} = \{\mathbf{x}_1, ..., \mathbf{x}_N\}$ 划分为 $K$ 个簇，每个簇由一个实际数据点 $\mathbf{m}_i$ 代表，最小化总距离代价。

K-Medoids 对噪声和异常值鲁棒，因为中心点是实际数据点而非均值。计算复杂度高于 K-Means，但结果更稳定。

K-Medoids 适用于需要鲁棒聚类的场景，如异常检测、图像分割、文档聚类以及本文档重点关注的 SSA 架构分类。

Kaufman L, Rousseeuw P J. Finding groups in data: an introduction to cluster analysis[M]. John Wiley & Sons, 1990.
Klonowski M, Owens-Fahrner N, Heidrich C, et al. Cislunar space domain awareness architecture design and analysis for cooperative agents[J]. The Journal of the Astronautical Sciences, 2024.