[SPARK-3261][MLLIB] KMeans clusterer can return duplicate cluster centers
## What changes were proposed in this pull request?

Return potentially fewer than k cluster centers in cases where k distinct centroids aren't available or aren't selected.

## How was this patch tested?

Existing tests

Author: Sean Owen <sowen@cloudera.com>

Closes #15450 from srowen/SPARK-3261.
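A minimal sketch of the behavior described above, assuming plain Scala collections rather than Spark's `Vector` and `RDD` types. The object `DistinctCenters` and the helper `chooseCenters` are hypothetical names introduced for illustration; this is not the code from the patch, only one way to picture "fewer than k centers when k distinct points are not available".

```scala
// Hypothetical illustration only: not the implementation in this patch.
object DistinctCenters {
  // A point is modelled as an immutable vector of doubles (an assumption;
  // Spark MLlib uses its own linalg Vector type).
  type Point = Vector[Double]

  /** Pick up to k distinct centers from the candidates, keeping first-seen order.
    * If fewer than k distinct candidates exist, fewer than k centers are returned.
    */
  def chooseCenters(candidates: Seq[Point], k: Int): Seq[Point] = {
    require(k > 0, "k must be positive")
    // Vector equality is structural, so duplicates are removed by value.
    candidates.distinct.take(k)
  }

  def main(args: Array[String]): Unit = {
    val data = Seq(Vector(1.0, 1.0), Vector(1.0, 1.0), Vector(2.0, 2.0))
    val centers = chooseCenters(data, k = 3)
    // Only two distinct points exist, so two centers come back even though k = 3.
    println(s"requested 3, got ${centers.size}")
  }
}
```

Callers of such a function would need to read the number of clusters from the returned centers instead of assuming it equals the requested k.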
Showing
- mllib/src/main/scala/org/apache/spark/ml/clustering/KMeans.scala: 3 additions, 1 deletion
- mllib/src/main/scala/org/apache/spark/mllib/clustering/KMeans.scala: 17 additions, 10 deletions
- mllib/src/test/scala/org/apache/spark/mllib/clustering/KMeansSuite.scala: 65 additions, 54 deletions