object KMeans extends Serializable
Top-level methods for calling K-means clustering.
- Annotations
- @Since( "0.8.0" )
- Source
- KMeans.scala
- Alphabetic
- By Inheritance
- KMeans
- Serializable
- Serializable
- AnyRef
- Any
- Hide All
- Show All
- Public
- All
Value Members
-
val
K_MEANS_PARALLEL: String
- Annotations
- @Since( "0.8.0" )
-
val
RANDOM: String
- Annotations
- @Since( "0.8.0" )
-
def
train(data: RDD[Vector], k: Int, maxIterations: Int): KMeansModel
Trains a k-means model using specified parameters and the default values for unspecified.
Trains a k-means model using specified parameters and the default values for unspecified.
- Annotations
- @Since( "0.8.0" )
-
def
train(data: RDD[Vector], k: Int, maxIterations: Int, initializationMode: String): KMeansModel
Trains a k-means model using the given set of parameters.
Trains a k-means model using the given set of parameters.
- data
Training points as an
RDD
ofVector
types.- k
Number of clusters to create.
- maxIterations
Maximum number of iterations allowed.
- initializationMode
The initialization algorithm. This can either be "random" or "k-means||". (default: "k-means||")
- Annotations
- @Since( "2.1.0" )
-
def
train(data: RDD[Vector], k: Int, maxIterations: Int, initializationMode: String, seed: Long): KMeansModel
Trains a k-means model using the given set of parameters.
Trains a k-means model using the given set of parameters.
- data
Training points as an
RDD
ofVector
types.- k
Number of clusters to create.
- maxIterations
Maximum number of iterations allowed.
- initializationMode
The initialization algorithm. This can either be "random" or "k-means||". (default: "k-means||")
- seed
Random seed for cluster initialization. Default is to generate seed based on system time.
- Annotations
- @Since( "2.1.0" )