GaussianMixtureSummary¶

class
pyspark.ml.clustering.
GaussianMixtureSummary
(java_obj=None)[source]¶ Gaussian mixture clustering results for a given model.
New in version 2.1.0.
Attributes
DataFrame of predicted cluster centers for each training data point.
Size of (number of data points in) each cluster.
Name for column of features in predictions.
The number of clusters the model was trained with.
Total loglikelihood for this model on the given data.
Number of iterations.
Name for column of predicted clusters in predictions.
DataFrame produced by the model’s transform method.
DataFrame of probabilities of each cluster for each training data point.
Name for column of predicted probability of each cluster in predictions.
Attributes Documentation

cluster
¶ DataFrame of predicted cluster centers for each training data point.
New in version 2.1.0.

clusterSizes
¶ Size of (number of data points in) each cluster.
New in version 2.1.0.

featuresCol
¶ Name for column of features in predictions.
New in version 2.1.0.

k
¶ The number of clusters the model was trained with.
New in version 2.1.0.

logLikelihood
¶ Total loglikelihood for this model on the given data.
New in version 2.2.0.

numIter
¶ Number of iterations.
New in version 2.4.0.

predictionCol
¶ Name for column of predicted clusters in predictions.
New in version 2.1.0.

predictions
¶ DataFrame produced by the model’s transform method.
New in version 2.1.0.

probability
¶ DataFrame of probabilities of each cluster for each training data point.
New in version 2.1.0.

probabilityCol
¶ Name for column of predicted probability of each cluster in predictions.
New in version 2.1.0.
