SummaryBuilder

class pyspark.ml.stat.SummaryBuilder(jSummaryBuilder)[source]

A builder object that provides summary statistics about a given column.

Users should not directly create such builders, but instead use one of the methods in pyspark.ml.stat.Summarizer

New in version 2.4.0.

Methods

summary(featuresCol[, weightCol])

Returns an aggregate object that contains the summary of the column with the requested metrics.

Methods Documentation

summary(featuresCol, weightCol=None)[source]

Returns an aggregate object that contains the summary of the column with the requested metrics.

New in version 2.4.0.

Parameters:
featuresColstr

a column that contains features Vector object.

weightColstr, optional

a column that contains weight value. Default weight is 1.0.

Returns:
pyspark.sql.Column

an aggregate column that contains the statistics. The exact content of this structure is determined during the creation of the builder.