pyspark.sql.functions.var_pop

pyspark.sql.functions.var_pop(col)

Aggregate function: returns the population variance of the values in a group.

New in version 1.6.0.

Changed in version 3.4.0: Supports Spark Connect.

Parameters
col : Column or str

target column to compute on.

Returns
Column

population variance of the given column.

Examples

>>> from pyspark.sql.functions import var_pop
>>> df = spark.range(6)
>>> df.select(var_pop(df.id)).first()
Row(var_pop(id)=2.91666...)
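
The value above can be checked without a Spark session. The following is a minimal sketch in plain Python, not part of the PySpark API: it assumes the six ids 0 through 5 produced by spark.range(6), computes the population variance by hand (dividing by N), and compares it with the standard library. The variable names are illustrative only. The sample variance, which divides by N - 1 and corresponds to var_samp in PySpark, is included for contrast.

```python
import statistics

# Same values that spark.range(6) produces in its id column: 0..5.
values = list(range(6))

# Population variance: mean squared deviation from the mean, dividing by N.
mean = sum(values) / len(values)
pop_var = sum((v - mean) ** 2 for v in values) / len(values)

# The standard library's population variance agrees with the hand computation.
assert abs(pop_var - statistics.pvariance(values)) < 1e-12

# Sample variance divides by N - 1 instead, so it is larger for the same data.
samp_var = statistics.variance(values)

print(pop_var)   # population variance, matches var_pop in the example above
print(samp_var)  # sample variance, the quantity var_samp would report
```

The two results differ by a factor of N / (N - 1), so the gap shrinks as the group grows; for small groups the choice between var_pop and var_samp matters.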