approxCountDistinct {SparkR}    R Documentation

approxCountDistinct

Description

Aggregate function: returns the approximate number of distinct items in a group. The optional rsd argument sets the maximum estimation error allowed (default 0.05).

Usage

## S4 method for signature 'Column'
approxCountDistinct(x, rsd = 0.05)

approxCountDistinct(x, ...)

Value

the approximate number of distinct items in a group.

See Also

Other agg_funcs: agg, summarize; avg; countDistinct, n_distinct; count, n; first; kurtosis; last; max; mean; min; sd, stddev; skewness; stddev_pop; stddev_samp; sumDistinct; sum; var_pop; var_samp; var, variance

Examples

## Not run: 
approxCountDistinct(df$c)        # default rsd = 0.05
approxCountDistinct(df$c, 0.02)  # smaller maximum estimation error
## End(Not run)
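
The calls above return a Column expression, so they only produce a value once used inside an aggregation. The following is a minimal sketch of that usage, assuming a local Spark 1.6.x installation. The data frame, the column names g and c, the result name approx_c, and the application name are hypothetical and chosen only for illustration.

```r
library(SparkR)

# Assumption: Spark 1.6.x is installed locally; sc and sqlContext are
# created here only for the purpose of this sketch.
sc <- sparkR.init(master = "local[2]", appName = "approxCountDistinctSketch")
sqlContext <- sparkRSQL.init(sc)

# Hypothetical data: a grouping column g and a value column c.
local_df <- data.frame(g = c("a", "a", "a", "a", "b", "b", "b"),
                       c = c(1, 2, 2, 3, 7, 7, 8),
                       stringsAsFactors = FALSE)
df <- createDataFrame(sqlContext, local_df)

# Estimate over the whole DataFrame, using the default rsd of 0.05.
overall <- collect(agg(df, approx_c = approxCountDistinct(df$c)))

# Estimate per group, requesting a smaller maximum estimation error.
per_group <- collect(agg(groupBy(df, df$g),
                         approx_c = approxCountDistinct(df$c, 0.02)))

sparkR.stop()
```

Both results are estimates, so they should not be relied on to match countDistinct exactly. A smaller rsd generally trades more memory for a tighter estimate.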

[Package SparkR version 1.6.1 Index]