Package org.apache.spark.util
Class StatCounter
Object
org.apache.spark.util.StatCounter
- All Implemented Interfaces:
Serializable
A class for tracking the statistics of a set of numbers (count, mean and variance) in a numerically robust way. Includes support for merging two StatCounters. Based on Welford and Chan's algorithms for running variance.
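The description above names Welford's algorithm for running variance. As a rough illustration of the idea (not Spark's actual source), the sketch below keeps a count, a running mean, and a running sum of squared deviations, and folds in one value at a time. The class name `WelfordSketch`, its field names, and the choice to return NaN when there are too few values are assumptions made for this example.

```java
// Illustrative sketch only: a hypothetical class, not Spark's StatCounter source.
final class WelfordSketch {
    private long n = 0;       // number of values seen so far
    private double mu = 0.0;  // running mean
    private double m2 = 0.0;  // running sum of squared deviations from the mean

    /** Fold one value into the running statistics (Welford's update). */
    WelfordSketch add(double value) {
        n += 1;
        double delta = value - mu;    // deviation from the old mean
        mu += delta / n;              // new mean
        m2 += delta * (value - mu);   // uses both old and new mean
        return this;
    }

    long count() { return n; }

    double mean() { return mu; }

    /** Population variance: divide by N. NaN for an empty set (assumed behavior). */
    double popVariance() { return n == 0 ? Double.NaN : m2 / n; }

    /** Sample variance: divide by N-1. NaN for fewer than two values (assumed behavior). */
    double sampleVariance() { return n <= 1 ? Double.NaN : m2 / (n - 1); }

    public static void main(String[] args) {
        WelfordSketch s = new WelfordSketch();
        for (double v : new double[] {2, 4, 4, 4, 5, 5, 7, 9}) {
            s.add(v);
        }
        System.out.println("count=" + s.count()
                + " mean=" + s.mean()
                + " popVariance=" + s.popVariance()
                + " sampleVariance=" + s.sampleVariance());
    }
}
```

The update never forms the sum of squares directly, which is what avoids the cancellation error of the textbook `E[x^2] - E[x]^2` formula when the mean is large relative to the spread.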
-
Constructor Summary
Constructors

| Constructor | Description |
| --- | --- |
| StatCounter() | Initialize the StatCounter with no values. |
| StatCounter(scala.collection.IterableOnce<Object> values) | |

-
Method Summary
| Modifier and Type | Method | Description |
| --- | --- | --- |
| static StatCounter | apply | Build a StatCounter from a list of values passed as variable-length arguments. |
| static StatCounter | apply | Build a StatCounter from a list of values. |
| StatCounter | copy() | Clone this StatCounter. |
| long | count() | |
| double | max() | |
| double | mean() | |
| StatCounter | merge(double value) | Add a value into this StatCounter, updating the internal statistics. |
| StatCounter | merge(StatCounter other) | Merge another StatCounter into this one, adding up the internal statistics. |
| StatCounter | merge | Add multiple values into this StatCounter, updating the internal statistics. |
| double | min() | |
| double | popStdev() | Return the population standard deviation of the values. |
| double | popVariance() | Return the population variance of the values. |
| double | sampleStdev() | Return the sample standard deviation of the values, which corrects for bias in estimating the variance by dividing by N-1 instead of N. |
| double | sampleVariance() | Return the sample variance, which corrects for bias in estimating the variance by dividing by N-1 instead of N. |
| double | stdev() | Return the population standard deviation of the values. |
| double | sum() | |
| String | toString() | |
| double | variance() | Return the population variance of the values. |
-
Constructor Details
-
StatCounter
-
StatCounter
public StatCounter()
Initialize the StatCounter with no values.
-
-
Method Details
-
apply
Build a StatCounter from a list of values. -
apply
Build a StatCounter from a list of values passed as variable-length arguments. -
merge
Add a value into this StatCounter, updating the internal statistics. -
merge
Add multiple values into this StatCounter, updating the internal statistics. -
merge
Merge another StatCounter into this one, adding up the internal statistics. -
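Merging two sets of running statistics is the part usually attributed to Chan et al.: two partial summaries (count, mean, sum of squared deviations) can be combined without revisiting the underlying values. The sketch below is a hypothetical, self-contained illustration of that combination rule, not Spark's implementation; `PartialStats`, `of`, and `combine` are names invented for this example.

```java
// Illustrative sketch only: a hypothetical class, not Spark's StatCounter source.
final class PartialStats {
    final long n;       // number of values summarized
    final double mean;  // mean of those values
    final double m2;    // sum of squared deviations from the mean

    PartialStats(long n, double mean, double m2) {
        this.n = n;
        this.mean = mean;
        this.m2 = m2;
    }

    /** Summarize an array with a plain two-pass computation (kept simple on purpose). */
    static PartialStats of(double... values) {
        long n = values.length;
        double sum = 0.0;
        for (double v : values) sum += v;
        double mean = n == 0 ? 0.0 : sum / n;
        double m2 = 0.0;
        for (double v : values) m2 += (v - mean) * (v - mean);
        return new PartialStats(n, mean, m2);
    }

    /** Combine two summaries using the pairwise (Chan-style) update. */
    PartialStats combine(PartialStats other) {
        if (other.n == 0) return this;
        if (n == 0) return other;
        long total = n + other.n;
        double delta = other.mean - mean;
        double newMean = mean + delta * other.n / total;
        // The cross term accounts for the gap between the two partial means.
        double newM2 = m2 + other.m2 + delta * delta * n * other.n / total;
        return new PartialStats(total, newMean, newM2);
    }

    double popVariance() { return n == 0 ? Double.NaN : m2 / n; }

    public static void main(String[] args) {
        PartialStats left = PartialStats.of(2, 4, 4, 4);
        PartialStats right = PartialStats.of(5, 5, 7, 9);
        PartialStats all = left.combine(right);
        System.out.println("n=" + all.n + " mean=" + all.mean
                + " popVariance=" + all.popVariance());
    }
}
```

Because the combination depends only on the three summary fields, partial results computed on separate partitions can be merged in any grouping, which is what makes this style of counter a natural fit for distributed aggregation.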
copy
Clone this StatCounter. -
count
public long count() -
mean
public double mean() -
sum
public double sum() -
max
public double max() -
min
public double min() -
variance
public double variance()
Return the population variance of the values. -
popVariance
public double popVariance()
Return the population variance of the values.
-
sampleVariance
public double sampleVariance()
Return the sample variance, which corrects for bias in estimating the variance by dividing by N-1 instead of N.
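For reference, the population and sample variants described on this page differ only in the divisor. The notation below is this note's own (it does not appear in the Spark documentation): $N$ is the number of values, $\bar{x}$ their mean, and $M_2$ the sum of squared deviations from the mean.

```latex
M_2 = \sum_{i=1}^{N} (x_i - \bar{x})^2,
\qquad
\sigma^2_{\text{pop}} = \frac{M_2}{N},
\qquad
s^2_{\text{sample}} = \frac{M_2}{N - 1}
```

As a worked example, for the values 2, 4, 4, 4, 5, 5, 7, 9 the mean is 5 and $M_2 = 32$, so the population variance is $32/8 = 4$ and the sample variance is $32/7 \approx 4.571$. The corresponding standard deviations are the square roots of these two quantities.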
-
stdev
public double stdev()
Return the population standard deviation of the values. -
popStdev
public double popStdev()
Return the population standard deviation of the values.
-
sampleStdev
public double sampleStdev()
Return the sample standard deviation of the values, which corrects for bias in estimating the variance by dividing by N-1 instead of N.
-
toString
-