class MultivariateOnlineSummarizer extends MultivariateStatisticalSummary with Serializable
MultivariateOnlineSummarizer implements MultivariateStatisticalSummary to compute the mean, variance, minimum, maximum, counts, and nonzero counts for instances in sparse or dense vector format in an online fashion.
Two MultivariateOnlineSummarizer can be merged together to have a statistical summary of the corresponding joint dataset.
A numerically stable algorithm is implemented to compute the mean and variance of instances: Reference: variance-wiki Zero elements (including explicit zero values) are skipped when calling add(), to have time complexity O(nnz) instead of O(n) for each column.
For weighted instances, the unbiased estimation of variance is defined by the reliability weights: see Reliability weights (Wikipedia).
- Annotations
- @Since( "1.1.0" )
- Source
- MultivariateOnlineSummarizer.scala
- Alphabetic
- By Inheritance
- MultivariateOnlineSummarizer
- Serializable
- Serializable
- MultivariateStatisticalSummary
- AnyRef
- Any
- Hide All
- Show All
- Public
- All
Instance Constructors
- new MultivariateOnlineSummarizer()
Value Members
-
def
add(sample: Vector): MultivariateOnlineSummarizer.this.type
Add a new sample to this summarizer, and update the statistical summary.
Add a new sample to this summarizer, and update the statistical summary.
- sample
The sample in dense/sparse vector format to be added into this summarizer.
- returns
This MultivariateOnlineSummarizer object.
- Annotations
- @Since( "1.1.0" )
-
def
count: Long
Sample size.
Sample size.
- Definition Classes
- MultivariateOnlineSummarizer → MultivariateStatisticalSummary
- Annotations
- @Since( "1.1.0" )
-
def
max: Vector
Maximum value of each dimension.
Maximum value of each dimension.
- Definition Classes
- MultivariateOnlineSummarizer → MultivariateStatisticalSummary
- Annotations
- @Since( "1.1.0" )
-
def
mean: Vector
Sample mean of each dimension.
Sample mean of each dimension.
- Definition Classes
- MultivariateOnlineSummarizer → MultivariateStatisticalSummary
- Annotations
- @Since( "1.1.0" )
-
def
merge(other: MultivariateOnlineSummarizer): MultivariateOnlineSummarizer.this.type
Merge another MultivariateOnlineSummarizer, and update the statistical summary.
Merge another MultivariateOnlineSummarizer, and update the statistical summary. (Note that it's in place merging; as a result,
this
object will be modified.)- other
The other MultivariateOnlineSummarizer to be merged.
- returns
This MultivariateOnlineSummarizer object.
- Annotations
- @Since( "1.1.0" )
-
def
min: Vector
Minimum value of each dimension.
Minimum value of each dimension.
- Definition Classes
- MultivariateOnlineSummarizer → MultivariateStatisticalSummary
- Annotations
- @Since( "1.1.0" )
-
def
normL1: Vector
L1 norm of each dimension.
L1 norm of each dimension.
- Definition Classes
- MultivariateOnlineSummarizer → MultivariateStatisticalSummary
- Annotations
- @Since( "1.2.0" )
-
def
normL2: Vector
L2 (Euclidean) norm of each dimension.
L2 (Euclidean) norm of each dimension.
- Definition Classes
- MultivariateOnlineSummarizer → MultivariateStatisticalSummary
- Annotations
- @Since( "1.2.0" )
-
def
numNonzeros: Vector
Number of nonzero elements in each dimension.
Number of nonzero elements in each dimension.
- Definition Classes
- MultivariateOnlineSummarizer → MultivariateStatisticalSummary
- Annotations
- @Since( "1.1.0" )
-
def
variance: Vector
Unbiased estimate of sample variance of each dimension.
Unbiased estimate of sample variance of each dimension.
- Definition Classes
- MultivariateOnlineSummarizer → MultivariateStatisticalSummary
- Annotations
- @Since( "1.1.0" )
-
def
weightSum: Double
Sum of weights.
Sum of weights.
- Definition Classes
- MultivariateOnlineSummarizer → MultivariateStatisticalSummary