public interface SimpleMetricsCachedBatch extends CachedBatch
CachedBatch
that stores some simple metrics that can be used for filtering of batches with
the SimpleMetricsCachedBatchSerializer
.
The metrics are returned by the stats value. For each column in the batch 5 columns of metadata
are needed in the row.Modifier and Type | Method and Description |
---|---|
long |
sizeInBytes() |
org.apache.spark.sql.catalyst.InternalRow |
stats()
Holds stats for each cached column.
|
numRows
long sizeInBytes()
sizeInBytes
in interface CachedBatch
org.apache.spark.sql.catalyst.InternalRow stats()
upperBound
and lowerBound
should be
of the same type as the original column. If they are null, then it is assumed that they
are not provided, and will not be used for filtering.
upperBound
(optional)lowerBound
(Optional)nullCount
: Int
rowCount
: Int
sizeInBytes
: Long