org.apache.spark.ml.evaluation.Evaluator

org.apache.spark.ml.evaluation.MulticlassClassificationEvaluator

All Implemented Interfaces:: Serializable, Params, HasLabelCol, HasPredictionCol, HasProbabilityCol, HasWeightCol, DefaultParamsWritable, Identifiable, MLWritable

public class MulticlassClassificationEvaluator extends Evaluator implements HasPredictionCol, HasLabelCol, HasWeightCol, HasProbabilityCol, DefaultParamsWritable

Evaluator for multiclass classification, which expects input columns: prediction, label, weight (optional) and probability (only for logLoss).

See Also:

Serialized Form

Constructor Summary

Constructors

Constructor

Description

MulticlassClassificationEvaluator()

MulticlassClassificationEvaluator(String uid)
Method Summary

Modifier and Type

Method

Description

final DoubleParam

beta()

The beta value, which controls precision vs recall weighting, used in "weightedFMeasure", "fMeasureByLabel".

MulticlassClassificationEvaluator

copy(ParamMap extra)

Creates a copy of this instance with the same UID and some extra params.

final DoubleParam

eps()

param for eps.

double

evaluate(Dataset<?> dataset)

Evaluates model output and returns a scalar metric.

double

getBeta()

double

getEps()

double

getMetricLabel()

String

getMetricName()

MulticlassMetrics

getMetrics(Dataset<?> dataset)

Get a MulticlassMetrics, which can be used to get multiclass classification metrics such as accuracy, weightedPrecision, etc.

boolean

isLargerBetter()

Indicates whether the metric returned by evaluate should be maximized (true, default) or minimized (false).

final Param<String>

labelCol()

Param for label column name.

static MulticlassClassificationEvaluator

load(String path)

final DoubleParam

metricLabel()

The class whose metric will be computed in "truePositiveRateByLabel", "falsePositiveRateByLabel", "precisionByLabel", "recallByLabel", "fMeasureByLabel".

Param<String>

metricName()

param for metric name in evaluation (supports "f1" (default), "accuracy", "weightedPrecision", "weightedRecall", "weightedTruePositiveRate", "weightedFalsePositiveRate", "weightedFMeasure", "truePositiveRateByLabel", "falsePositiveRateByLabel", "precisionByLabel", "recallByLabel", "fMeasureByLabel", "logLoss", "hammingLoss")

final Param<String>

predictionCol()

Param for prediction column name.

final Param<String>

probabilityCol()

Param for Column name for predicted class conditional probabilities.

static MLReader<T>

read()

MulticlassClassificationEvaluator

setBeta(double value)

MulticlassClassificationEvaluator

setEps(double value)

MulticlassClassificationEvaluator

setLabelCol(String value)

MulticlassClassificationEvaluator

setMetricLabel(double value)

MulticlassClassificationEvaluator

setMetricName(String value)

MulticlassClassificationEvaluator

setPredictionCol(String value)

MulticlassClassificationEvaluator

setProbabilityCol(String value)

MulticlassClassificationEvaluator

setWeightCol(String value)

String

toString()

String

uid()

An immutable unique ID for the object and its derivatives.

final Param<String>

weightCol()

Param for weight column name.

Methods inherited from class org.apache.spark.ml.evaluation.Evaluator
evaluate, params

Methods inherited from class java.lang.Object
equals, getClass, hashCode, notify, notifyAll, wait, wait, wait

Methods inherited from interface org.apache.spark.ml.util.DefaultParamsWritable
write

Methods inherited from interface org.apache.spark.ml.param.shared.HasLabelCol
getLabelCol

Methods inherited from interface org.apache.spark.ml.param.shared.HasPredictionCol
getPredictionCol

Methods inherited from interface org.apache.spark.ml.param.shared.HasProbabilityCol
getProbabilityCol

Methods inherited from interface org.apache.spark.ml.param.shared.HasWeightCol
getWeightCol

Methods inherited from interface org.apache.spark.ml.util.MLWritable
save

Methods inherited from interface org.apache.spark.ml.param.Params
clear, copyValues, defaultCopy, defaultParamMap, explainParam, explainParams, extractParamMap, extractParamMap, get, getDefault, getOrDefault, getParam, hasDefault, hasParam, isDefined, isSet, onParamChange, paramMap, params, set, set, set, setDefault, setDefault, shouldOwn

Constructor Details
- MulticlassClassificationEvaluator
  
  public MulticlassClassificationEvaluator(String uid)
- MulticlassClassificationEvaluator
  
  public MulticlassClassificationEvaluator()
Method Details
- load
  
  public static MulticlassClassificationEvaluator load(String path)
- read
  
  public static MLReader<T> read()
- probabilityCol
  
  public final Param<String> probabilityCol()
  
  Description copied from interface: HasProbabilityCol
  
  Param for Column name for predicted class conditional probabilities. Note: Not all models output well-calibrated probability estimates! These probabilities should be treated as confidences, not precise probabilities.
  
  Specified by:
  
  probabilityCol in interface HasProbabilityCol
  
  Returns:
  
  (undocumented)
- weightCol
  
  public final Param<String> weightCol()
  
  Description copied from interface: HasWeightCol
  
  Param for weight column name. If this is not set or empty, we treat all instance weights as 1.0.
  
  Specified by:
  
  weightCol in interface HasWeightCol
  
  Returns:
  
  (undocumented)
- labelCol
  
  public final Param<String> labelCol()
  
  Description copied from interface: HasLabelCol
  
  Param for label column name.
  
  Specified by:
  
  labelCol in interface HasLabelCol
  
  Returns:
  
  (undocumented)
- predictionCol
  
  public final Param<String> predictionCol()
  
  Description copied from interface: HasPredictionCol
  
  Param for prediction column name.
  
  Specified by:
  
  predictionCol in interface HasPredictionCol
  
  Returns:
  
  (undocumented)
- uid
  
  public String uid()
  
  Description copied from interface: Identifiable
  
  An immutable unique ID for the object and its derivatives.
  
  Specified by:
  
  uid in interface Identifiable
  
  Returns:
  
  (undocumented)
- metricName
  
  public Param<String> metricName()
  
  param for metric name in evaluation (supports "f1" (default), "accuracy", "weightedPrecision", "weightedRecall", "weightedTruePositiveRate", "weightedFalsePositiveRate", "weightedFMeasure", "truePositiveRateByLabel", "falsePositiveRateByLabel", "precisionByLabel", "recallByLabel", "fMeasureByLabel", "logLoss", "hammingLoss")
  
  Returns:
  
  (undocumented)
- getMetricName
  
  public String getMetricName()
- setMetricName
  
  public MulticlassClassificationEvaluator setMetricName(String value)
- setPredictionCol
  
  public MulticlassClassificationEvaluator setPredictionCol(String value)
- setLabelCol
  
  public MulticlassClassificationEvaluator setLabelCol(String value)
- setWeightCol
  
  public MulticlassClassificationEvaluator setWeightCol(String value)
- setProbabilityCol
  
  public MulticlassClassificationEvaluator setProbabilityCol(String value)
- metricLabel
  
  public final DoubleParam metricLabel()
  
  The class whose metric will be computed in "truePositiveRateByLabel", "falsePositiveRateByLabel", "precisionByLabel", "recallByLabel", "fMeasureByLabel". Must be greater than or equal to 0. The default value is 0.
  
  Returns:
  
  (undocumented)
- getMetricLabel
  
  public double getMetricLabel()
- setMetricLabel
  
  public MulticlassClassificationEvaluator setMetricLabel(double value)
- beta
  
  public final DoubleParam beta()
  
  The beta value, which controls precision vs recall weighting, used in "weightedFMeasure", "fMeasureByLabel". Must be greater than 0. The default value is 1.
  
  Returns:
  
  (undocumented)
- getBeta
  
  public double getBeta()
- setBeta
  
  public MulticlassClassificationEvaluator setBeta(double value)
- eps
  
  public final DoubleParam eps()
  
  param for eps. log-loss is undefined for p=0 or p=1, so probabilities are clipped to max(eps, min(1 - eps, p)). Must be in range (0, 0.5). The default value is 1e-15.
  
  Returns:
  
  (undocumented)
- getEps
  
  public double getEps()
- setEps
  
  public MulticlassClassificationEvaluator setEps(double value)
- evaluate
  
  public double evaluate(Dataset<?> dataset)
  
  Description copied from class: Evaluator
  
  Evaluates model output and returns a scalar metric. The value of Evaluator.isLargerBetter() specifies whether larger values are better.
  
  Specified by:
  
  evaluate in class Evaluator
  
  Parameters:
  
  dataset - a dataset that contains labels/observations and predictions.
  
  Returns:
  
  metric
- getMetrics
  
  public MulticlassMetrics getMetrics(Dataset<?> dataset)
  
  Get a MulticlassMetrics, which can be used to get multiclass classification metrics such as accuracy, weightedPrecision, etc.
  
  Parameters:
  
  dataset - a dataset that contains labels/observations and predictions.
  
  Returns:
  
  MulticlassMetrics
- isLargerBetter
  
  public boolean isLargerBetter()
  
  Description copied from class: Evaluator
  
  Indicates whether the metric returned by evaluate should be maximized (true, default) or minimized (false). A given evaluator may support multiple metrics which may be maximized or minimized.
  
  Overrides:
  
  isLargerBetter in class Evaluator
  
  Returns:
  
  (undocumented)
- copy
  
  public MulticlassClassificationEvaluator copy(ParamMap extra)
  
  Description copied from interface: Params
  
  Creates a copy of this instance with the same UID and some extra params. Subclasses should implement this method and set the return type properly. See defaultCopy().
  
  Specified by:
  
  copy in interface Params
  
  Specified by:
  
  copy in class Evaluator
  
  Parameters:
  
  extra - (undocumented)
  
  Returns:
  
  (undocumented)
- toString
  
  public String toString()
  
  Specified by:
  
  toString in interface Identifiable
  
  Overrides:
  
  toString in class Object

Class MulticlassClassificationEvaluator

Constructor Summary

Method Summary

Methods inherited from class org.apache.spark.ml.evaluation.Evaluator

Methods inherited from class java.lang.Object

Methods inherited from interface org.apache.spark.ml.util.DefaultParamsWritable

Methods inherited from interface org.apache.spark.ml.param.shared.HasLabelCol

Methods inherited from interface org.apache.spark.ml.param.shared.HasPredictionCol

Methods inherited from interface org.apache.spark.ml.param.shared.HasProbabilityCol

Methods inherited from interface org.apache.spark.ml.param.shared.HasWeightCol

Methods inherited from interface org.apache.spark.ml.util.MLWritable

Methods inherited from interface org.apache.spark.ml.param.Params

Constructor Details

MulticlassClassificationEvaluator

MulticlassClassificationEvaluator

Method Details

load

read

probabilityCol

weightCol

labelCol

predictionCol

uid

metricName

getMetricName

setMetricName

setPredictionCol

setLabelCol

setWeightCol

setProbabilityCol

metricLabel

getMetricLabel

setMetricLabel

beta

getBeta

setBeta

eps

getEps

setEps

evaluate

getMetrics

isLargerBetter

copy

toString