org.apache.spark.ml
Class PredictionModel<FeaturesType,M extends PredictionModel<FeaturesType,M>>

Object
  extended by org.apache.spark.ml.PipelineStage
      extended by org.apache.spark.ml.Transformer
          extended by org.apache.spark.ml.Model<M>
              extended by org.apache.spark.ml.PredictionModel<FeaturesType,M>
All Implemented Interfaces:
java.io.Serializable, Logging, Params
Direct Known Subclasses:
ClassificationModel, DecisionTreeClassificationModel, DecisionTreeRegressionModel, GBTClassificationModel, GBTRegressionModel, RandomForestClassificationModel, RandomForestRegressionModel, RegressionModel

public abstract class PredictionModel<FeaturesType,M extends PredictionModel<FeaturesType,M>>
extends Model<M>

:: DeveloperApi :: Abstraction for a model for prediction tasks (regression and classification).

See Also:
Serialized Form

Constructor Summary
PredictionModel()
           
 
Method Summary
 M setFeaturesCol(String value)
           
 M setPredictionCol(String value)
           
 DataFrame transform(DataFrame dataset)
          Transforms dataset by reading from featuresCol, calling predict(), and storing the predictions as a new column predictionCol.
 StructType transformSchema(StructType schema)
          :: DeveloperApi :: Derives the output schema from the input schema.
 StructType validateAndTransformSchema(StructType schema, boolean fitting, DataType featuresDataType)
          Validates and transforms the input schema with the provided param map.
 
Methods inherited from class org.apache.spark.ml.Model
copy, hasParent, parent, setParent
 
Methods inherited from class org.apache.spark.ml.Transformer
transform, transform, transform
 
Methods inherited from class Object
equals, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
 
Methods inherited from interface org.apache.spark.ml.param.Params
clear, copy, copyValues, defaultCopy, defaultParamMap, explainParam, explainParams, extractParamMap, extractParamMap, get, getDefault, getOrDefault, getParam, hasDefault, hasParam, isDefined, isSet, paramMap, params, set, set, set, setDefault, setDefault, setDefault, shouldOwn, validateParams
 
Methods inherited from interface org.apache.spark.Logging
initializeIfNecessary, initializeLogging, isTraceEnabled, log_, log, logDebug, logDebug, logError, logError, logInfo, logInfo, logName, logTrace, logTrace, logWarning, logWarning
 

Constructor Detail

PredictionModel

public PredictionModel()

Method Detail

setFeaturesCol

public M setFeaturesCol(String value)

setPredictionCol

public M setPredictionCol(String value)
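
Both setters return M, the concrete model type, rather than PredictionModel. The sketch below illustrates why the self-referential bound M extends PredictionModel<FeaturesType,M> makes that useful: chained calls keep the subclass type. It does not use Spark. SketchModel, SketchLinearModel, self(), setThreshold and the default column names are hypothetical names chosen for illustration, and the map-based parameter storage is an assumption, not the library's implementation.

```java
import java.util.HashMap;
import java.util.Map;

// Hypothetical stand-in for PredictionModel; this is not the Spark class.
abstract class SketchModel<M extends SketchModel<M>> {
    private final Map<String, String> params = new HashMap<>();

    SketchModel() {
        // Assumed defaults, for illustration only.
        params.put("featuresCol", "features");
        params.put("predictionCol", "prediction");
    }

    // Each subclass returns itself so setters can return the concrete type M.
    protected abstract M self();

    public M setFeaturesCol(String value) {
        params.put("featuresCol", value);
        return self();
    }

    public M setPredictionCol(String value) {
        params.put("predictionCol", value);
        return self();
    }

    public String getFeaturesCol() { return params.get("featuresCol"); }

    public String getPredictionCol() { return params.get("predictionCol"); }
}

// Hypothetical concrete model with one setter of its own.
final class SketchLinearModel extends SketchModel<SketchLinearModel> {
    private double threshold = 0.0;

    @Override
    protected SketchLinearModel self() { return this; }

    public SketchLinearModel setThreshold(double value) {
        threshold = value;
        return this;
    }

    public double getThreshold() { return threshold; }
}
```

Because setFeaturesCol returns SketchLinearModel here, a call such as new SketchLinearModel().setFeaturesCol("x").setThreshold(0.5) compiles without a cast.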

transformSchema

public StructType transformSchema(StructType schema)
Description copied from class: PipelineStage
:: DeveloperApi ::

Derives the output schema from the input schema.

Specified by:
transformSchema in class PipelineStage
Parameters:
schema - input schema
Returns:
output schema

transform

public DataFrame transform(DataFrame dataset)
Transforms dataset by reading from featuresCol, calling predict(), and storing the predictions as a new column predictionCol.

Specified by:
transform in class Transformer
Parameters:
dataset - input dataset
Returns:
transformed dataset with predictionCol of type Double
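
The three steps described above (read featuresCol, call predict(), store the result in predictionCol) can be sketched without Spark as follows. This is a minimal illustration under stated assumptions: rows are modelled as plain maps instead of a DataFrame, features as double[], and TransformSketch is a hypothetical helper, not part of the library.

```java
import java.util.ArrayList;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;
import java.util.function.Function;

// Hypothetical helper; rows are maps from column name to value (an assumption).
final class TransformSketch {
    private TransformSketch() {}

    // Returns a new list of rows, each with predict(features) stored
    // under predictionCol as a Double. Input rows are not modified.
    static List<Map<String, Object>> transform(
            List<Map<String, Object>> dataset,
            String featuresCol,
            String predictionCol,
            Function<double[], Double> predict) {
        List<Map<String, Object>> out = new ArrayList<>();
        for (Map<String, Object> row : dataset) {
            // Step 1: read the feature value from featuresCol.
            double[] features = (double[]) row.get(featuresCol);
            // Step 2: call predict() on it.
            Double prediction = predict.apply(features);
            // Step 3: copy the row and append the prediction column.
            Map<String, Object> newRow = new LinkedHashMap<>(row);
            newRow.put(predictionCol, prediction);
            out.add(newRow);
        }
        return out;
    }
}
```

Copying each row mirrors the fact that transform returns a new dataset rather than mutating its input.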

validateAndTransformSchema

public StructType validateAndTransformSchema(StructType schema,
                                             boolean fitting,
                                             DataType featuresDataType)
Validates and transforms the input schema with the provided param map.

Parameters:
schema - input schema
fitting - true if the schema is being validated during fitting (training), false if during transformation
featuresDataType - SQL DataType for FeaturesType. E.g., VectorUDT for vector features.
Returns:
output schema
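
As a rough sketch of what such a validation step might look like, the example below checks the features column against featuresDataType and appends a double-typed prediction column. It does not use Spark: the schema is modelled as a map from column name to type name, and SchemaSketch, requireType and the extra column-name arguments are hypothetical. Checking the label column only when fitting is true, and rejecting an already existing prediction column, are assumptions about the behaviour rather than statements from this page.

```java
import java.util.LinkedHashMap;
import java.util.Map;

// Hypothetical helper; a schema is modelled as column name -> type name.
final class SchemaSketch {
    private SchemaSketch() {}

    static Map<String, String> validateAndTransformSchema(
            Map<String, String> schema,
            boolean fitting,
            String featuresDataType,
            String featuresCol,
            String labelCol,
            String predictionCol) {
        // The features column must exist and have the expected type.
        requireType(schema, featuresCol, featuresDataType);
        if (fitting) {
            // Assumption: labels are only needed, and checked, while fitting.
            requireType(schema, labelCol, "double");
        }
        // Assumption: an existing prediction column is treated as an error.
        if (schema.containsKey(predictionCol)) {
            throw new IllegalArgumentException(
                "Column already exists: " + predictionCol);
        }
        // The output schema is the input schema plus the prediction column.
        Map<String, String> out = new LinkedHashMap<>(schema);
        out.put(predictionCol, "double");
        return out;
    }

    private static void requireType(
            Map<String, String> schema, String column, String expected) {
        String actual = schema.get(column);
        if (!expected.equals(actual)) {
            throw new IllegalArgumentException(
                "Column " + column + " must be of type " + expected
                    + " but was " + actual);
        }
    }
}
```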