public abstract class PipelineStage extends Object implements Params, org.apache.spark.internal.Logging
Estimator
or a Transformer
.Constructor and Description |
---|
PipelineStage() |
Modifier and Type | Method and Description |
---|---|
abstract PipelineStage |
copy(ParamMap extra)
Creates a copy of this instance with the same UID and some extra params.
|
Param<?>[] |
params()
Returns all params sorted by their names.
|
abstract StructType |
transformSchema(StructType schema)
Check transform validity and derive the output schema from the input schema.
|
equals, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
clear, copyValues, defaultCopy, explainParam, explainParams, extractParamMap, extractParamMap, get, getDefault, getOrDefault, getParam, hasDefault, hasParam, isDefined, isSet, onParamChange, set, set, set, setDefault, setDefault, shouldOwn
toString, uid
$init$, initializeForcefully, initializeLogIfNecessary, initializeLogIfNecessary, initializeLogIfNecessary$default$2, initLock, isTraceEnabled, log, logDebug, logDebug, logError, logError, logInfo, logInfo, logName, logTrace, logTrace, logWarning, logWarning, org$apache$spark$internal$Logging$$log__$eq, org$apache$spark$internal$Logging$$log_, uninitialize
public abstract PipelineStage copy(ParamMap extra)
Params
defaultCopy()
.public Param<?>[] params()
Params
Param
.
public abstract StructType transformSchema(StructType schema)
We check validity for interactions between parameters during transformSchema
and
raise an exception if any parameter value is invalid. Parameter value checks which
do not depend on other parameters are handled by Param.validate()
.
Typical implementation should first conduct verification on schema change and parameter validity, including complex parameter interaction checks.
schema
- (undocumented)