public final class ChiSqSelectorModel extends Model<T>
ChiSqSelector.| Modifier and Type | Class and Description |
|---|---|
static class |
ChiSqSelectorModel.ChiSqSelectorModelWriter |
| Modifier and Type | Method and Description |
|---|---|
static scala.Tuple2<int[],double[]> |
compressSparse(int[] indices,
double[] values,
int[] selectedFeatures) |
ChiSqSelectorModel |
copy(ParamMap extra)
Creates a copy of this instance with the same UID and some extra params.
|
DoubleParam |
fdr()
The upper bound of the expected false discovery rate.
|
Param<String> |
featuresCol()
Param for features column name.
|
DoubleParam |
fpr()
The highest p-value for features to be kept.
|
DoubleParam |
fwe()
The upper bound of the expected family-wise error rate.
|
Param<String> |
labelCol()
Param for label column name.
|
static ChiSqSelectorModel |
load(String path) |
IntParam |
numTopFeatures()
Number of features that selector will select, ordered by ascending p-value.
|
Param<String> |
outputCol()
Param for output column name.
|
DoubleParam |
percentile()
Percentile of features that selector will select, ordered by ascending p-value.
|
static StructField |
prepOutputField(StructType schema,
int[] selectedFeatures,
String outputCol,
String featuresCol,
boolean isNumericAttribute)
Prepare the output column field, including per-feature metadata.
|
static MLReader<ChiSqSelectorModel> |
read() |
int[] |
selectedFeatures() |
Param<String> |
selectorType()
The selector type.
|
ChiSqSelectorModel |
setFeaturesCol(String value) |
ChiSqSelectorModel |
setOutputCol(String value) |
String |
toString() |
Dataset<Row> |
transform(Dataset<?> dataset)
Transforms the input dataset.
|
StructType |
transformSchema(StructType schema)
Check transform validity and derive the output schema from the input schema.
|
String |
uid()
An immutable unique ID for the object and its derivatives.
|
MLWriter |
write()
Returns an
MLWriter instance for this ML instance. |
transform, transform, transformparamsgetFdr, getFpr, getFwe, getNumTopFeatures, getPercentile, getSelectorTypegetFeaturesColgetLabelColgetOutputColclear, copyValues, defaultCopy, defaultParamMap, explainParam, explainParams, extractParamMap, extractParamMap, get, getDefault, getOrDefault, getParam, hasDefault, hasParam, isDefined, isSet, onParamChange, paramMap, params, set, set, set, setDefault, setDefault, shouldOwnsave$init$, initializeForcefully, initializeLogIfNecessary, initializeLogIfNecessary, initializeLogIfNecessary$default$2, initLock, isTraceEnabled, log, logDebug, logDebug, logError, logError, logInfo, logInfo, logName, logTrace, logTrace, logWarning, logWarning, org$apache$spark$internal$Logging$$log__$eq, org$apache$spark$internal$Logging$$log_, uninitializepublic static MLReader<ChiSqSelectorModel> read()
public static ChiSqSelectorModel load(String path)
public String uid()
Identifiableuid in interface Identifiablepublic int[] selectedFeatures()
public ChiSqSelectorModel setFeaturesCol(String value)
public ChiSqSelectorModel setOutputCol(String value)
public StructType transformSchema(StructType schema)
PipelineStage
We check validity for interactions between parameters during transformSchema and
raise an exception if any parameter value is invalid. Parameter value checks which
do not depend on other parameters are handled by Param.validate().
Typical implementation should first conduct verification on schema change and parameter validity, including complex parameter interaction checks.
schema - (undocumented)public ChiSqSelectorModel copy(ParamMap extra)
ParamsdefaultCopy().copy in interface Paramscopy in class Model<ChiSqSelectorModel>extra - (undocumented)public MLWriter write()
MLWritableMLWriter instance for this ML instance.public String toString()
toString in interface IdentifiabletoString in class Objectpublic static StructField prepOutputField(StructType schema, int[] selectedFeatures, String outputCol, String featuresCol, boolean isNumericAttribute)
schema - (undocumented)selectedFeatures - (undocumented)outputCol - (undocumented)featuresCol - (undocumented)isNumericAttribute - (undocumented)public static scala.Tuple2<int[],double[]> compressSparse(int[] indices,
double[] values,
int[] selectedFeatures)
public final IntParam numTopFeatures()
SelectorParamsnumTopFeatures in interface SelectorParamspublic final DoubleParam percentile()
SelectorParamspercentile in interface SelectorParamspublic final DoubleParam fpr()
SelectorParamsfpr in interface SelectorParamspublic final DoubleParam fdr()
SelectorParamsfdr in interface SelectorParamspublic final DoubleParam fwe()
SelectorParamsfwe in interface SelectorParamspublic final Param<String> selectorType()
SelectorParamsselectorType in interface SelectorParamspublic final Param<String> outputCol()
HasOutputColoutputCol in interface HasOutputColpublic final Param<String> labelCol()
HasLabelCollabelCol in interface HasLabelColpublic final Param<String> featuresCol()
HasFeaturesColfeaturesCol in interface HasFeaturesColpublic Dataset<Row> transform(Dataset<?> dataset)
Transformertransform in class Transformerdataset - (undocumented)