Package org.apache.spark.ml.feature
Interface LSHParams
- All Superinterfaces:
HasInputCol
,HasOutputCol
,Identifiable
,Params
,Serializable
- All Known Implementing Classes:
BucketedRandomProjectionLSH
,BucketedRandomProjectionLSHModel
,MinHashLSH
,MinHashLSHModel
Params for
LSH
.-
Method Summary
Modifier and TypeMethodDescriptionint
Param for the number of hash tables used in LSH OR-amplification.validateAndTransformSchema
(StructType schema) Transform the Schema for LSHMethods inherited from interface org.apache.spark.ml.param.shared.HasInputCol
getInputCol, inputCol
Methods inherited from interface org.apache.spark.ml.param.shared.HasOutputCol
getOutputCol, outputCol
Methods inherited from interface org.apache.spark.ml.util.Identifiable
toString, uid
Methods inherited from interface org.apache.spark.ml.param.Params
clear, copy, copyValues, defaultCopy, defaultParamMap, explainParam, explainParams, extractParamMap, extractParamMap, get, getDefault, getOrDefault, getParam, hasDefault, hasParam, isDefined, isSet, onParamChange, paramMap, params, set, set, set, setDefault, setDefault, shouldOwn
-
Method Details
-
getNumHashTables
int getNumHashTables() -
numHashTables
IntParam numHashTables()Param for the number of hash tables used in LSH OR-amplification.LSH OR-amplification can be used to reduce the false negative rate. Higher values for this param lead to a reduced false negative rate, at the expense of added computational complexity.
- Returns:
- (undocumented)
-
validateAndTransformSchema
Transform the Schema for LSH- Parameters:
schema
- The schema of the input dataset withoutHasOutputCol.outputCol()
.- Returns:
- A derived schema with
HasOutputCol.outputCol()
added.
-