Interface LSHParams

All Superinterfaces:
HasInputCol, HasOutputCol, Identifiable, Params, Serializable
All Known Implementing Classes:
BucketedRandomProjectionLSH, BucketedRandomProjectionLSHModel, MinHashLSH, MinHashLSHModel

public interface LSHParams extends HasInputCol, HasOutputCol
Params for LSH.
  • Method Details

    • getNumHashTables

      int getNumHashTables()
    • numHashTables

      IntParam numHashTables()
      Param for the number of hash tables used in LSH OR-amplification.

      LSH OR-amplification can be used to reduce the false negative rate. Higher values for this param lead to a reduced false negative rate, at the expense of added computational complexity.

      Returns:
      (undocumented)
    • validateAndTransformSchema

      StructType validateAndTransformSchema(StructType schema)
      Transform the Schema for LSH
      Parameters:
      schema - The schema of the input dataset without HasOutputCol.outputCol().
      Returns:
      A derived schema with HasOutputCol.outputCol() added.