Package

org.apache.spark.ml

tuning

Permalink

package tuning

Visibility
  1. Public
  2. All

Type Members

  1. class CrossValidator extends Estimator[CrossValidatorModel] with CrossValidatorParams with HasParallelism with HasCollectSubModels with MLWritable with Logging

    Permalink

    K-fold cross validation performs model selection by splitting the dataset into a set of non-overlapping randomly partitioned folds which are used as separate training and test datasets e.g., with k=3 folds, K-fold cross validation will generate 3 (training, test) dataset pairs, each of which uses 2/3 of the data for training and 1/3 for testing.

    K-fold cross validation performs model selection by splitting the dataset into a set of non-overlapping randomly partitioned folds which are used as separate training and test datasets e.g., with k=3 folds, K-fold cross validation will generate 3 (training, test) dataset pairs, each of which uses 2/3 of the data for training and 1/3 for testing. Each fold is used as the test set exactly once.

    Annotations
    @Since( "1.2.0" )
  2. class CrossValidatorModel extends Model[CrossValidatorModel] with CrossValidatorParams with MLWritable

    Permalink

    CrossValidatorModel contains the model with the highest average cross-validation metric across folds and uses this model to transform input data.

    CrossValidatorModel contains the model with the highest average cross-validation metric across folds and uses this model to transform input data. CrossValidatorModel also tracks the metrics for each param map evaluated.

    Annotations
    @Since( "1.2.0" )
  3. class ParamGridBuilder extends AnyRef

    Permalink

    Builder for a param grid used in grid search-based model selection.

    Builder for a param grid used in grid search-based model selection.

    Annotations
    @Since( "1.2.0" )
  4. class TrainValidationSplit extends Estimator[TrainValidationSplitModel] with TrainValidationSplitParams with HasParallelism with HasCollectSubModels with MLWritable with Logging

    Permalink

    Validation for hyper-parameter tuning.

    Validation for hyper-parameter tuning. Randomly splits the input dataset into train and validation sets, and uses evaluation metric on the validation set to select the best model. Similar to CrossValidator, but only splits the set once.

    Annotations
    @Since( "1.5.0" )
  5. class TrainValidationSplitModel extends Model[TrainValidationSplitModel] with TrainValidationSplitParams with MLWritable

    Permalink

    Model from train validation split.

    Model from train validation split.

    Annotations
    @Since( "1.5.0" )

Value Members

  1. object CrossValidator extends MLReadable[CrossValidator] with Serializable

    Permalink
    Annotations
    @Since( "1.6.0" )
  2. object CrossValidatorModel extends MLReadable[CrossValidatorModel] with Serializable

    Permalink
    Annotations
    @Since( "1.6.0" )
  3. object TrainValidationSplit extends MLReadable[TrainValidationSplit] with Serializable

    Permalink
    Annotations
    @Since( "2.0.0" )
  4. object TrainValidationSplitModel extends MLReadable[TrainValidationSplitModel] with Serializable

    Permalink
    Annotations
    @Since( "2.0.0" )

Members