org.apache.spark.mllib.classification

LogisticRegressionWithSGD

class LogisticRegressionWithSGD extends GeneralizedLinearAlgorithm[LogisticRegressionModel] with Serializable

Train a classification model for Binary Logistic Regression using Stochastic Gradient Descent. By default L2 regularization is used, which can be changed via LogisticRegressionWithSGD.optimizer.

Using LogisticRegressionWithLBFGS is recommended over this.
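To make the description above concrete, here is a minimal plain-Scala sketch of binary logistic regression trained by stochastic gradient descent with an L2 penalty. It is an illustration of the general technique only and does not reproduce Spark's GradientDescent optimizer. The names `LogisticSgdSketch`, `Example`, `train`, and `predict`, and the parameters `numIterations`, `stepSize`, `regParam`, and `seed`, are assumptions made for this sketch and are not part of this class's API.

```scala
import scala.util.Random

object LogisticSgdSketch {
  // Hypothetical stand-in for a labeled point: a label in {0, 1} plus dense features.
  final case class Example(label: Double, features: Array[Double])

  def sigmoid(z: Double): Double = 1.0 / (1.0 + math.exp(-z))

  def dot(a: Array[Double], b: Array[Double]): Double = {
    var sum = 0.0
    var i = 0
    while (i < a.length) { sum += a(i) * b(i); i += 1 }
    sum
  }

  /** Per-example SGD on the logistic loss plus an L2 penalty (regParam / 2) * ||w||^2. */
  def train(
      data: IndexedSeq[Example],
      numIterations: Int,
      stepSize: Double,
      regParam: Double,
      seed: Long): Array[Double] = {
    require(data.nonEmpty, "training data must not be empty")
    val rng = new Random(seed)
    val w = Array.fill(data.head.features.length)(0.0)
    for (_ <- 1 to numIterations) {
      // Pick one example at random and step along its negative gradient.
      val ex = data(rng.nextInt(data.length))
      val err = sigmoid(dot(w, ex.features)) - ex.label
      var j = 0
      while (j < w.length) {
        // Logistic-loss gradient term plus the L2 term regParam * w(j).
        w(j) -= stepSize * (err * ex.features(j) + regParam * w(j))
        j += 1
      }
    }
    w
  }

  /** Predicts 1.0 when the estimated probability exceeds 0.5, else 0.0. */
  def predict(w: Array[Double], features: Array[Double]): Double =
    if (sigmoid(dot(w, features)) > 0.5) 1.0 else 0.0
}
```

For example, training on a handful of one-dimensional points whose label is 1 for positive feature values and 0 for negative ones yields a positive weight, so `predict` separates the two groups. The sketch has no intercept term and uses a constant step size; both are simplifications.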

Annotations
@Since( "0.8.0" )
Source
LogisticRegression.scala
Note

Labels used in Logistic Regression should be {0, 1, ..., k - 1} for a k-class (multiclass) classification problem.

Linear Supertypes
GeneralizedLinearAlgorithm[LogisticRegressionModel], Serializable, Serializable, Logging, AnyRef, Any

Value Members

  1. def getNumFeatures: Int

    The dimension of training features.

    Definition Classes
    GeneralizedLinearAlgorithm
    Annotations
    @Since( "1.4.0" )
  2. def isAddIntercept: Boolean

    Returns whether the algorithm uses addIntercept, i.e. whether an intercept term is added.

    Definition Classes
    GeneralizedLinearAlgorithm
    Annotations
    @Since( "1.4.0" )
  3. val optimizer: GradientDescent

    The optimizer to solve the problem.

    Definition Classes
    LogisticRegressionWithSGD → GeneralizedLinearAlgorithm
    Annotations
    @Since( "0.8.0" )
  4. def run(input: RDD[LabeledPoint], initialWeights: Vector): LogisticRegressionModel

    Run the algorithm with the configured parameters on an input RDD of LabeledPoint entries, starting from the initial weights provided.

    Definition Classes
    GeneralizedLinearAlgorithm
    Annotations
    @Since( "1.0.0" )
  5. def run(input: RDD[LabeledPoint]): LogisticRegressionModel

    Run the algorithm with the configured parameters on an input RDD of LabeledPoint entries.

    Definition Classes
    GeneralizedLinearAlgorithm
    Annotations
    @Since( "0.8.0" )
  6. def setIntercept(addIntercept: Boolean): LogisticRegressionWithSGD.this.type

    Set whether the algorithm should add an intercept. Default false, because adding the intercept causes additional memory allocation.

    Definition Classes
    GeneralizedLinearAlgorithm
    Annotations
    @Since( "0.8.0" )
  7. def setValidateData(validateData: Boolean): LogisticRegressionWithSGD.this.type

    Set whether the algorithm should validate data before training. Default true.

    Definition Classes
    GeneralizedLinearAlgorithm
    Annotations
    @Since( "0.8.0" )
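The setIntercept entry above notes that adding an intercept causes memory allocation. One common way to fit an intercept, sketched below in plain Scala, is to append a constant 1.0 to every feature vector so that the last weight plays the role of the intercept; each appended vector is a fresh copy, which is where the extra allocation comes from. This is a sketch of that general technique under stated assumptions, not a copy of Spark's code. `InterceptSketch`, `appendBias`, and `margin` are hypothetical names introduced for illustration.

```scala
object InterceptSketch {
  /** Hypothetical helper: returns a new array holding the features followed by 1.0. */
  def appendBias(features: Array[Double]): Array[Double] = {
    // A new array of length n + 1 is allocated for every input vector.
    val out = new Array[Double](features.length + 1)
    System.arraycopy(features, 0, out, 0, features.length)
    out(features.length) = 1.0
    out
  }

  /** Computes w . x + b, where b is stored as the last entry of weightsWithIntercept. */
  def margin(weightsWithIntercept: Array[Double], features: Array[Double]): Double = {
    val x = appendBias(features)
    require(x.length == weightsWithIntercept.length, "dimension mismatch")
    var sum = 0.0
    var i = 0
    while (i < x.length) { sum += weightsWithIntercept(i) * x(i); i += 1 }
    sum
  }
}
```

With weights (0.5, -1.0) and intercept 4.0 stored as the third entry, `margin` on features (2.0, 3.0) evaluates 0.5 * 2.0 - 1.0 * 3.0 + 4.0.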