public class GeneralizedLinearRegression extends Regressor<Vector,GeneralizedLinearRegression,GeneralizedLinearRegressionModel> implements GeneralizedLinearRegressionBase, DefaultParamsWritable, org.apache.spark.internal.Logging
| Modifier and Type | Class and Description |
|---|---|
static class |
GeneralizedLinearRegression.Binomial$
Binomial exponential family distribution.
|
static class |
GeneralizedLinearRegression.CLogLog$ |
static class |
GeneralizedLinearRegression.Family$ |
static class |
GeneralizedLinearRegression.FamilyAndLink$ |
static class |
GeneralizedLinearRegression.Gamma$
Gamma exponential family distribution.
|
static class |
GeneralizedLinearRegression.Gaussian$
Gaussian exponential family distribution.
|
static class |
GeneralizedLinearRegression.Identity$ |
static class |
GeneralizedLinearRegression.Inverse$ |
static class |
GeneralizedLinearRegression.Link$ |
static class |
GeneralizedLinearRegression.Log$ |
static class |
GeneralizedLinearRegression.Logit$ |
static class |
GeneralizedLinearRegression.Poisson$
Poisson exponential family distribution.
|
static class |
GeneralizedLinearRegression.Probit$ |
static class |
GeneralizedLinearRegression.Sqrt$ |
static class |
GeneralizedLinearRegression.Tweedie$ |
| Constructor and Description |
|---|
GeneralizedLinearRegression() |
GeneralizedLinearRegression(String uid) |
| Modifier and Type | Method and Description |
|---|---|
IntParam |
aggregationDepth()
Param for suggested depth for treeAggregate (>= 2).
|
GeneralizedLinearRegression |
copy(ParamMap extra)
Creates a copy of this instance with the same UID and some extra params.
|
Param<String> |
family()
Param for the name of family which is a description of the error distribution
to be used in the model.
|
BooleanParam |
fitIntercept()
Param for whether to fit an intercept term.
|
Param<String> |
link()
Param for the name of link function which provides the relationship
between the linear predictor and the mean of the distribution function.
|
DoubleParam |
linkPower()
Param for the index in the power link function.
|
Param<String> |
linkPredictionCol()
Param for link prediction (linear predictor) column name.
|
static GeneralizedLinearRegression |
load(String path) |
IntParam |
maxIter()
Param for maximum number of iterations (>= 0).
|
Param<String> |
offsetCol()
Param for offset column name.
|
static MLReader<T> |
read() |
DoubleParam |
regParam()
Param for regularization parameter (>= 0).
|
GeneralizedLinearRegression |
setAggregationDepth(int value) |
GeneralizedLinearRegression |
setFamily(String value)
Sets the value of param
family. |
GeneralizedLinearRegression |
setFitIntercept(boolean value)
Sets if we should fit the intercept.
|
GeneralizedLinearRegression |
setLink(String value)
Sets the value of param
link. |
GeneralizedLinearRegression |
setLinkPower(double value)
Sets the value of param
linkPower. |
GeneralizedLinearRegression |
setLinkPredictionCol(String value)
Sets the link prediction (linear predictor) column name.
|
GeneralizedLinearRegression |
setMaxIter(int value)
Sets the maximum number of iterations (applicable for solver "irls").
|
GeneralizedLinearRegression |
setOffsetCol(String value)
Sets the value of param
offsetCol. |
GeneralizedLinearRegression |
setRegParam(double value)
Sets the regularization parameter for L2 regularization.
|
GeneralizedLinearRegression |
setSolver(String value)
Sets the solver algorithm used for optimization.
|
GeneralizedLinearRegression |
setTol(double value)
Sets the convergence tolerance of iterations.
|
GeneralizedLinearRegression |
setVariancePower(double value)
Sets the value of param
variancePower. |
GeneralizedLinearRegression |
setWeightCol(String value)
Sets the value of param
weightCol. |
Param<String> |
solver()
The solver algorithm for optimization.
|
DoubleParam |
tol()
Param for the convergence tolerance for iterative algorithms (>= 0).
|
String |
uid()
An immutable unique ID for the object and its derivatives.
|
DoubleParam |
variancePower()
Param for the power in the variance function of the Tweedie distribution which provides
the relationship between the variance and mean of the distribution.
|
Param<String> |
weightCol()
Param for weight column name.
|
featuresCol, fit, labelCol, predictionCol, setFeaturesCol, setLabelCol, setPredictionCol, transformSchemaparamsequals, getClass, hashCode, notify, notifyAll, toString, wait, wait, waitgetFamily, getLink, getLinkPower, getLinkPredictionCol, getOffsetCol, getVariancePower, hasLinkPredictionCol, hasOffsetCol, hasWeightCol, validateAndTransformSchemagetLabelCol, labelColfeaturesCol, getFeaturesColgetPredictionCol, predictionColclear, copyValues, defaultCopy, defaultParamMap, explainParam, explainParams, extractParamMap, extractParamMap, get, getDefault, getOrDefault, getParam, hasDefault, hasParam, isDefined, isSet, onParamChange, paramMap, params, set, set, set, setDefault, setDefault, shouldOwntoStringgetFitInterceptgetMaxItergetRegParamgetWeightColgetAggregationDepth$init$, initializeForcefully, initializeLogIfNecessary, initializeLogIfNecessary, initializeLogIfNecessary$default$2, initLock, isTraceEnabled, log, logDebug, logDebug, logError, logError, logInfo, logInfo, logName, logTrace, logTrace, logWarning, logWarning, org$apache$spark$internal$Logging$$log__$eq, org$apache$spark$internal$Logging$$log_, uninitializewritesavepublic GeneralizedLinearRegression(String uid)
public GeneralizedLinearRegression()
public static GeneralizedLinearRegression load(String path)
public static MLReader<T> read()
public final Param<String> family()
GeneralizedLinearRegressionBasefamily in interface GeneralizedLinearRegressionBasepublic final DoubleParam variancePower()
GeneralizedLinearRegressionBasevariancePower in interface GeneralizedLinearRegressionBasepublic final Param<String> link()
GeneralizedLinearRegressionBaselinkPower.
link in interface GeneralizedLinearRegressionBasepublic final DoubleParam linkPower()
GeneralizedLinearRegressionBasevariancePower, which matches the R "statmod"
package.
linkPower in interface GeneralizedLinearRegressionBasepublic final Param<String> linkPredictionCol()
GeneralizedLinearRegressionBaselinkPredictionCol in interface GeneralizedLinearRegressionBasepublic final Param<String> offsetCol()
GeneralizedLinearRegressionBaseoffsetCol in interface GeneralizedLinearRegressionBasepublic final Param<String> solver()
GeneralizedLinearRegressionBasesolver in interface HasSolversolver in interface GeneralizedLinearRegressionBasepublic final IntParam aggregationDepth()
HasAggregationDepthaggregationDepth in interface HasAggregationDepthpublic final Param<String> weightCol()
HasWeightColweightCol in interface HasWeightColpublic final DoubleParam regParam()
HasRegParamregParam in interface HasRegParampublic final DoubleParam tol()
HasTolpublic final IntParam maxIter()
HasMaxItermaxIter in interface HasMaxIterpublic final BooleanParam fitIntercept()
HasFitInterceptfitIntercept in interface HasFitInterceptpublic String uid()
Identifiableuid in interface Identifiablepublic GeneralizedLinearRegression setFamily(String value)
family.
Default is "gaussian".
value - (undocumented)public GeneralizedLinearRegression setVariancePower(double value)
variancePower.
Used only when family is "tweedie".
Default is 0.0, which corresponds to the "gaussian" family.
value - (undocumented)public GeneralizedLinearRegression setLinkPower(double value)
linkPower.
Used only when family is "tweedie".
value - (undocumented)public GeneralizedLinearRegression setLink(String value)
link.
Used only when family is not "tweedie".
value - (undocumented)public GeneralizedLinearRegression setFitIntercept(boolean value)
value - (undocumented)public GeneralizedLinearRegression setMaxIter(int value)
value - (undocumented)public GeneralizedLinearRegression setTol(double value)
value - (undocumented)public GeneralizedLinearRegression setRegParam(double value)
$$ 0.5 * regParam * L2norm(coefficients)^2 $$Default is 0.0.
value - (undocumented)public GeneralizedLinearRegression setWeightCol(String value)
weightCol.
If this is not set or empty, we treat all instance weights as 1.0.
Default is not set, so all instances have weight one.
In the Binomial family, weights correspond to number of trials and should be integer.
Non-integer weights are rounded to integer in AIC calculation.
value - (undocumented)public GeneralizedLinearRegression setOffsetCol(String value)
offsetCol.
If this is not set or empty, we treat all instance offsets as 0.0.
Default is not set, so all instances have offset 0.0.
value - (undocumented)public GeneralizedLinearRegression setSolver(String value)
value - (undocumented)public GeneralizedLinearRegression setLinkPredictionCol(String value)
value - (undocumented)public GeneralizedLinearRegression setAggregationDepth(int value)
public GeneralizedLinearRegression copy(ParamMap extra)
ParamsdefaultCopy().copy in interface Paramscopy in class Predictor<Vector,GeneralizedLinearRegression,GeneralizedLinearRegressionModel>extra - (undocumented)