public interface CountVectorizerParams extends Params, HasInputCol, HasOutputCol
CountVectorizer and CountVectorizerModel.| Modifier and Type | Method and Description | 
|---|---|
| BooleanParam | binary()Binary toggle to control the output vector values. | 
| boolean | getBinary() | 
| double | getMaxDF() | 
| double | getMinDF() | 
| double | getMinTF() | 
| int | getVocabSize() | 
| DoubleParam | maxDF()Specifies the maximum number of different documents a term could appear in to be included
 in the vocabulary. | 
| DoubleParam | minDF()Specifies the minimum number of different documents a term must appear in to be included
 in the vocabulary. | 
| DoubleParam | minTF()Filter to ignore rare words in a document. | 
| StructType | validateAndTransformSchema(StructType schema)Validates and transforms the input schema. | 
| IntParam | vocabSize()Max size of the vocabulary. | 
getInputCol, inputColgetOutputCol, outputColclear, copy, copyValues, defaultCopy, defaultParamMap, explainParam, explainParams, extractParamMap, extractParamMap, get, getDefault, getOrDefault, getParam, hasDefault, hasParam, isDefined, isSet, onParamChange, paramMap, params, set, set, set, setDefault, setDefault, shouldOwntoString, uidBooleanParam binary()
boolean getBinary()
double getMaxDF()
double getMinDF()
double getMinTF()
int getVocabSize()
DoubleParam maxDF()
Default: (2^63^) - 1
DoubleParam minDF()
Default: 1.0
DoubleParam minTF()
 Note that the parameter is only used in transform of CountVectorizerModel and does not
 affect fitting.
 
Default: 1.0
StructType validateAndTransformSchema(StructType schema)
IntParam vocabSize()
Default: 2^18^