public interface Word2VecBase extends Params, HasInputCol, HasOutputCol, HasMaxIter, HasStepSize, HasSeed
Word2Vec and Word2VecModel.| Modifier and Type | Method and Description |
|---|---|
int |
getMaxSentenceLength() |
int |
getMinCount() |
int |
getNumPartitions() |
int |
getVectorSize() |
int |
getWindowSize() |
IntParam |
maxSentenceLength()
Sets the maximum length (in words) of each sentence in the input data.
|
IntParam |
minCount()
The minimum number of times a token must appear to be included in the word2vec model's
vocabulary.
|
IntParam |
numPartitions()
Number of partitions for sentences of words.
|
StructType |
validateAndTransformSchema(StructType schema)
Validate and transform the input schema.
|
IntParam |
vectorSize()
The dimension of the code that you want to transform from words.
|
IntParam |
windowSize()
The window size (context words from [-window, window]).
|
getInputCol, inputColgetOutputCol, outputColgetMaxIter, maxItergetStepSize, stepSizeclear, copy, copyValues, defaultCopy, defaultParamMap, explainParam, explainParams, extractParamMap, extractParamMap, get, getDefault, getOrDefault, getParam, hasDefault, hasParam, isDefined, isSet, onParamChange, paramMap, params, set, set, set, setDefault, setDefault, shouldOwntoString, uidint getMaxSentenceLength()
int getMinCount()
int getNumPartitions()
int getVectorSize()
int getWindowSize()
IntParam maxSentenceLength()
maxSentenceLength size.
Default: 1000IntParam minCount()
IntParam numPartitions()
StructType validateAndTransformSchema(StructType schema)
schema - (undocumented)IntParam vectorSize()
IntParam windowSize()