org.apache.spark.ml.feature
Class Tokenizer

Object
  extended by org.apache.spark.ml.PipelineStage
      extended by org.apache.spark.ml.Transformer
          extended by org.apache.spark.ml.UnaryTransformer<String,scala.collection.Seq<String>,Tokenizer>
              extended by org.apache.spark.ml.feature.Tokenizer
All Implemented Interfaces:
java.io.Serializable, Logging, Params

public class Tokenizer
extends UnaryTransformer<String,scala.collection.Seq<String>,Tokenizer>

:: Experimental :: A tokenizer that converts the input string to lowercase and then splits it on whitespace.
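
The behavior described above can be illustrated with a minimal plain-Java sketch. It does not use Spark and is not the library's implementation: the class name TokenizerSketch, the method tokenize, and the choice of the regular expression "\\s" as the whitespace delimiter are assumptions made for illustration only.

```java
import java.util.Arrays;
import java.util.List;
import java.util.Locale;

// Hypothetical helper, not part of Spark: it only mimics the documented
// "lowercase, then split on whitespace" behavior.
final class TokenizerSketch {

    // Lowercases the input, then splits it at each whitespace character.
    // The "\\s" delimiter is an assumption of this sketch.
    static List<String> tokenize(String input) {
        return Arrays.asList(input.toLowerCase(Locale.ROOT).split("\\s"));
    }

    public static void main(String[] args) {
        // prints [hi, i, heard, about, spark]
        System.out.println(tokenize("Hi I heard about Spark"));
    }
}
```

Because this sketch splits at every single whitespace character, consecutive whitespace produces empty tokens in its output. Where that matters, a pattern-based splitter such as RegexTokenizer (listed under See Also) may be the better fit.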

See Also:
RegexTokenizer, Serialized Form

Constructor Summary
Tokenizer()
           
Tokenizer(String uid)
           
 
Method Summary
 Tokenizer copy(ParamMap extra)
          Creates a copy of this instance with the same UID and some extra params.
 String uid()
           
 
Methods inherited from class org.apache.spark.ml.UnaryTransformer
setInputCol, setOutputCol, transform, transformSchema
 
Methods inherited from class org.apache.spark.ml.Transformer
transform, transform, transform
 
Methods inherited from class Object
equals, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
 
Methods inherited from interface org.apache.spark.Logging
initializeIfNecessary, initializeLogging, isTraceEnabled, log_, log, logDebug, logDebug, logError, logError, logInfo, logInfo, logName, logTrace, logTrace, logWarning, logWarning
 
Methods inherited from interface org.apache.spark.ml.param.Params
clear, copyValues, defaultCopy, defaultParamMap, explainParam, explainParams, extractParamMap, extractParamMap, get, getDefault, getOrDefault, getParam, hasDefault, hasParam, isDefined, isSet, paramMap, params, set, set, set, setDefault, setDefault, setDefault, shouldOwn, validateParams
 

Constructor Detail

Tokenizer

public Tokenizer(String uid)

Tokenizer

public Tokenizer()

Method Detail

uid

public String uid()

copy

public Tokenizer copy(ParamMap extra)
Description copied from interface: Params
Creates a copy of this instance with the same UID and some extra params. Subclasses should implement this method and set the return type properly.

Specified by:
copy in interface Params
Overrides:
copy in class UnaryTransformer<String,scala.collection.Seq<String>,Tokenizer>
Parameters:
extra - extra params to apply to the copy
Returns:
a copy of this instance with the same UID and the extra params
See Also:
defaultCopy()