public class StopWordsRemover extends Transformer
| Constructor and Description |
|---|
StopWordsRemover() |
StopWordsRemover(String uid) |
| Modifier and Type | Method and Description |
|---|---|
BooleanParam |
caseSensitive()
whether to do a case sensitive comparison over the stop words
Default: false
|
StopWordsRemover |
copy(ParamMap extra)
Creates a copy of this instance with the same UID and some extra params.
|
boolean |
getCaseSensitive() |
String[] |
getStopWords() |
static StopWordsRemover |
load(String path) |
StopWordsRemover |
setCaseSensitive(boolean value) |
StopWordsRemover |
setInputCol(String value) |
StopWordsRemover |
setOutputCol(String value) |
StopWordsRemover |
setStopWords(String[] value) |
StringArrayParam |
stopWords()
the stop words set to be filtered out
Default:
StopWords.English |
DataFrame |
transform(DataFrame dataset)
Transforms the input dataset.
|
StructType |
transformSchema(StructType schema)
:: DeveloperApi ::
|
String |
uid()
An immutable unique ID for the object and its derivatives.
|
transform, transform, transformequals, getClass, hashCode, notify, notifyAll, toString, wait, wait, waitclear, copyValues, defaultCopy, defaultParamMap, explainParam, explainParams, extractParamMap, extractParamMap, get, getDefault, getOrDefault, getParam, hasDefault, hasParam, isDefined, isSet, paramMap, params, set, set, set, setDefault, setDefault, shouldOwn, validateParamstoStringinitializeIfNecessary, initializeLogging, isTraceEnabled, log_, log, logDebug, logDebug, logError, logError, logInfo, logInfo, logName, logTrace, logTrace, logWarning, logWarningpublic StopWordsRemover(String uid)
public StopWordsRemover()
public static StopWordsRemover load(String path)
public String uid()
Identifiablepublic StopWordsRemover setInputCol(String value)
public StopWordsRemover setOutputCol(String value)
public StringArrayParam stopWords()
StopWords.Englishpublic StopWordsRemover setStopWords(String[] value)
public String[] getStopWords()
public BooleanParam caseSensitive()
public StopWordsRemover setCaseSensitive(boolean value)
public boolean getCaseSensitive()
public DataFrame transform(DataFrame dataset)
Transformertransform in class Transformerdataset - (undocumented)public StructType transformSchema(StructType schema)
PipelineStageDerives the output schema from the input schema.
transformSchema in class PipelineStageschema - (undocumented)public StopWordsRemover copy(ParamMap extra)
Paramscopy in interface Paramscopy in class Transformerextra - (undocumented)defaultCopy()