public class WhitespaceTokenizerFactory extends TokenizerFactory
WhitespaceTokenizer.
<fieldType name="text_ws" class="solr.TextField" positionIncrementGap="100">
<analyzer>
<tokenizer class="solr.WhitespaceTokenizerFactory" rule="unicode" maxTokenLen="256"/>
</analyzer>
</fieldType>
Options:
WhitespaceTokenizer
or "unicode" for UnicodeWhitespaceTokenizerCharTokenizer::DEFAULT_MAX_TOKEN_LEN| Modifier and Type | Field and Description |
|---|---|
static java.lang.String |
RULE_JAVA |
static java.lang.String |
RULE_UNICODE |
LUCENE_MATCH_VERSION_PARAM, luceneMatchVersion| Constructor and Description |
|---|
WhitespaceTokenizerFactory(java.util.Map<java.lang.String,java.lang.String> args)
Creates a new WhitespaceTokenizerFactory
|
| Modifier and Type | Method and Description |
|---|---|
Tokenizer |
create(AttributeFactory factory)
Creates a TokenStream of the specified input using the given AttributeFactory
|
availableTokenizers, create, forName, lookupClass, reloadTokenizersget, get, get, get, get, getBoolean, getChar, getClassArg, getFloat, getInt, getLines, getLuceneMatchVersion, getOriginalArgs, getPattern, getSet, getSnowballWordSet, getWordSet, isExplicitLuceneMatchVersion, require, require, require, requireBoolean, requireChar, requireFloat, requireInt, setExplicitLuceneMatchVersion, splitAt, splitFileNamespublic static final java.lang.String RULE_JAVA
public static final java.lang.String RULE_UNICODE
public WhitespaceTokenizerFactory(java.util.Map<java.lang.String,java.lang.String> args)
public Tokenizer create(AttributeFactory factory)
TokenizerFactorycreate in class TokenizerFactoryCopyright © 2000–2025 The Apache Software Foundation. All rights reserved.