public class JapaneseAnalyzer extends StopwordAnalyzerBase
See Also: JapaneseTokenizer
Nested classes inherited from class Analyzer: Analyzer.ReuseStrategy, Analyzer.TokenStreamComponents
Fields inherited from class StopwordAnalyzerBase: stopwords
Fields inherited from class Analyzer: GLOBAL_REUSE_STRATEGY, PER_FIELD_REUSE_STRATEGY

| Constructor and Description |
|---|
| JapaneseAnalyzer() |
| JapaneseAnalyzer(UserDictionary userDict, JapaneseTokenizer.Mode mode, CharArraySet stopwords, java.util.Set<java.lang.String> stoptags) |

| Modifier and Type | Method and Description |
|---|---|
| protected Analyzer.TokenStreamComponents | createComponents(java.lang.String fieldName) Creates a new Analyzer.TokenStreamComponents instance for this analyzer. |
| static CharArraySet | getDefaultStopSet() |
| static java.util.Set<java.lang.String> | getDefaultStopTags() |
| protected TokenStream | normalize(java.lang.String fieldName, TokenStream in) Wrap the given TokenStream in order to apply normalization filters. |

Methods inherited from class StopwordAnalyzerBase: getStopwordSet, loadStopwordSet, loadStopwordSet, loadStopwordSet
Methods inherited from class Analyzer: attributeFactory, close, getOffsetGap, getPositionIncrementGap, getReuseStrategy, getVersion, initReader, initReaderForNormalization, normalize, setVersion, tokenStream, tokenStream
public JapaneseAnalyzer()
public JapaneseAnalyzer(UserDictionary userDict, JapaneseTokenizer.Mode mode, CharArraySet stopwords, java.util.Set<java.lang.String> stoptags)
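As an illustration of the four-argument constructor, the following is a minimal sketch rather than a reference usage. It assumes the Lucene Kuromoji analysis module is on the classpath under the package `org.apache.lucene.analysis.ja`, that passing `null` for `userDict` means "no user dictionary", and that `JapaneseTokenizer.Mode.SEARCH` is an acceptable mode for the use case. The class name `JapaneseAnalyzerDemo`, the helper `tokenize`, the field name `"body"`, and the sample text are all hypothetical.

```java
import java.io.IOException;
import java.util.ArrayList;
import java.util.List;

import org.apache.lucene.analysis.Analyzer;
import org.apache.lucene.analysis.TokenStream;
import org.apache.lucene.analysis.ja.JapaneseAnalyzer;
import org.apache.lucene.analysis.ja.JapaneseTokenizer;
import org.apache.lucene.analysis.tokenattributes.CharTermAttribute;

public class JapaneseAnalyzerDemo {

    // Hypothetical helper: runs the analyzer over the text and collects the emitted terms.
    static List<String> tokenize(Analyzer analyzer, String field, String text) throws IOException {
        List<String> terms = new ArrayList<>();
        try (TokenStream ts = analyzer.tokenStream(field, text)) {
            CharTermAttribute term = ts.addAttribute(CharTermAttribute.class);
            ts.reset();                      // required before the first incrementToken()
            while (ts.incrementToken()) {
                terms.add(term.toString());
            }
            ts.end();                        // finalizes end-of-stream state
        }
        return terms;
    }

    public static void main(String[] args) throws IOException {
        // Assumption: null userDict means no user dictionary is applied.
        try (Analyzer analyzer = new JapaneseAnalyzer(
                null,
                JapaneseTokenizer.Mode.SEARCH,
                JapaneseAnalyzer.getDefaultStopSet(),
                JapaneseAnalyzer.getDefaultStopTags())) {
            System.out.println(tokenize(analyzer, "body", "関西国際空港に行きました"));
        }
    }
}
```

The exact terms printed depend on the dictionary, mode, and filters shipped with the Lucene version in use, so no expected output is shown here.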
public static CharArraySet getDefaultStopSet()
public static java.util.Set<java.lang.String> getDefaultStopTags()
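The two static getters can serve as a starting point for a customized configuration. The sketch below assumes the returned sets are shared defaults that should not be mutated directly, so it copies them first. The class `CustomStopConfig`, its helper methods, and the sample word and tag passed in `main` are hypothetical names chosen for illustration.

```java
import java.util.HashSet;
import java.util.Set;

import org.apache.lucene.analysis.CharArraySet;
import org.apache.lucene.analysis.ja.JapaneseAnalyzer;
import org.apache.lucene.analysis.ja.JapaneseTokenizer;

public class CustomStopConfig {

    // Copies the default stopwords into a new, modifiable set and adds extra entries.
    static CharArraySet extendedStopwords(String... extra) {
        CharArraySet copy = new CharArraySet(JapaneseAnalyzer.getDefaultStopSet(), true);
        for (String word : extra) {
            copy.add(word);
        }
        return copy;
    }

    // Copies the default stop tags and removes the given tags, so tokens
    // carrying those part-of-speech tags are no longer filtered out.
    static Set<String> stopTagsWithout(String... tagsToKeep) {
        Set<String> copy = new HashSet<>(JapaneseAnalyzer.getDefaultStopTags());
        for (String tag : tagsToKeep) {
            copy.remove(tag);
        }
        return copy;
    }

    public static void main(String[] args) {
        CharArraySet stopwords = extendedStopwords("東京");      // illustrative extra stopword
        Set<String> stoptags = stopTagsWithout("助詞-格助詞-一般"); // illustrative tag; may or may not be in the defaults
        // Assumption: null userDict means no user dictionary is applied.
        try (JapaneseAnalyzer analyzer = new JapaneseAnalyzer(
                null, JapaneseTokenizer.Mode.SEARCH, stopwords, stoptags)) {
            System.out.println("stopwords: " + stopwords.size() + ", stoptags: " + stoptags.size());
        }
    }
}
```

Copying before modifying keeps the defaults intact for other analyzers in the same JVM that also rely on `getDefaultStopSet()` or `getDefaultStopTags()`.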
protected Analyzer.TokenStreamComponents createComponents(java.lang.String fieldName)
Description copied from class: Analyzer
Creates a new Analyzer.TokenStreamComponents instance for this analyzer.
Specified by: createComponents in class Analyzer
Parameters: fieldName - the name of the field whose content is passed to the Analyzer.TokenStreamComponents sink as a reader
Returns: the Analyzer.TokenStreamComponents for this analyzer.
protected TokenStream normalize(java.lang.String fieldName, TokenStream in)
Description copied from class: Analyzer
Wrap the given TokenStream in order to apply normalization filters. The default implementation returns the TokenStream as-is. This is used by Analyzer.normalize(String, String).
Copyright © 2000–2025 The Apache Software Foundation. All rights reserved.