public class WordlistLoader
extends java.lang.Object
to obtain {@link Reader} instances| Constructor and Description |
|---|
WordlistLoader() |
| Modifier and Type | Method and Description |
|---|---|
static CharArraySet |
getSnowballWordSet(java.io.Reader reader,
CharArraySet result)
Reads stopwords from a stopword list in Snowball format.
|
static CharArraySet |
getSnowballWordSet(java.io.Reader reader,
Version matchVersion)
Reads stopwords from a stopword list in Snowball format.
|
static CharArrayMap<java.lang.String> |
getStemDict(java.io.Reader reader,
CharArrayMap<java.lang.String> result)
Reads a stem dictionary.
|
static CharArraySet |
getWordSet(java.io.Reader reader,
CharArraySet result)
Reads lines from a Reader and adds every line as an entry to a CharArraySet (omitting
leading and trailing whitespace).
|
static CharArraySet |
getWordSet(java.io.Reader reader,
java.lang.String comment,
CharArraySet result)
Reads lines from a Reader and adds every non-comment line as an entry to a CharArraySet (omitting
leading and trailing whitespace).
|
static CharArraySet |
getWordSet(java.io.Reader reader,
java.lang.String comment,
Version matchVersion)
Reads lines from a Reader and adds every non-comment line as an entry to a CharArraySet (omitting
leading and trailing whitespace).
|
static CharArraySet |
getWordSet(java.io.Reader reader,
Version matchVersion)
Reads lines from a Reader and adds every line as an entry to a CharArraySet (omitting
leading and trailing whitespace).
|
public static CharArraySet getWordSet(java.io.Reader reader, CharArraySet result) throws java.io.IOException
reader - Reader containing the wordlistresult - the CharArraySet to fill with the readers wordsCharArraySet with the reader's wordsjava.io.IOExceptionpublic static CharArraySet getWordSet(java.io.Reader reader, Version matchVersion) throws java.io.IOException
reader - Reader containing the wordlistmatchVersion - the Lucene VersionCharArraySet with the reader's wordsjava.io.IOExceptionpublic static CharArraySet getWordSet(java.io.Reader reader, java.lang.String comment, Version matchVersion) throws java.io.IOException
reader - Reader containing the wordlistcomment - The string representing a comment.matchVersion - the Lucene Versionjava.io.IOExceptionpublic static CharArraySet getWordSet(java.io.Reader reader, java.lang.String comment, CharArraySet result) throws java.io.IOException
reader - Reader containing the wordlistcomment - The string representing a comment.result - the CharArraySet to fill with the readers wordsCharArraySet with the reader's wordsjava.io.IOExceptionpublic static CharArraySet getSnowballWordSet(java.io.Reader reader, CharArraySet result) throws java.io.IOException
The snowball format is the following:
reader - Reader containing a Snowball stopword listresult - the CharArraySet to fill with the readers wordsCharArraySet with the reader's wordsjava.io.IOExceptionpublic static CharArraySet getSnowballWordSet(java.io.Reader reader, Version matchVersion) throws java.io.IOException
The snowball format is the following:
reader - Reader containing a Snowball stopword listmatchVersion - the Lucene VersionCharArraySet with the reader's wordsjava.io.IOExceptionpublic static CharArrayMap<java.lang.String> getStemDict(java.io.Reader reader, CharArrayMap<java.lang.String> result) throws java.io.IOException
word\tstem(i.e. two tab separated words)
java.io.IOExceptionCopyright © 2000-2022 Apache Software Foundation. All Rights Reserved.