public class HighFreqTerms
extends java.lang.Object
HighFreqTerms class extracts the top n most frequent terms
(by document frequency ) from an existing Lucene index and reports their
document frequency. If used with the -t flag it also reports their
total tf (total number of occurences) in order of highest total tf| Modifier and Type | Field and Description |
|---|---|
static int |
DEFAULTnumTerms |
static int |
numTerms |
| Constructor and Description |
|---|
HighFreqTerms() |
| Modifier and Type | Method and Description |
|---|---|
static org.apache.lucene.misc.TermStats[] |
getHighFreqTerms(IndexReader reader,
int numTerms,
java.lang.String field) |
static long |
getTotalTermFreq(IndexReader reader,
Term term) |
static void |
main(java.lang.String[] args) |
static org.apache.lucene.misc.TermStats[] |
sortByTotalTermFreq(IndexReader reader,
org.apache.lucene.misc.TermStats[] terms)
Takes array of TermStats.
|
public static final int DEFAULTnumTerms
public static int numTerms
public static void main(java.lang.String[] args)
throws java.lang.Exception
java.lang.Exceptionpublic static org.apache.lucene.misc.TermStats[] getHighFreqTerms(IndexReader reader, int numTerms, java.lang.String field) throws java.lang.Exception
reader - numTerms - field - java.lang.Exceptionpublic static org.apache.lucene.misc.TermStats[] sortByTotalTermFreq(IndexReader reader, org.apache.lucene.misc.TermStats[] terms) throws java.lang.Exception
reader - terms - TermStats[]java.lang.Exceptionpublic static long getTotalTermFreq(IndexReader reader, Term term) throws java.lang.Exception
java.lang.ExceptionCopyright © 2000-2022 Apache Software Foundation. All Rights Reserved.