public final class CzechAnalyzer extends Analyzer
| Modifier and Type | Field and Description |
|---|---|
| `static java.lang.String[]` | `CZECH_STOP_WORDS` List of typical stopwords. |

| Constructor and Description |
|---|
| `CzechAnalyzer()` Builds an analyzer with the default stop words (`CZECH_STOP_WORDS`). |
| `CzechAnalyzer(java.io.File stopwords)` Builds an analyzer with the given stop words. |
| `CzechAnalyzer(java.util.HashSet stopwords)` |
| `CzechAnalyzer(java.lang.String[] stopwords)` Builds an analyzer with the given stop words. |

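
The constructors above differ only in where the stop words come from (the default list, a file, a `HashSet`, or a `String[]`). As a rough illustration of what "an analyzer with the given stop words" does to text, here is a minimal JDK-only sketch. It is not Lucene code: the class `StopWordFilterSketch`, the method `analyze`, the regex-based splitting, and the sample stop list are all assumptions made for this example, and the real analyzer's tokenizer chain is not described on this page.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.HashSet;
import java.util.List;
import java.util.Locale;
import java.util.Set;

// Hypothetical illustration class; not part of Lucene.
class StopWordFilterSketch {

    // Splits text on runs of characters that are neither letters nor digits,
    // lowercases each token, and drops tokens found in the stop set.
    static List<String> analyze(String text, Set<String> stopWords) {
        List<String> kept = new ArrayList<>();
        for (String raw : text.split("[^\\p{L}\\p{Nd}]+")) {
            if (raw.isEmpty()) {
                continue; // split() can yield an empty leading element
            }
            String token = raw.toLowerCase(Locale.ROOT);
            if (!stopWords.contains(token)) {
                kept.add(token);
            }
        }
        return kept;
    }

    public static void main(String[] args) {
        // Sample stop list chosen for the example; not the actual CZECH_STOP_WORDS.
        String[] stopWords = {"a", "na"};
        Set<String> stopSet = new HashSet<>(Arrays.asList(stopWords));
        System.out.println(analyze("Karel a Jana bydleli na hradu", stopSet));
        // prints [karel, jana, bydleli, hradu]
    }
}
```

Converting the array to a set first keeps each stop-word lookup constant-time, which is presumably why one of the constructors accepts a `HashSet` directly.
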
| Modifier and Type | Method and Description |
|---|---|
| `void` | `loadStopWords(java.io.InputStream wordfile, java.lang.String encoding)` Loads stopwords hash from resource stream (file, database...). |
| `TokenStream` | `tokenStream(java.lang.String fieldName, java.io.Reader reader)` Creates a TokenStream which tokenizes all the text in the provided Reader. |

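
`loadStopWords` takes a stream plus an encoding name, with `null` meaning the default system encoding. The sketch below shows one way such a loader could work using only the JDK. It assumes a one-word-per-line format, which this page does not specify, and the names `StopWordLoaderSketch` and `load` are hypothetical; it is not the Lucene implementation.

```java
import java.io.BufferedReader;
import java.io.ByteArrayInputStream;
import java.io.IOException;
import java.io.InputStream;
import java.io.InputStreamReader;
import java.nio.charset.Charset;
import java.util.HashSet;
import java.util.Set;

// Hypothetical illustration class; not part of Lucene.
class StopWordLoaderSketch {

    // Reads one word per line (an assumed format) using the named encoding,
    // or the platform default charset when encoding is null.
    static Set<String> load(InputStream wordfile, String encoding) throws IOException {
        Charset charset = (encoding == null)
                ? Charset.defaultCharset()
                : Charset.forName(encoding);
        Set<String> words = new HashSet<>();
        try (BufferedReader reader =
                 new BufferedReader(new InputStreamReader(wordfile, charset))) {
            String line;
            while ((line = reader.readLine()) != null) {
                String word = line.trim();
                if (!word.isEmpty()) {
                    words.add(word); // skip blank lines, keep everything else
                }
            }
        }
        return words;
    }

    public static void main(String[] args) throws IOException {
        // Three Czech words encoded as ISO-8859-2; \u010D is "c with caron",
        // \u017E is "z with caron".
        byte[] bytes = "\u010Di\n\u017Ee\nna\n".getBytes(Charset.forName("ISO-8859-2"));
        Set<String> words = load(new ByteArrayInputStream(bytes), "ISO-8859-2");
        System.out.println(words.size()); // prints 3
    }
}
```

Passing the wrong encoding name would not fail here; it would silently produce different strings for words with diacritics, so the stop list would no longer match the analyzed text.
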
Methods inherited from class Analyzer: close, getPositionIncrementGap, getPreviousTokenStream, reusableTokenStream, setPreviousTokenStream

public static final java.lang.String[] CZECH_STOP_WORDS

public CzechAnalyzer()
Builds an analyzer with the default stop words (CZECH_STOP_WORDS).

public CzechAnalyzer(java.lang.String[] stopwords)

public CzechAnalyzer(java.util.HashSet stopwords)

public CzechAnalyzer(java.io.File stopwords) throws java.io.IOException
Throws: java.io.IOException

public void loadStopWords(java.io.InputStream wordfile, java.lang.String encoding)
Parameters:
wordfile - File containing the wordlist
encoding - Encoding used (win-1250, iso-8859-2, ...), null for default system encoding

public final TokenStream tokenStream(java.lang.String fieldName, java.io.Reader reader)
Specified by: tokenStream in class Analyzer

Copyright © 2000-2014 Apache Software Foundation. All Rights Reserved.