|
||||||||||
PREV CLASS NEXT CLASS | FRAMES NO FRAMES | |||||||||
SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD |
java.lang.Objectorg.apache.lucene.analysis.Analyzer
org.apache.lucene.analysis.cz.CzechAnalyzer
public final class CzechAnalyzer
Analyzer for Czech language. Supports an external list of stopwords (words that will not be indexed at all). A default set of stopwords is used unless an alternative list is specified, the exclusion list is empty by default.
Field Summary | |
---|---|
static String[] |
CZECH_STOP_WORDS
List of typical stopwords. |
Constructor Summary | |
---|---|
CzechAnalyzer()
Builds an analyzer with the default stop words ( CZECH_STOP_WORDS ). |
|
CzechAnalyzer(File stopwords)
Builds an analyzer with the given stop words. |
|
CzechAnalyzer(HashSet stopwords)
|
|
CzechAnalyzer(String[] stopwords)
Builds an analyzer with the given stop words. |
Method Summary | |
---|---|
void |
loadStopWords(InputStream wordfile,
String encoding)
Loads stopwords hash from resource stream (file, database...). |
TokenStream |
tokenStream(String fieldName,
Reader reader)
Creates a TokenStream which tokenizes all the text in the provided Reader. |
Methods inherited from class org.apache.lucene.analysis.Analyzer |
---|
getPositionIncrementGap, getPreviousTokenStream, reusableTokenStream, setPreviousTokenStream |
Methods inherited from class java.lang.Object |
---|
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait |
Field Detail |
---|
public static final String[] CZECH_STOP_WORDS
Constructor Detail |
---|
public CzechAnalyzer()
CZECH_STOP_WORDS
).
public CzechAnalyzer(String[] stopwords)
public CzechAnalyzer(HashSet stopwords)
public CzechAnalyzer(File stopwords) throws IOException
IOException
Method Detail |
---|
public void loadStopWords(InputStream wordfile, String encoding)
wordfile
- File containing the wordlistencoding
- Encoding used (win-1250, iso-8859-2, ...), null for default system encodingpublic final TokenStream tokenStream(String fieldName, Reader reader)
tokenStream
in class Analyzer
|
||||||||||
PREV CLASS NEXT CLASS | FRAMES NO FRAMES | |||||||||
SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD |