StandardAnalyzer (Lucene 2.4.0 API)

org.apache.lucene.analysis.standard
Class StandardAnalyzer

java.lang.Object
  org.apache.lucene.analysis.Analyzer
      org.apache.lucene.analysis.standard.StandardAnalyzer

Version: $Id: StandardAnalyzer.java 692634 2008-09-06 10:58:33Z mikemccand $

Field Summary
`static int`	`DEFAULT_MAX_TOKEN_LENGTH` Default maximum allowed token length
`static String[]`	`STOP_WORDS` An array containing some common English words that are usually not useful for searching.

Constructor Summary
`StandardAnalyzer()` Builds an analyzer with the default stop words (`STOP_WORDS`).
`StandardAnalyzer(boolean replaceInvalidAcronym)` Deprecated. To be removed in 3.X, when `true` becomes the only valid value.
`StandardAnalyzer(File stopwords)` Builds an analyzer with the stop words from the given file.
`StandardAnalyzer(File stopwords, boolean replaceInvalidAcronym)` Deprecated. To be removed in 3.X, when `true` becomes the only valid value.
`StandardAnalyzer(Reader stopwords)` Builds an analyzer with the stop words from the given reader.
`StandardAnalyzer(Reader stopwords, boolean replaceInvalidAcronym)` Deprecated. To be removed in 3.X, when `true` becomes the only valid value.
`StandardAnalyzer(Set stopWords)` Builds an analyzer with the given stop words.
`StandardAnalyzer(Set stopwords, boolean replaceInvalidAcronym)` Deprecated. To be removed in 3.X, when `true` becomes the only valid value.
`StandardAnalyzer(String[] stopWords)` Builds an analyzer with the given stop words.
`StandardAnalyzer(String[] stopwords, boolean replaceInvalidAcronym)` Deprecated. To be removed in 3.X, when `true` becomes the only valid value.

Method Summary
`static boolean`	`getDefaultReplaceInvalidAcronym()` Deprecated. This will be removed (hardwired to true) in 3.0
`int`	`getMaxTokenLength()` Returns the maximum allowed token length.
`boolean`	`isReplaceInvalidAcronym()` Deprecated. This will be removed (hardwired to true) in 3.0
`TokenStream`	`reusableTokenStream(String fieldName, Reader reader)` Creates a TokenStream that is allowed to be re-used from the previous time that the same thread called this method.
`static void`	`setDefaultReplaceInvalidAcronym(boolean replaceInvalidAcronym)` Deprecated. This will be removed (hardwired to true) in 3.0
`void`	`setMaxTokenLength(int length)` Set maximum allowed token length.
`void`	`setReplaceInvalidAcronym(boolean replaceInvalidAcronym)` Deprecated. This will be removed (hardwired to true) in 3.0
`TokenStream`	`tokenStream(String fieldName, Reader reader)` Constructs a `StandardTokenizer` filtered by a `StandardFilter`, a `LowerCaseFilter` and a `StopFilter`.
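
The chain described for `tokenStream` can be pictured as three stages: split the text into tokens, lowercase them, then drop stop words. The following is a minimal plain-Java sketch of that idea, not Lucene's implementation. The class `AnalysisChainSketch`, the method `analyze`, and the split-on-non-alphanumerics rule are hypothetical stand-ins; the real `StandardTokenizer` is grammar-based, and the normalization done by `StandardFilter` is left out here.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.HashSet;
import java.util.List;
import java.util.Locale;
import java.util.Set;

// Hypothetical illustration only; none of these names come from Lucene.
final class AnalysisChainSketch {

    // Stage 1: split on anything that is not a letter or digit
    //          (a crude stand-in for StandardTokenizer).
    // Stage 2: lowercase each token (the role of LowerCaseFilter).
    // Stage 3: drop tokens found in the stop set (the role of StopFilter).
    static List<String> analyze(String text, Set<String> stopWords) {
        List<String> out = new ArrayList<String>();
        for (String raw : text.split("[^\\p{L}\\p{N}]+")) {
            if (raw.isEmpty()) {
                continue; // split() can yield a leading empty string
            }
            String lower = raw.toLowerCase(Locale.ROOT);
            if (!stopWords.contains(lower)) {
                out.add(lower);
            }
        }
        return out;
    }

    public static void main(String[] args) {
        Set<String> stop = new HashSet<String>(Arrays.asList("the", "a", "of"));
        System.out.println(analyze("The Art of Searching", stop));
    }
}
```

Because lowercasing happens before the stop-word check in this sketch, the stop set only needs lowercase entries.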

Methods inherited from class org.apache.lucene.analysis.Analyzer
`getPositionIncrementGap, getPreviousTokenStream, setPreviousTokenStream`
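
The inherited `getPreviousTokenStream` and `setPreviousTokenStream` are the hooks that let `reusableTokenStream` hand the same stream back to a thread on later calls. The pattern can be sketched with `java.lang.ThreadLocal`, as below. This is an assumption about the general approach, not a copy of Lucene's code; `ReusableStreamSketch`, `FakeStream`, and `reusableStream` are hypothetical names.

```java
import java.io.Reader;

// Hypothetical illustration of per-thread reuse; not Lucene code.
final class ReusableStreamSketch {

    // Stand-in for a token stream that can be pointed at new input.
    static final class FakeStream {
        Reader input;

        void reset(Reader newInput) {
            this.input = newInput;
        }
    }

    // One cached stream per thread, so callers need no locking.
    private final ThreadLocal<FakeStream> previous = new ThreadLocal<FakeStream>();

    FakeStream reusableStream(Reader reader) {
        FakeStream stream = previous.get();
        if (stream == null) {
            stream = new FakeStream(); // first call on this thread
            previous.set(stream);
        }
        stream.reset(reader);          // later calls only swap the input
        return stream;
    }
}
```

One consequence of this design is that a stream obtained earlier on the same thread is repointed by the next call, so a caller should finish consuming one stream before requesting another.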

Methods inherited from class java.lang.Object
`clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait`

Field Detail

public static final String[] STOP_WORDS

An array containing some common English words that are usually not useful for searching.

public static final int DEFAULT_MAX_TOKEN_LENGTH

Default maximum allowed token length
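
The field description does not say what happens to a token that exceeds the limit. The sketch below assumes such tokens are skipped entirely rather than truncated, and uses 255 as a placeholder for the default; both are assumptions to verify against your Lucene release. `MaxTokenLengthSketch` and `dropLongTokens` are hypothetical names.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical illustration; the names below are not part of Lucene.
final class MaxTokenLengthSketch {

    // Assumed value for illustration; check DEFAULT_MAX_TOKEN_LENGTH in your release.
    static final int ASSUMED_DEFAULT_MAX = 255;

    // Keeps tokens whose length is within the limit and skips the rest.
    static List<String> dropLongTokens(List<String> tokens, int maxTokenLength) {
        List<String> kept = new ArrayList<String>();
        for (String token : tokens) {
            if (token.length() <= maxTokenLength) {
                kept.add(token);
            }
        }
        return kept;
    }
}
```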

Constructor Detail

public StandardAnalyzer()

Builds an analyzer with the default stop words (STOP_WORDS).

public StandardAnalyzer(Set stopWords)

Builds an analyzer with the given stop words.

public StandardAnalyzer(String[] stopWords)

Builds an analyzer with the given stop words.

public StandardAnalyzer(File stopwords)
                 throws IOException

Builds an analyzer with the stop words from the given file.

public StandardAnalyzer(Reader stopwords)
                 throws IOException

Builds an analyzer with the stop words from the given reader.
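
The `File` and `Reader` constructors take the stop list as text. A plain-Java sketch of loading such a list follows, assuming one word per line with surrounding whitespace trimmed. Skipping blank lines is a choice made by this sketch and may differ from what Lucene does; `StopWordReaderSketch` and `readWords` are hypothetical names.

```java
import java.io.BufferedReader;
import java.io.IOException;
import java.io.Reader;
import java.util.HashSet;
import java.util.Set;

// Hypothetical illustration; not Lucene's word-list loading code.
final class StopWordReaderSketch {

    // Reads one stop word per line. The caller owns the reader and closes it.
    static Set<String> readWords(Reader reader) throws IOException {
        Set<String> words = new HashSet<String>();
        BufferedReader br = new BufferedReader(reader);
        String line;
        while ((line = br.readLine()) != null) {
            String word = line.trim();
            if (!word.isEmpty()) { // sketch-specific: ignore blank lines
                words.add(word);
            }
        }
        return words;
    }
}
```

A set built this way could then be passed to a constructor that accepts a `Set` of stop words.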