public final class DutchStemFilter extends TokenFilter
TokenFilter
that stems Dutch words.
It supports a table of words that should
not be stemmed at all. The stemmer used can be changed at runtime after the
filter object is created (as long as it is a DutchStemmer
).
AttributeSource.AttributeFactory, AttributeSource.State
input
Constructor and Description |
---|
DutchStemFilter(TokenStream _in) |
DutchStemFilter(TokenStream _in,
Set exclusiontable)
Builds a DutchStemFilter that uses an exclusion table.
|
DutchStemFilter(TokenStream _in,
Set exclusiontable,
Map stemdictionary) |
Modifier and Type | Method and Description |
---|---|
boolean |
incrementToken()
Returns the next token in the stream, or null at EOS
|
void |
setExclusionTable(HashSet exclusiontable)
Set an alternative exclusion list for this filter.
|
void |
setStemDictionary(HashMap dict)
Set dictionary for stemming, this dictionary overrules the algorithm,
so you can correct for a particular unwanted word-stem pair.
|
void |
setStemmer(DutchStemmer stemmer)
Set a alternative/custom
DutchStemmer for this filter. |
close, end, reset
getOnlyUseNewAPI, next, next, setOnlyUseNewAPI
addAttribute, addAttributeImpl, captureState, clearAttributes, cloneAttributes, equals, getAttribute, getAttributeClassesIterator, getAttributeFactory, getAttributeImplsIterator, hasAttribute, hasAttributes, hashCode, restoreState, toString
public DutchStemFilter(TokenStream _in)
public DutchStemFilter(TokenStream _in, Set exclusiontable)
public DutchStemFilter(TokenStream _in, Set exclusiontable, Map stemdictionary)
stemdictionary
- Dictionary of word stem pairs, that overrule the algorithmpublic boolean incrementToken() throws IOException
incrementToken
in class TokenStream
Note that this method will be defined abstract in Lucene 3.0.
IOException
public void setStemmer(DutchStemmer stemmer)
DutchStemmer
for this filter.public void setExclusionTable(HashSet exclusiontable)
public void setStemDictionary(HashMap dict)
Copyright © 2000-2012 Apache Software Foundation. All Rights Reserved.