Provides various convenience classes for creating payloads on Tokens.
org.apache.lucene.analysis.ru - package org.apache.lucene.analysis.ruAnalyzer for Russian.org.apache.lucene.analysis.sinks - package org.apache.lucene.analysis.sinksImplementations of the SinkTokenizer that might be useful.
org.apache.lucene.analysis.th - package org.apache.lucene.analysis.th
P
- PayloadHelper - Class in org.apache.lucene.analysis.payloads
-
- PayloadHelper() -
Constructor for class org.apache.lucene.analysis.payloads.PayloadHelper
-
R
- reset() -
Method in class org.apache.lucene.analysis.sinks.TokenRangeSinkTokenizer
-
- RussianAnalyzer - Class in org.apache.lucene.analysis.ru
- Analyzer for Russian language.
- RussianAnalyzer() -
Constructor for class org.apache.lucene.analysis.ru.RussianAnalyzer
-
- RussianAnalyzer(char[]) -
Constructor for class org.apache.lucene.analysis.ru.RussianAnalyzer
- Builds an analyzer.
- RussianAnalyzer(char[], String[]) -
Constructor for class org.apache.lucene.analysis.ru.RussianAnalyzer
- Builds an analyzer with the given stop words.
- RussianAnalyzer(char[], Hashtable) -
Constructor for class org.apache.lucene.analysis.ru.RussianAnalyzer
- Builds an analyzer with the given stop words.
- RussianCharsets - Class in org.apache.lucene.analysis.ru
- RussianCharsets class contains encodings schemes (charsets) and toLowerCase() method implementation
for russian characters in Unicode, KOI8 and CP1252.
- RussianCharsets() -
Constructor for class org.apache.lucene.analysis.ru.RussianCharsets
-
- RussianLetterTokenizer - Class in org.apache.lucene.analysis.ru
- A RussianLetterTokenizer is a tokenizer that extends LetterTokenizer by additionally looking up letters
in a given "russian charset".
- RussianLetterTokenizer(Reader, char[]) -
Constructor for class org.apache.lucene.analysis.ru.RussianLetterTokenizer
-
- RussianLowerCaseFilter - Class in org.apache.lucene.analysis.ru
- Normalizes token text to lower case, analyzing given ("russian") charset.
- RussianLowerCaseFilter(TokenStream, char[]) -
Constructor for class org.apache.lucene.analysis.ru.RussianLowerCaseFilter
-
- RussianStemFilter - Class in org.apache.lucene.analysis.ru
- A filter that stems Russian words.
- RussianStemFilter(TokenStream, char[]) -
Constructor for class org.apache.lucene.analysis.ru.RussianStemFilter
-
S
- setArticles(Set) -
Method in class org.apache.lucene.analysis.fr.ElisionFilter
-
- setExclusionSet(Set) -
Method in class org.apache.lucene.analysis.de.GermanStemFilter
- Set an alternative exclusion list for this filter.
- setExclusionTable(Hashtable) -
Method in class org.apache.lucene.analysis.fr.FrenchStemFilter
- Set an alternative exclusion list for this filter.
- setExclusionTable(HashSet) -
Method in class org.apache.lucene.analysis.nl.DutchStemFilter
- Set an alternative exclusion list for this filter.
- setStemDictionary(File) -
Method in class org.apache.lucene.analysis.nl.DutchAnalyzer
- Reads a stemdictionary file , that overrules the stemming algorithm
This is a textfile that contains per line
word\tstem, i.e: two tab seperated words
- setStemDictionary(HashMap) -
Method in class org.apache.lucene.analysis.nl.DutchStemFilter
- Set dictionary for stemming, this dictionary overrules the algorithm,
so you can correct for a particular unwanted word-stem pair.
- setStemExclusionTable(String[]) -
Method in class org.apache.lucene.analysis.br.BrazilianAnalyzer
- Builds an exclusionlist from an array of Strings.
- setStemExclusionTable(Hashtable) -
Method in class org.apache.lucene.analysis.br.BrazilianAnalyzer
- Builds an exclusionlist from a Hashtable.
- setStemExclusionTable(File) -
Method in class org.apache.lucene.analysis.br.BrazilianAnalyzer
- Builds an exclusionlist from the words contained in the given file.
- setStemExclusionTable(String[]) -
Method in class org.apache.lucene.analysis.de.GermanAnalyzer
- Builds an exclusionlist from an array of Strings.
- setStemExclusionTable(Hashtable) -
Method in class org.apache.lucene.analysis.de.GermanAnalyzer
- Builds an exclusionlist from a Hashtable.
- setStemExclusionTable(File) -
Method in class org.apache.lucene.analysis.de.GermanAnalyzer
- Builds an exclusionlist from the words contained in the given file.
- setStemExclusionTable(String[]) -
Method in class org.apache.lucene.analysis.fr.FrenchAnalyzer
- Builds an exclusionlist from an array of Strings.
- setStemExclusionTable(Hashtable) -
Method in class org.apache.lucene.analysis.fr.FrenchAnalyzer
- Builds an exclusionlist from a Hashtable.
- setStemExclusionTable(File) -
Method in class org.apache.lucene.analysis.fr.FrenchAnalyzer
- Builds an exclusionlist from the words contained in the given file.
- setStemExclusionTable(String[]) -
Method in class org.apache.lucene.analysis.nl.DutchAnalyzer
- Builds an exclusionlist from an array of Strings.
- setStemExclusionTable(HashSet) -
Method in class org.apache.lucene.analysis.nl.DutchAnalyzer
- Builds an exclusionlist from a Hashtable.
- setStemExclusionTable(File) -
Method in class org.apache.lucene.analysis.nl.DutchAnalyzer
- Builds an exclusionlist from the words contained in the given file.
- setStemmer(GermanStemmer) -
Method in class org.apache.lucene.analysis.de.GermanStemFilter
- Set a alternative/custom GermanStemmer for this filter.
- setStemmer(FrenchStemmer) -
Method in class org.apache.lucene.analysis.fr.FrenchStemFilter
- Set a alternative/custom FrenchStemmer for this filter.
- setStemmer(DutchStemmer) -
Method in class org.apache.lucene.analysis.nl.DutchStemFilter
- Set a alternative/custom DutchStemmer for this filter.
- setStemmer(RussianStemmer) -
Method in class org.apache.lucene.analysis.ru.RussianStemFilter
- Set a alternative/custom RussianStemmer for this filter.
- stem(String) -
Method in class org.apache.lucene.analysis.br.BrazilianStemmer
- Stemms the given term to an unique discriminator.
- stem(String) -
Method in class org.apache.lucene.analysis.de.GermanStemmer
- Stemms the given term to an unique discriminator.
- stem(String) -
Method in class org.apache.lucene.analysis.fr.FrenchStemmer
- Stemms the given term to a unique discriminator.
- stem(String) -
Method in class org.apache.lucene.analysis.nl.DutchStemmer
-
- STOP_WORDS -
Static variable in class org.apache.lucene.analysis.cjk.CJKAnalyzer
- An array containing some common English words that are not usually
useful for searching and some double-byte interpunctions.
- STOP_WORDS -
Static variable in class org.apache.lucene.analysis.cn.ChineseFilter
-
T
- ThaiAnalyzer - Class in org.apache.lucene.analysis.th
- Analyzer for Thai language.
- ThaiAnalyzer() -
Constructor for class org.apache.lucene.analysis.th.ThaiAnalyzer
-
- ThaiWordFilter - Class in org.apache.lucene.analysis.th
- TokenFilter that use java.text.BreakIterator to break each
Token that is Thai into separate Token(s) for each Thai word.
- ThaiWordFilter(TokenStream) -
Constructor for class org.apache.lucene.analysis.th.ThaiWordFilter
-
- TokenOffsetPayloadTokenFilter - Class in org.apache.lucene.analysis.payloads
- Adds the
Token.setStartOffset(int)
and Token.setEndOffset(int)
First 4 bytes are the start - TokenOffsetPayloadTokenFilter(TokenStream) -
Constructor for class org.apache.lucene.analysis.payloads.TokenOffsetPayloadTokenFilter
-
- TokenRangeSinkTokenizer - Class in org.apache.lucene.analysis.sinks
- Counts the tokens as they go by and saves to the internal list those between the range of lower and upper, exclusive of upper
- TokenRangeSinkTokenizer(int, int) -
Constructor for class org.apache.lucene.analysis.sinks.TokenRangeSinkTokenizer
-
- TokenRangeSinkTokenizer(int, int, int) -
Constructor for class org.apache.lucene.analysis.sinks.TokenRangeSinkTokenizer
-
- tokenStream(String, Reader) -
Method in class org.apache.lucene.analysis.br.BrazilianAnalyzer
- Creates a TokenStream which tokenizes all the text in the provided Reader.
- tokenStream(String, Reader) -
Method in class org.apache.lucene.analysis.cjk.CJKAnalyzer
- get token stream from input
- tokenStream(String, Reader) -
Method in class org.apache.lucene.analysis.cn.ChineseAnalyzer
- Creates a TokenStream which tokenizes all the text in the provided Reader.
- tokenStream(String, Reader) -
Method in class org.apache.lucene.analysis.cz.CzechAnalyzer
- Creates a TokenStream which tokenizes all the text in the provided Reader.
- tokenStream(String, Reader) -
Method in class org.apache.lucene.analysis.de.GermanAnalyzer
- Creates a TokenStream which tokenizes all the text in the provided Reader.
- tokenStream(String, Reader) -
Method in class org.apache.lucene.analysis.el.GreekAnalyzer
- Creates a TokenStream which tokenizes all the text in the provided Reader.
- tokenStream(String, Reader) -
Method in class org.apache.lucene.analysis.fr.FrenchAnalyzer
- Creates a TokenStream which tokenizes all the text in the provided Reader.
- tokenStream(String, Reader) -
Method in class org.apache.lucene.analysis.nl.DutchAnalyzer
- Creates a TokenStream which tokenizes all the text in the provided TextReader.
- tokenStream(String, Reader) -
Method in class org.apache.lucene.analysis.ru.RussianAnalyzer
- Creates a TokenStream which tokenizes all the text in the provided Reader.
- tokenStream(String, Reader) -
Method in class org.apache.lucene.analysis.th.ThaiAnalyzer
-
- TokenTypeSinkTokenizer - Class in org.apache.lucene.analysis.sinks
- If the
Token.type()
matches the passed in typeToMatch
then
add it to the sink - TokenTypeSinkTokenizer(String) -
Constructor for class org.apache.lucene.analysis.sinks.TokenTypeSinkTokenizer
-
- TokenTypeSinkTokenizer(int, String) -
Constructor for class org.apache.lucene.analysis.sinks.TokenTypeSinkTokenizer
-
- TokenTypeSinkTokenizer(List, String) -
Constructor for class org.apache.lucene.analysis.sinks.TokenTypeSinkTokenizer
-
- toLowerCase(char, char[]) -
Static method in class org.apache.lucene.analysis.el.GreekCharsets
-
- toLowerCase(char, char[]) -
Static method in class org.apache.lucene.analysis.ru.RussianCharsets
-
- TypeAsPayloadTokenFilter - Class in org.apache.lucene.analysis.payloads
- Makes the
Token.type()
a payload. - TypeAsPayloadTokenFilter(TokenStream) -
Constructor for class org.apache.lucene.analysis.payloads.TypeAsPayloadTokenFilter
-
U
- UnicodeGreek -
Static variable in class org.apache.lucene.analysis.el.GreekCharsets
-
- UnicodeRussian -
Static variable in class org.apache.lucene.analysis.ru.RussianCharsets
-
W
- WordlistLoader - Class in org.apache.lucene.analysis.nl
- Deprecated. use
WordlistLoader
instead - WordlistLoader() -
Constructor for class org.apache.lucene.analysis.nl.WordlistLoader
- Deprecated.
A B C D E F G I K L N O P R S T U W
Copyright © 2000-2008 Apache Software Foundation. All Rights Reserved.