public class WordTokenizer extends CharacterDelimitedTokenizer
-delimiters <value> The delimiters to use (default ' \r\n\t.,;:'"()?!').
| Constructor and Description |
|---|
WordTokenizer() |
| Modifier and Type | Method and Description |
|---|---|
java.lang.String |
getRevision()
Returns the revision string.
|
java.lang.String |
globalInfo()
Returns a string describing the stemmer
|
boolean |
hasMoreElements()
Tests if this enumeration contains more elements.
|
static void |
main(java.lang.String[] args)
Runs the tokenizer with the given options and strings to tokenize.
|
java.lang.Object |
nextElement()
Returns the next element of this enumeration if this enumeration object
has at least one more element to provide.
|
void |
tokenize(java.lang.String s)
Sets the string to tokenize.
|
delimitersTipText, getDelimiters, getOptions, listOptions, setDelimiters, setOptionsrunTokenizer, tokenizepublic java.lang.String globalInfo()
globalInfo in class Tokenizerpublic boolean hasMoreElements()
hasMoreElements in interface java.util.EnumerationhasMoreElements in class Tokenizerpublic java.lang.Object nextElement()
nextElement in interface java.util.EnumerationnextElement in class Tokenizerpublic void tokenize(java.lang.String s)
public java.lang.String getRevision()
public static void main(java.lang.String[] args)
args - the commandline options and strings to tokenize