public class WordTokenizer extends CharacterDelimitedTokenizer
-delimiters <value> The delimiters to use (default ' \r\n\t.,;:'"()?!').
| Constructor and Description |
|---|
WordTokenizer() |
| Modifier and Type | Method and Description |
|---|---|
java.lang.String |
getRevision()
Returns the revision string.
|
java.lang.String |
globalInfo()
Returns a string describing the stemmer
|
boolean |
hasMoreElements()
Tests if this enumeration contains more elements.
|
static void |
main(java.lang.String[] args)
Runs the tokenizer with the given options and strings to tokenize.
|
java.lang.String |
nextElement()
Returns the next element of this enumeration if this enumeration object has
at least one more element to provide.
|
void |
tokenize(java.lang.String s)
Sets the string to tokenize.
|
delimitersTipText, getDelimiters, getOptions, listOptions, setDelimiters, setOptionsrunTokenizer, tokenizeequals, getClass, hashCode, notify, notifyAll, toString, wait, wait, waitmakeCopypublic java.lang.String globalInfo()
globalInfo in class Tokenizerpublic boolean hasMoreElements()
hasMoreElements in interface java.util.Enumeration<java.lang.String>hasMoreElements in class Tokenizerpublic java.lang.String nextElement()
nextElement in interface java.util.Enumeration<java.lang.String>nextElement in class Tokenizerpublic void tokenize(java.lang.String s)
public java.lang.String getRevision()
public static void main(java.lang.String[] args)
args - the commandline options and strings to tokenize