public class WordTokenizer extends CharacterDelimitedTokenizer
-delimiters <value> The delimiters to use (default ' \r\n\t.,;:'"()?!').
Constructor and Description |
---|
WordTokenizer() |
Modifier and Type | Method and Description |
---|---|
java.lang.String |
getRevision()
Returns the revision string.
|
java.lang.String |
globalInfo()
Returns a string describing the stemmer
|
boolean |
hasMoreElements()
Tests if this enumeration contains more elements.
|
static void |
main(java.lang.String[] args)
Runs the tokenizer with the given options and strings to tokenize.
|
java.lang.Object |
nextElement()
Returns the next element of this enumeration if this enumeration object
has at least one more element to provide.
|
void |
tokenize(java.lang.String s)
Sets the string to tokenize.
|
delimitersTipText, getDelimiters, getOptions, listOptions, setDelimiters, setOptions
runTokenizer, tokenize
public java.lang.String globalInfo()
globalInfo
in class Tokenizer
public boolean hasMoreElements()
hasMoreElements
in interface java.util.Enumeration
hasMoreElements
in class Tokenizer
public java.lang.Object nextElement()
nextElement
in interface java.util.Enumeration
nextElement
in class Tokenizer
public void tokenize(java.lang.String s)
public java.lang.String getRevision()
public static void main(java.lang.String[] args)
args
- the commandline options and strings to tokenize