public class WordTokenizer extends CharacterDelimitedTokenizer
Valid options are:
-delimiters <value>
 The delimiters to use (default ' \r\n\t.,;:'"()?!').
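To make the default delimiter set concrete, here is a minimal sketch of splitting a string on those characters. It is a stand-in built on `java.util.StringTokenizer`, not the Weka class itself; the class name `DelimiterSketch` and the helper `split` are hypothetical and exist only for illustration.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.StringTokenizer;

// Stand-in sketch: NOT the Weka class. It only illustrates splitting on the
// default delimiter set listed in the option description.
class DelimiterSketch {
    // Default delimiters from the option description: space, CR, LF, tab,
    // and the punctuation . , ; : ' " ( ) ? !
    static final String DEFAULT_DELIMITERS = " \r\n\t.,;:'\"()?!";

    // Hypothetical helper: every character in `delimiters` separates tokens;
    // runs of adjacent delimiters produce no empty tokens.
    static List<String> split(String text, String delimiters) {
        List<String> tokens = new ArrayList<>();
        StringTokenizer st = new StringTokenizer(text, delimiters);
        while (st.hasMoreTokens()) {
            tokens.add(st.nextToken());
        }
        return tokens;
    }

    public static void main(String[] args) {
        System.out.println(split("Hello, world! It's (fine).", DEFAULT_DELIMITERS));
        // prints [Hello, world, It, s, fine]
    }
}
```

Because the apostrophe is in the default set, a contraction such as "It's" splits into two tokens. Passing a different delimiter string changes that, which is presumably what the `-delimiters` option is for.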
Constructor and Description |
---|
WordTokenizer() |
Modifier and Type | Method and Description |
---|---|
java.lang.String | getRevision(): Returns the revision string. |
java.lang.String | globalInfo(): Returns a string describing the tokenizer. |
boolean | hasMoreElements(): Tests if this enumeration contains more elements. |
static void | main(java.lang.String[] args): Runs the tokenizer with the given options and strings to tokenize. |
java.lang.String | nextElement(): Returns the next element of this enumeration if this enumeration object has at least one more element to provide. |
void | tokenize(java.lang.String s): Sets the string to tokenize. |
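Methods inherited from class CharacterDelimitedTokenizer:
delimitersTipText, getDelimiters, getOptions, listOptions, setDelimiters, setOptions
Methods inherited from class Tokenizer:
runTokenizer, tokenize
Methods inherited from class java.lang.Object:
equals, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait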
Methods inherited from interface OptionHandler:
makeCopy
public java.lang.String globalInfo()
Specified by: globalInfo in class Tokenizer

public boolean hasMoreElements()
Specified by: hasMoreElements in interface java.util.Enumeration<java.lang.String>
Specified by: hasMoreElements in class Tokenizer

public java.lang.String nextElement()
Specified by: nextElement in interface java.util.Enumeration<java.lang.String>
Specified by: nextElement in class Tokenizer

public void tokenize(java.lang.String s)

public java.lang.String getRevision()

public static void main(java.lang.String[] args)
Parameters: args - the command-line options and strings to tokenize
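
The three instance methods above form a simple protocol: call `tokenize` to set the input, then drain tokens with `hasMoreElements` and `nextElement`. The sketch below mirrors that call order with a self-contained stand-in class. `EnumerationStyleTokenizer` is hypothetical and backed by `java.util.StringTokenizer`. It copies only the documented method names and signatures, and makes no claim about how Weka implements them.

```java
import java.util.ArrayList;
import java.util.Enumeration;
import java.util.List;
import java.util.StringTokenizer;

// Hypothetical stand-in that mirrors the documented method names only.
class EnumerationStyleTokenizer implements Enumeration<String> {
    private final String delimiters;
    // Starts empty so hasMoreElements() is false before tokenize() is called.
    private StringTokenizer current = new StringTokenizer("");

    EnumerationStyleTokenizer(String delimiters) {
        this.delimiters = delimiters;
    }

    // Sets the string to tokenize, discarding unread tokens from any earlier call.
    public void tokenize(String s) {
        current = new StringTokenizer(s, delimiters);
    }

    @Override
    public boolean hasMoreElements() {
        return current.hasMoreTokens();
    }

    // Throws java.util.NoSuchElementException once the input is exhausted.
    @Override
    public String nextElement() {
        return current.nextToken();
    }

    // Typical drain loop: set the input, then read until no tokens remain.
    static List<String> drain(EnumerationStyleTokenizer t, String input) {
        List<String> tokens = new ArrayList<>();
        t.tokenize(input);
        while (t.hasMoreElements()) {
            tokens.add(t.nextElement());
        }
        return tokens;
    }
}
```

With Weka on the classpath, the same loop should apply to a `WordTokenizer` instance, since it exposes the same three methods. One instance can be reused by calling `tokenize` again with a new string.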