public class ReservoirSample extends Filter implements UnsupervisedFilter, OptionHandler, StreamableFilter, Randomizable, WeightedAttributesHandler
@article{Vitter1985, author = {J. S. Vitter}, journal = {ACM Transactions on Mathematical Software}, number = {1} volume = {11} pages = {37-57}, title = {Random Sampling with a Reservoir}, year = {1985} }Valid options are:
-S <num> Specify the random number seed (default 1)
-Z <num> The size of the output dataset - number of instances (default 100)
Constructor and Description |
---|
ReservoirSample() |
Modifier and Type | Method and Description |
---|---|
boolean |
batchFinished()
Signify that this batch of input to the filter is finished.
|
Capabilities |
getCapabilities()
Returns the Capabilities of this filter.
|
java.lang.String[] |
getOptions()
Gets the current settings of the filter.
|
int |
getRandomSeed()
Gets the random number seed.
|
java.lang.String |
getRevision()
Returns the revision string.
|
int |
getSampleSize()
Gets the subsample size.
|
int |
getSeed()
Gets the seed for the random number generations
|
java.lang.String |
globalInfo()
Returns a string describing this filter
|
boolean |
input(Instance instance)
Input an instance for filtering.
|
java.util.Enumeration<Option> |
listOptions()
Returns an enumeration describing the available options.
|
static void |
main(java.lang.String[] argv)
Main method for testing this class.
|
java.lang.String |
randomSeedTipText()
Returns the tip text for this property
|
java.lang.String |
sampleSizeTipText()
Returns the tip text for this property
|
boolean |
setInputFormat(Instances instanceInfo)
Sets the format of the input instances.
|
void |
setOptions(java.lang.String[] options)
Parses a given list of options.
|
void |
setRandomSeed(int newSeed)
Sets the random number seed.
|
void |
setSampleSize(int newSampleSize)
Sets the size of the subsample.
|
void |
setSeed(int seed)
Set the seed for random number generation.
|
batchFilterFile, debugTipText, doNotCheckCapabilitiesTipText, filterFile, getCapabilities, getCopyOfInputFormat, getDebug, getDoNotCheckCapabilities, getOutputFormat, isFirstBatchDone, isNewBatch, isOutputFormatDefined, makeCopies, makeCopy, mayRemoveInstanceAfterFirstBatchDone, numPendingOutput, output, outputPeek, postExecution, preExecution, run, runFilter, setDebug, setDoNotCheckCapabilities, toString, useFilter, wekaStaticWrapper
equals, getClass, hashCode, notify, notifyAll, wait, wait, wait
makeCopy
public java.lang.String globalInfo()
public java.util.Enumeration<Option> listOptions()
listOptions
in interface OptionHandler
listOptions
in class Filter
public void setOptions(java.lang.String[] options) throws java.lang.Exception
-S <num> Specify the random number seed (default 1)
-Z <num> The size of the output dataset - number of instances (default 100)
setOptions
in interface OptionHandler
setOptions
in class Filter
options
- the list of options as an array of stringsjava.lang.Exception
- if an option is not supportedpublic java.lang.String[] getOptions()
getOptions
in interface OptionHandler
getOptions
in class Filter
public java.lang.String randomSeedTipText()
public int getRandomSeed()
public void setRandomSeed(int newSeed)
newSeed
- the new random number seed.@ProgrammaticProperty public void setSeed(int seed)
Randomizable
setSeed
in interface Randomizable
seed
- the seed@ProgrammaticProperty public int getSeed()
Randomizable
getSeed
in interface Randomizable
public java.lang.String sampleSizeTipText()
public int getSampleSize()
public void setSampleSize(int newSampleSize)
newSampleSize
- size of the subsample.public Capabilities getCapabilities()
getCapabilities
in interface CapabilitiesHandler
getCapabilities
in class Filter
Capabilities
public boolean setInputFormat(Instances instanceInfo) throws java.lang.Exception
setInputFormat
in class Filter
instanceInfo
- an Instances object containing the input instance
structure (any instances contained in the object are ignored -
only the structure is required).java.lang.Exception
- if the input format can't be set successfullypublic boolean input(Instance instance)
public boolean batchFinished()
batchFinished
in class Filter
java.lang.IllegalStateException
- if no input structure has been definedpublic java.lang.String getRevision()
getRevision
in interface RevisionHandler
getRevision
in class Filter
public static void main(java.lang.String[] argv)
argv
- should contain arguments to the filter: use -h for help