public class EM extends RandomizableDensityBasedClusterer implements NumberOfClustersRequestable, WeightedInstancesHandler
-N <num> number of clusters. If omitted or -1 specified, then cross validation is used to select the number of clusters.
-I <num> max iterations. (default 100)
-V verbose.
-M <num> minimum allowable standard deviation for normal density computation (default 1e-6)
-O Display model in old format (good when there are many clusters)
-S <num> Random number seed. (default 100)
| Constructor and Description |
|---|
EM()
Constructor.
|
| Modifier and Type | Method and Description |
|---|---|
void |
buildClusterer(Instances data)
Generates a clusterer.
|
double[] |
clusterPriors()
Returns the cluster priors.
|
java.lang.String |
debugTipText()
Returns the tip text for this property
|
java.lang.String |
displayModelInOldFormatTipText()
Returns the tip text for this property
|
Capabilities |
getCapabilities()
Returns default capabilities of the clusterer (i.e., the ones of
SimpleKMeans).
|
double[][][] |
getClusterModelsNumericAtts()
Return the normal distributions for the cluster models
|
double[] |
getClusterPriors()
Return the priors for the clusters
|
boolean |
getDebug()
Get debug mode
|
boolean |
getDisplayModelInOldFormat()
Get whether to display model output in the old, original
format.
|
int |
getMaxIterations()
Get the maximum number of iterations
|
double |
getMinStdDev()
Get the minimum allowable standard deviation.
|
int |
getNumClusters()
Get the number of clusters
|
java.lang.String[] |
getOptions()
Gets the current settings of EM.
|
java.lang.String |
getRevision()
Returns the revision string.
|
java.lang.String |
globalInfo()
Returns a string describing this clusterer
|
java.util.Enumeration |
listOptions()
Returns an enumeration describing the available options.
|
double[] |
logDensityPerClusterForInstance(Instance inst)
Computes the log of the conditional density (per cluster) for a given instance.
|
static void |
main(java.lang.String[] argv)
Main method for testing this class.
|
java.lang.String |
maxIterationsTipText()
Returns the tip text for this property
|
java.lang.String |
minStdDevTipText()
Returns the tip text for this property
|
int |
numberOfClusters()
Returns the number of clusters.
|
java.lang.String |
numClustersTipText()
Returns the tip text for this property
|
void |
setDebug(boolean v)
Set debug mode - verbose output
|
void |
setDisplayModelInOldFormat(boolean d)
Set whether to display model output in the old, original
format.
|
void |
setMaxIterations(int i)
Set the maximum number of iterations to perform
|
void |
setMinStdDev(double m)
Set the minimum value for standard deviation when calculating
normal density.
|
void |
setMinStdDevPerAtt(double[] m) |
void |
setNumClusters(int n)
Set the number of clusters (-1 to select by CV).
|
void |
setOptions(java.lang.String[] options)
Parses a given list of options.
|
java.lang.String |
toString()
Outputs the generated clusters into a string.
|
getSeed, seedTipText, setSeeddistributionForInstance, logDensityForInstance, logJointDensitiesForInstance, makeCopiesclusterInstance, forName, makeCopies, makeCopyequals, getClass, hashCode, notify, notifyAll, wait, wait, waitclusterInstancepublic java.lang.String globalInfo()
public java.util.Enumeration listOptions()
listOptions in interface OptionHandlerlistOptions in class RandomizableDensityBasedClustererpublic void setOptions(java.lang.String[] options)
throws java.lang.Exception
-N <num> number of clusters. If omitted or -1 specified, then cross validation is used to select the number of clusters.
-I <num> max iterations. (default 100)
-V verbose.
-M <num> minimum allowable standard deviation for normal density computation (default 1e-6)
-O Display model in old format (good when there are many clusters)
-S <num> Random number seed. (default 100)
setOptions in interface OptionHandlersetOptions in class RandomizableDensityBasedClustereroptions - the list of options as an array of stringsjava.lang.Exception - if an option is not supportedpublic java.lang.String displayModelInOldFormatTipText()
public void setDisplayModelInOldFormat(boolean d)
d - true if model ouput is to be shown in the old formatpublic boolean getDisplayModelInOldFormat()
public java.lang.String minStdDevTipText()
public void setMinStdDev(double m)
m - minimum value for standard deviationpublic void setMinStdDevPerAtt(double[] m)
public double getMinStdDev()
public java.lang.String numClustersTipText()
public void setNumClusters(int n)
throws java.lang.Exception
setNumClusters in interface NumberOfClustersRequestablen - the number of clustersjava.lang.Exception - if n is 0public int getNumClusters()
public java.lang.String maxIterationsTipText()
public void setMaxIterations(int i)
throws java.lang.Exception
i - the number of iterationsjava.lang.Exception - if i is less than 1public int getMaxIterations()
public java.lang.String debugTipText()
public void setDebug(boolean v)
v - true for verbose outputpublic boolean getDebug()
public java.lang.String[] getOptions()
getOptions in interface OptionHandlergetOptions in class RandomizableDensityBasedClustererpublic double[][][] getClusterModelsNumericAtts()
double[][][] valuepublic double[] getClusterPriors()
double[] valuepublic java.lang.String toString()
toString in class java.lang.Objectpublic int numberOfClusters()
throws java.lang.Exception
numberOfClusters in interface ClusterernumberOfClusters in class AbstractClustererjava.lang.Exception - if number of clusters could not be returned
successfullypublic Capabilities getCapabilities()
getCapabilities in interface ClusterergetCapabilities in interface CapabilitiesHandlergetCapabilities in class AbstractClustererCapabilitiespublic void buildClusterer(Instances data) throws java.lang.Exception
buildClusterer in interface ClustererbuildClusterer in class AbstractClustererdata - set of instances serving as training datajava.lang.Exception - if the clusterer has not been
generated successfullypublic double[] clusterPriors()
clusterPriors in interface DensityBasedClustererclusterPriors in class AbstractDensityBasedClustererpublic double[] logDensityPerClusterForInstance(Instance inst) throws java.lang.Exception
logDensityPerClusterForInstance in interface DensityBasedClustererlogDensityPerClusterForInstance in class AbstractDensityBasedClustererinst - the instance to compute the density forjava.lang.Exception - if the density could not be computed
successfullypublic java.lang.String getRevision()
getRevision in interface RevisionHandlergetRevision in class AbstractClustererpublic static void main(java.lang.String[] argv)
argv - should contain the following arguments: -t training file [-T test file] [-N number of clusters] [-S random seed]