public class LatentSemanticAnalysis extends UnsupervisedAttributeEvaluator implements AttributeTransformer, OptionHandler
-N Normalize input data.
-R Rank approximation used in LSA. May be actual number of LSA attributes to include (if greater than 1) or a proportion of total singular values to account for (if between 0 and 1). A value less than or equal to zero means use all latent variables. (default = 0.95)
-A Maximum number of attributes to include in transformed attribute names. (-1 = include all)
| Constructor and Description |
|---|
LatentSemanticAnalysis() |
| Modifier and Type | Method and Description |
|---|---|
void |
buildEvaluator(Instances data)
Initializes the singular values/vectors and performs the analysis
|
Instance |
convertInstance(Instance instance)
Transform an instance in original (unnormalized) format
|
double |
evaluateAttribute(int att)
Evaluates the merit of a transformed attribute.
|
Capabilities |
getCapabilities()
Returns the capabilities of this evaluator.
|
int |
getMaximumAttributeNames()
Gets maximum number of attributes to include in
transformed attribute names.
|
boolean |
getNormalize()
Gets whether or not input data is to be normalized
|
java.lang.String[] |
getOptions()
Gets the current settings of LatentSemanticAnalysis
|
double |
getRank()
Gets the desired matrix rank (or coverage proportion) for feature-space reduction
|
java.lang.String |
getRevision()
Returns the revision string.
|
java.lang.String |
globalInfo()
Returns a string describing this attribute transformer
|
java.util.Enumeration |
listOptions()
Returns an enumeration describing the available options.
|
static void |
main(java.lang.String[] argv)
Main method for testing this class
|
java.lang.String |
maximumAttributeNamesTipText()
Returns the tip text for this property
|
java.lang.String |
normalizeTipText()
Returns the tip text for this property
|
java.lang.String |
rankTipText()
Returns the tip text for this property
|
void |
setMaximumAttributeNames(int newMaxAttributes)
Sets maximum number of attributes to include in
transformed attribute names.
|
void |
setNormalize(boolean newNormalize)
Set whether input data will be normalized.
|
void |
setOptions(java.lang.String[] options)
Parses a given list of options.
|
void |
setRank(double newRank)
Sets the desired matrix rank (or coverage proportion) for feature-space reduction
|
java.lang.String |
toString()
Returns a description of this attribute transformer
|
Instances |
transformedData(Instances data)
Transform the supplied data set (assumed to be the same format
as the training data)
|
Instances |
transformedHeader()
Returns just the header for the transformed data (ie.
|
clean, forName, makeCopies, postProcesspublic java.lang.String globalInfo()
public java.util.Enumeration listOptions()
listOptions in interface OptionHandlerpublic void setOptions(java.lang.String[] options)
throws java.lang.Exception
-N Normalize input data.
-R Rank approximation used in LSA. May be actual number of LSA attributes to include (if greater than 1) or a proportion of total singular values to account for (if between 0 and 1). A value less than or equal to zero means use all latent variables. (default = 0.95)
-A Maximum number of attributes to include in transformed attribute names. (-1 = include all)
setOptions in interface OptionHandleroptions - the list of options as an array of stringsjava.lang.Exception - if an option is not supportedpublic java.lang.String normalizeTipText()
public void setNormalize(boolean newNormalize)
newNormalize - true if input data is to be normalizedpublic boolean getNormalize()
public java.lang.String rankTipText()
public void setRank(double newRank)
newRank - the desired rank (or coverage) for feature-space reductionpublic double getRank()
public java.lang.String maximumAttributeNamesTipText()
public void setMaximumAttributeNames(int newMaxAttributes)
newMaxAttributes - the maximum number of attributespublic int getMaximumAttributeNames()
public java.lang.String[] getOptions()
getOptions in interface OptionHandlerpublic Capabilities getCapabilities()
getCapabilities in interface CapabilitiesHandlergetCapabilities in class ASEvaluationCapabilitiespublic void buildEvaluator(Instances data) throws java.lang.Exception
buildEvaluator in class ASEvaluationdata - the instances to analyse/transformjava.lang.Exception - if analysis failspublic Instances transformedHeader() throws java.lang.Exception
transformedHeader in interface AttributeTransformerjava.lang.Exception - if the header of the transformed data can't
be determined.public Instances transformedData(Instances data) throws java.lang.Exception
transformedData in interface AttributeTransformerjava.lang.Exception - if transformed data can't be returnedpublic double evaluateAttribute(int att)
throws java.lang.Exception
evaluateAttribute in interface AttributeEvaluatoratt - the attribute to be evaluatedjava.lang.Exception - if attribute can't be evaluatedpublic Instance convertInstance(Instance instance) throws java.lang.Exception
convertInstance in interface AttributeTransformerinstance - an instance in the original (unnormalized) formatjava.lang.Exception - if instance can't be transformedpublic java.lang.String toString()
toString in class java.lang.Objectpublic java.lang.String getRevision()
getRevision in interface RevisionHandlergetRevision in class ASEvaluationpublic static void main(java.lang.String[] argv)
argv - should contain the command line arguments to the
evaluator/transformer (see AttributeSelection)