public class CSVLoader extends AbstractFileLoader implements BatchConverter, OptionHandler
-N <range> The range of attributes to force type to be NOMINAL. 'first' and 'last' are accepted as well. Examples: "first-last", "1,4,5-27,50-last" (default: -none-)
-S <range> The range of attribute to force type to be STRING. 'first' and 'last' are accepted as well. Examples: "first-last", "1,4,5-27,50-last" (default: -none-)
-D <range> The range of attribute to force type to be DATE. 'first' and 'last' are accepted as well. Examples: "first-last", "1,4,5-27,50-last" (default: -none-)
-format <date format> The date formatting string to use to parse date values. (default: "yyyy-MM-dd'T'HH:mm:ss")
-M <str> The string representing a missing value. (default: ?)
-E <enclosures> The enclosure character(s) to use for strings. Specify as a comma separated list (e.g. ",' (default: '"')
Loader
,
Serialized FormModifier and Type | Field and Description |
---|---|
static java.lang.String |
FILE_EXTENSION
the file extension.
|
FILE_EXTENSION_COMPRESSED
BATCH, INCREMENTAL, NONE
Constructor and Description |
---|
CSVLoader()
default constructor.
|
Modifier and Type | Method and Description |
---|---|
java.lang.String |
dateAttributesTipText()
Returns the tip text for this property.
|
java.lang.String |
dateFormatTipText()
Returns the tip text for this property.
|
java.lang.String |
enclosureCharactersTipText()
Returns the tip text for this property.
|
Instances |
getDataSet()
Return the full data set.
|
java.lang.String |
getDateAttributes()
Returns the current attribute range to be forced to type date.
|
java.lang.String |
getDateFormat()
Get the format to use for parsing date values.
|
java.lang.String |
getEnclosureCharacters()
Get the character(s) to use/recognize as string enclosures
|
java.lang.String |
getFileDescription()
Returns a description of the file type.
|
java.lang.String |
getFileExtension()
Get the file extension used for arff files.
|
java.lang.String[] |
getFileExtensions()
Gets all the file extensions used for this type of file.
|
java.lang.String |
getMissingValue()
Returns the current placeholder for missing values.
|
Instance |
getNextInstance(Instances structure)
CSVLoader is unable to process a data set incrementally.
|
java.lang.String |
getNominalAttributes()
Returns the current attribute range to be forced to type nominal.
|
java.lang.String[] |
getOptions()
Gets the current settings of the Classifier.
|
java.lang.String |
getRevision()
Returns the revision string.
|
java.lang.String |
getStringAttributes()
Returns the current attribute range to be forced to type string.
|
Instances |
getStructure()
Determines and returns (if possible) the structure (internally the header)
of the data set as an empty set of instances.
|
java.lang.String |
globalInfo()
Returns a string describing this attribute evaluator.
|
java.util.Enumeration |
listOptions()
Returns an enumeration describing the available options.
|
static void |
main(java.lang.String[] args)
Main method.
|
java.lang.String |
missingValueTipText()
Returns the tip text for this property.
|
java.lang.String |
nominalAttributesTipText()
Returns the tip text for this property.
|
void |
reset()
Resets the Loader ready to read a new data set or the same data set again.
|
void |
setDateAttributes(java.lang.String value)
Set the attribute range to be forced to type date.
|
void |
setDateFormat(java.lang.String value)
Set the format to use for parsing date values.
|
void |
setEnclosureCharacters(java.lang.String enclosure)
Set the character(s) to use/recognize as string enclosures
|
void |
setMissingValue(java.lang.String value)
Sets the placeholder for missing values.
|
void |
setNominalAttributes(java.lang.String value)
Sets the attribute range to be forced to type nominal.
|
void |
setOptions(java.lang.String[] options)
Parses a given list of options.
|
void |
setSource(java.io.File file)
Resets the Loader object and sets the source of the data set to be the
supplied File object.
|
void |
setSource(java.io.InputStream input)
Resets the Loader object and sets the source of the data set to be the
supplied Stream object.
|
void |
setStringAttributes(java.lang.String value)
Sets the attribute range to be forced to type string.
|
java.lang.String |
stringAttributesTipText()
Returns the tip text for this property.
|
getUseRelativePath, retrieveFile, runFileLoader, setEnvironment, setFile, setUseRelativePath, useRelativePathTipText
setRetrieval
public java.lang.String getFileExtension()
getFileExtension
in interface FileSourcedConverter
public java.lang.String getFileDescription()
getFileDescription
in interface FileSourcedConverter
public java.lang.String[] getFileExtensions()
getFileExtensions
in interface FileSourcedConverter
public java.lang.String globalInfo()
public java.util.Enumeration listOptions()
listOptions
in interface OptionHandler
public void setOptions(java.lang.String[] options) throws java.lang.Exception
-N <range> The range of attributes to force type to be NOMINAL. 'first' and 'last' are accepted as well. Examples: "first-last", "1,4,5-27,50-last" (default: -none-)
-S <range> The range of attribute to force type to be STRING. 'first' and 'last' are accepted as well. Examples: "first-last", "1,4,5-27,50-last" (default: -none-)
-D <range> The range of attribute to force type to be DATE. 'first' and 'last' are accepted as well. Examples: "first-last", "1,4,5-27,50-last" (default: -none-)
-format <date format> The date formatting string to use to parse date values. (default: "yyyy-MM-dd'T'HH:mm:ss")
-M <str> The string representing a missing value. (default: ?)
-E <enclosures> The enclosure character(s) to use for strings. Specify as a comma separated list (e.g. ",' (default: '"')
setOptions
in interface OptionHandler
options
- the list of options as an array of stringsjava.lang.Exception
- if an option is not supportedpublic java.lang.String[] getOptions()
getOptions
in interface OptionHandler
public void setNominalAttributes(java.lang.String value)
value
- the rangepublic java.lang.String getNominalAttributes()
public java.lang.String nominalAttributesTipText()
public void setStringAttributes(java.lang.String value)
value
- the rangepublic java.lang.String getStringAttributes()
public java.lang.String stringAttributesTipText()
public void setDateAttributes(java.lang.String value)
value
- the rangepublic java.lang.String getDateAttributes()
public java.lang.String dateAttributesTipText()
public void setDateFormat(java.lang.String value)
value
- the format to use.public java.lang.String getDateFormat()
public java.lang.String dateFormatTipText()
public java.lang.String enclosureCharactersTipText()
public void setEnclosureCharacters(java.lang.String enclosure)
enclosure
- the characters to use as string enclosurespublic java.lang.String getEnclosureCharacters()
public void setMissingValue(java.lang.String value)
value
- the placeholderpublic java.lang.String getMissingValue()
public java.lang.String missingValueTipText()
public void setSource(java.io.InputStream input) throws java.io.IOException
setSource
in interface Loader
setSource
in class AbstractLoader
input
- the input streamjava.io.IOException
- if an error occurspublic void setSource(java.io.File file) throws java.io.IOException
setSource
in interface Loader
setSource
in class AbstractFileLoader
file
- the source file.java.io.IOException
- if an error occurspublic Instances getStructure() throws java.io.IOException
getStructure
in interface Loader
getStructure
in class AbstractLoader
java.io.IOException
- if an error occurspublic Instances getDataSet() throws java.io.IOException
getDataSet
in interface Loader
getDataSet
in class AbstractLoader
java.io.IOException
- if there is no source or parsing failspublic Instance getNextInstance(Instances structure) throws java.io.IOException
getNextInstance
in interface Loader
getNextInstance
in class AbstractLoader
structure
- ignoredjava.io.IOException
- always. CSVLoader is unable to process a data set
incrementally.public void reset() throws java.io.IOException
reset
in interface Loader
reset
in class AbstractFileLoader
java.io.IOException
- if something goes wrongpublic java.lang.String getRevision()
getRevision
in interface RevisionHandler
public static void main(java.lang.String[] args)
args
- should contain the name of an input file.