java.lang.Object
  weka.core.converters.AbstractLoader
    weka.core.converters.TextDirectoryLoader
public class TextDirectoryLoader
Loads all text files in a directory and uses the subdirectory names as class labels. The content of the text files will be stored in a String attribute, the filename can be stored as well.
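The directory convention described above can be illustrated with plain Java. The following is a minimal sketch, not Weka's implementation: the class `DirectoryLayoutSketch` and its `Doc` type are hypothetical names introduced here, and UTF-8 is assumed for reading. It only shows how each immediate subdirectory name becomes a class label while each file's content and filename are kept as strings.

```java
import java.io.IOException;
import java.nio.charset.StandardCharsets;
import java.nio.file.DirectoryStream;
import java.nio.file.Files;
import java.nio.file.Path;
import java.util.ArrayList;
import java.util.List;

// Hypothetical illustration of the directory layout; this is not Weka code.
public class DirectoryLayoutSketch {

    /** One loaded document: text content, filename, and class label. */
    public static final class Doc {
        public final String text;
        public final String filename;
        public final String label;

        Doc(String text, String filename, String label) {
            this.text = text;
            this.filename = filename;
            this.label = label;
        }
    }

    /** Reads root/<label>/<file> and labels each file with its subdirectory name. */
    public static List<Doc> load(Path root) throws IOException {
        List<Doc> docs = new ArrayList<>();
        // Every immediate subdirectory of root is treated as one class label.
        try (DirectoryStream<Path> classDirs = Files.newDirectoryStream(root, Files::isDirectory)) {
            for (Path classDir : classDirs) {
                String label = classDir.getFileName().toString();
                // Every regular file inside that subdirectory is one document.
                try (DirectoryStream<Path> files = Files.newDirectoryStream(classDir, Files::isRegularFile)) {
                    for (Path file : files) {
                        // Assumption: files are UTF-8 encoded.
                        String text = new String(Files.readAllBytes(file), StandardCharsets.UTF_8);
                        docs.add(new Doc(text, file.getFileName().toString(), label));
                    }
                }
            }
        }
        return docs;
    }
}
```

With a layout such as `corpus/pos/a.txt` and `corpus/neg/b.txt`, `load` would yield two documents labelled `pos` and `neg`. The iteration order of a directory stream is not guaranteed.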
Valid options are:
-D Enables debug output. (default: off)
-F Stores the filename in an additional attribute. (default: off)
-dir <directory> The directory to work on. (default: current directory)
Based on code from the TextDirectoryToArff tool.
See Also: Loader, Serialized Form
Field Summary |
---|
Fields inherited from interface weka.core.converters.Loader |
---|
BATCH, INCREMENTAL, NONE |
Constructor Summary | |
---|---|
TextDirectoryLoader() | Default constructor. |
Method Summary | |
---|---|
java.lang.String | charSetTipText(): The tip text for this property. |
java.lang.String | debugTipText(): The tip text for this property. |
java.lang.String | getCharSet(): Gets the character set to use when reading text files. |
Instances | getDataSet(): Returns the full data set. |
boolean | getDebug(): Gets whether additional debug information is printed. |
java.io.File | getDirectory(): Gets the directory specified as the source. |
java.lang.String | getFileDescription(): Returns a description of the file type (here, directories). |
Instance | getNextInstance(Instances structure): TextDirectoryLoader is unable to process a data set incrementally. |
java.lang.String[] | getOptions(): Gets the current option settings. |
boolean | getOutputFilename(): Gets whether the filename will be stored as an extra attribute. |
java.lang.String | getRevision(): Returns the revision string. |
Instances | getStructure(): Determines and returns (if possible) the structure (internally the header) of the data set as an empty set of instances. |
java.lang.String | globalInfo(): Returns a string describing this loader. |
java.util.Enumeration | listOptions(): Lists the available options. |
static void | main(java.lang.String[] args): Main method. |
java.lang.String | outputFilenameTipText(): The tip text for this property. |
void | reset(): Resets the loader, ready to read a new data set. |
void | setCharSet(java.lang.String charSet): Sets the character set to use when reading text files (an empty string indicates that the default character set will be used). |
void | setDebug(boolean value): Sets whether to print some debug information. |
void | setDirectory(java.io.File dir): Sets the source directory. |
void | setOptions(java.lang.String[] options): Parses a given list of options. |
void | setOutputFilename(boolean value): Sets whether the filename will be stored as an extra attribute. |
void | setSource(java.io.File dir): Resets the Loader object and sets the source of the data set to be the supplied File object. |
Methods inherited from class weka.core.converters.AbstractLoader |
---|
setRetrieval, setSource |
Methods inherited from class java.lang.Object |
---|
equals, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait |
Constructor Detail |
---|
public TextDirectoryLoader()
Method Detail |
---|
public java.lang.String globalInfo()
public java.util.Enumeration listOptions()
listOptions in interface OptionHandler
public void setOptions(java.lang.String[] options) throws java.lang.Exception
-D Enables debug output. (default: off)
-F Stores the filename in an additional attribute. (default: off)
-dir <directory> The directory to work on. (default: current directory)
setOptions in interface OptionHandler
Parameters: options - the options
Throws: java.lang.Exception - if options cannot be set
public java.lang.String[] getOptions()
getOptions in interface OptionHandler
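To make the option semantics concrete, here is a small stand-alone sketch of how an options array with `-D`, `-F` and `-dir <directory>` could map to settings and back. It is an illustration only: `LoaderOptionsSketch` is a hypothetical class, it does not call Weka, and rejecting unknown options is a choice made for this sketch rather than documented behaviour of `setOptions`.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical stand-in for the -D / -F / -dir options; this is not Weka code.
public class LoaderOptionsSketch {
    public boolean debug = false;           // -D (default: off)
    public boolean outputFilename = false;  // -F (default: off)
    public String directory = ".";          // -dir (default: current directory)

    /** Parses an options array in the style accepted by setOptions. */
    public static LoaderOptionsSketch parse(String[] options) {
        LoaderOptionsSketch s = new LoaderOptionsSketch();
        for (int i = 0; i < options.length; i++) {
            switch (options[i]) {
                case "-D":
                    s.debug = true;
                    break;
                case "-F":
                    s.outputFilename = true;
                    break;
                case "-dir":
                    if (i + 1 >= options.length) {
                        throw new IllegalArgumentException("-dir requires a directory argument");
                    }
                    s.directory = options[++i];
                    break;
                default:
                    // Assumption of this sketch: unknown options are rejected.
                    throw new IllegalArgumentException("Unknown option: " + options[i]);
            }
        }
        return s;
    }

    /** Produces an options array in the style returned by getOptions. */
    public String[] toOptions() {
        List<String> out = new ArrayList<>();
        if (debug) out.add("-D");
        if (outputFilename) out.add("-F");
        out.add("-dir");
        out.add(directory);
        return out.toArray(new String[0]);
    }
}
```

Parsing `{"-F", "-dir", "corpus"}` and calling `toOptions()` gives back the same three entries, which mirrors the round trip one would expect between `setOptions` and `getOptions`.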
public java.lang.String charSetTipText()
public void setCharSet(java.lang.String charSet)
Parameters: charSet - the character set to use.
public java.lang.String getCharSet()
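The character-set setting treats an empty string as "use the default character set". A minimal sketch of that rule with the standard `java.nio.charset.Charset` API follows. `CharSetSketch` and its methods are hypothetical names, and how the loader resolves the name internally is an assumption here.

```java
import java.io.IOException;
import java.nio.charset.Charset;
import java.nio.file.Files;
import java.nio.file.Path;

// Hypothetical helper showing the "empty string means default" rule; not Weka code.
public class CharSetSketch {

    /** Maps a character-set name to a Charset; empty or null selects the platform default. */
    public static Charset resolve(String charSet) {
        if (charSet == null || charSet.isEmpty()) {
            return Charset.defaultCharset();
        }
        // Throws an unchecked exception if the name is illegal or unsupported.
        return Charset.forName(charSet);
    }

    /** Reads a whole text file using the resolved character set. */
    public static String readText(Path file, String charSet) throws IOException {
        return new String(Files.readAllBytes(file), resolve(charSet));
    }
}
```

For example, `readText(path, "ISO-8859-1")` decodes the file as Latin-1, while `readText(path, "")` falls back to the platform default.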
public void setDebug(boolean value)
Parameters: value - if true additional debug information will be printed.
public boolean getDebug()
public java.lang.String debugTipText()
public void setOutputFilename(boolean value)
Parameters: value - if true the filename will be stored in an extra attribute
public boolean getOutputFilename()
public java.lang.String outputFilenameTipText()
public java.lang.String getFileDescription()
public java.io.File getDirectory()
public void setDirectory(java.io.File dir) throws java.io.IOException
Parameters: dir - the source directory
Throws: java.io.IOException - if an error occurs
public void reset()
reset in interface Loader
reset in class AbstractLoader
public void setSource(java.io.File dir) throws java.io.IOException
setSource in interface Loader
setSource in class AbstractLoader
Parameters: dir - the source directory.
Throws: java.io.IOException - if an error occurs
public Instances getStructure() throws java.io.IOException
getStructure in interface Loader
getStructure in class AbstractLoader
Throws: java.io.IOException - if an error occurs
public Instances getDataSet() throws java.io.IOException
getDataSet in interface Loader
getDataSet in class AbstractLoader
Throws: java.io.IOException - if there is no source or parsing fails
public Instance getNextInstance(Instances structure) throws java.io.IOException
getNextInstance in interface Loader
getNextInstance in class AbstractLoader
Parameters: structure - ignored
Throws: java.io.IOException - always. TextDirectoryLoader is unable to process a data set incrementally.
public java.lang.String getRevision()
getRevision in interface RevisionHandler
public static void main(java.lang.String[] args)
Parameters: args - should contain the name of an input file.