Class that represents a set of email messages
94 | usefull to be returned by the TFReader class
95 |
96 | encapsulates a list of Email Messages to ease the
97 | processing of muliple messages to generate usefull structures
98 | like HashMaps of the occurrency of all the tokens in a message set
EmailDataset.getTotalTokenOcurr()
117 |
118 |
119 | This method returns a pair of HashMaps that map the tokens
120 | of the tagged messages to the total number of occurrencies
121 | in all the messages of the dataset
122 |
123 | TOKEN -> TOTAL OCURR
124 |
125 | First - spam token ocurrencies
126 | Second - ham token ocurrencies
EmailDataset.split()
132 |
133 |
134 | Method used to splin the dataset
135 | in a random fashion returning a pair of
136 | datasets each one with the same size
137 |
138 | (Math.random() <0.5)
85 | This API (Application Programming Interface) document has pages corresponding to the items in the navigation bar, described as follows.
86 | Package
87 |
88 |
89 |
90 | Each package has a page that contains a list of its classes and interfaces, with a summary for each. This page can contain four categories:
91 |
Interfaces (italic)
Classes
Enums
Exceptions
Errors
Annotation Types
92 |
93 |
94 | Class/Interface
95 |
96 |
97 |
98 | Each class, interface, nested class and nested interface has its own separate page. Each of these pages has three sections consisting of a class/interface description, summary tables, and detailed member descriptions:
99 |
Class inheritance diagram
Direct Subclasses
All Known Subinterfaces
All Known Implementing Classes
Class/interface declaration
Class/interface description
100 |
101 |
Nested Class Summary
Field Summary
Constructor Summary
Method Summary
102 |
103 |
Field Detail
Constructor Detail
Method Detail
104 | Each summary entry contains the first sentence from the detailed description for that item. The summary entries are alphabetical, while the detailed descriptions are in the order they appear in the source code. This preserves the logical groupings established by the programmer.
105 |
106 |
107 | Annotation Type
108 |
109 |
110 |
111 | Each annotation type has its own separate page with the following sections:
112 |
Annotation Type declaration
Annotation Type description
Required Element Summary
Optional Element Summary
Element Detail
113 |
114 |
115 |
116 | Enum
117 |
118 |
119 |
120 | Each enum has its own separate page with the following sections:
121 |
Enum declaration
Enum description
Enum Constant Summary
Enum Constant Detail
122 |
123 |
124 | Use
125 |
126 | Each documented package, class and interface has its own Use page. This page describes what packages, classes, methods, constructors and fields use any part of the given class or package. Given a class or interface A, its Use page includes subclasses of A, fields declared as A, methods that return A, and methods and constructors with parameters of type A. You can access this page by first going to the package, class or interface, then clicking on the "Use" link in the navigation bar.
127 |
128 | Tree (Class Hierarchy)
129 |
130 | There is a Class Hierarchy page for all packages, plus a hierarchy for each package. Each hierarchy page contains a list of classes and a list of interfaces. The classes are organized by inheritance structure starting with java.lang.Object. The interfaces do not inherit from java.lang.Object.
131 |
When viewing the Overview page, clicking on "Tree" displays the hierarchy for all packages.
When viewing a particular package, class or interface page, clicking "Tree" displays the hierarchy for only that package.
132 |
133 |
134 | Deprecated API
135 |
136 | The Deprecated API page lists all of the API that have been deprecated. A deprecated API is not recommended for use, generally due to improvements, and a replacement API is usually given. Deprecated APIs may be removed in future implementations.
137 |
138 | Index
139 |
140 | The Index contains an alphabetic list of all classes, interfaces, constructors, methods, and fields.
141 |
142 | Prev/Next
143 | These links take you to the next or previous class, interface, package, or related page.
144 | Frames/No Frames
145 | These links show and hide the HTML frames. All pages are available with or without frames.
146 |
147 |
148 | Serialized Form
149 | Each serializable or externalizable class has a description of its serialization fields and methods. This information is of interest to re-implementors, not to developers using the API. While there is no link in the navigation bar, you can get to this information by going to any serialized class and clicking "Serialized Form" in the "See also" section of the class description.
150 |
151 |
152 | Constant Field Values
153 | The Constant Field Values page lists the static final fields and their values.
154 |
155 |
156 |
157 | This help file applies to API documentation generated using the standard doclet.
158 |
159 |
160 |
read()
145 |
146 |
147 | Method used to read the tf file supplied in the constructor
148 | and return an EmailDataset object containing all the
149 | EmailMessages of the file
198 | public EmailDatasetread()
199 | throws java.io.FileNotFoundException
200 |
201 |
Method used to read the tf file supplied in the constructor
202 | and return an EmailDataset object containing all the
203 | EmailMessages of the file
204 |
205 |
206 |
207 |
Returns:
messages EmailDataset - the messages loaded into a dataset
208 |