TY - JOUR
T1 - One-class document classification via Neural Networks
AU - Manevitz, Larry
AU - Yousef, Malik
N1 - Funding Information:
This work was partially supported by HIACS, the Haifa Interdisciplinary Center for Advanced Computer Science. This work forms part of the doctoral thesis of the second author who was supported by a University of Haifa fellowship during his studies, and afterwards hosted by the Neurocomputation Laboratory situated in the Caesarea Rothschild Institute for Interdisciplinary Computer Science. We thank Nathalie Japkowicz for lending us some Matlab software which was used during a revision of this paper. The first author thanks Oxford University for its hospitality during his sabbatical visit.
Funding Information:
This work was partially supported by HIACS, the Haifa University Interdisciplinary Center for Advanced Computer Science, and the Neurocomputation Laboratory located at the Caesarea Rothschild Institute for Interdisciplinary Computer Science.
PY - 2007/3
Y1 - 2007/3
N2 - Automated document retrieval and classification is of central importance in many contexts; our main motivating goal is the efficient classification and retrieval of "interests" on the internet when only positive information is available. In this paper, we show how a simple feed-forward neural network can be trained to filter documents under these conditions, and that this method seems to be superior to modified methods (modified to use only positive examples), such as Rocchio, Nearest Neighbor, Naive-Bayes, Distance-based Probability and One-Class SVM algorithms. A novel experimental finding is that retrieval is enhanced substantially in this context by carrying out a certain kind of uniform transformation ("Hadamard") of the information prior to the training of the network.
AB - Automated document retrieval and classification is of central importance in many contexts; our main motivating goal is the efficient classification and retrieval of "interests" on the internet when only positive information is available. In this paper, we show how a simple feed-forward neural network can be trained to filter documents under these conditions, and that this method seems to be superior to modified methods (modified to use only positive examples), such as Rocchio, Nearest Neighbor, Naive-Bayes, Distance-based Probability and One-Class SVM algorithms. A novel experimental finding is that retrieval is enhanced substantially in this context by carrying out a certain kind of uniform transformation ("Hadamard") of the information prior to the training of the network.
KW - Autoencoder
KW - Automated document retrieval
KW - Bottleneck neural network
KW - Classification
KW - Feed-forward neural networks
KW - Machine learning
KW - One-class classification
UR - http://www.scopus.com/inward/record.url?scp=33847410597&partnerID=8YFLogxK
U2 - 10.1016/j.neucom.2006.05.013
DO - 10.1016/j.neucom.2006.05.013
M3 - ???researchoutput.researchoutputtypes.contributiontojournal.article???
AN - SCOPUS:33847410597
SN - 0925-2312
VL - 70
SP - 1466
EP - 1481
JO - Neurocomputing
JF - Neurocomputing
IS - 7-9
ER -