Computing Publications

Publications Home » Using dragpushing to refine centr...

Using dragpushing to refine centroid text classifiers.

Songbo Tan, Xueqi Cheng, Bin Wang, Hongbo Xu, Moustafa Ghanem, Yike Guo

Conference or Workshop Paper
SIGIR '05: 28th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval
August, 2005
ISBN 1-59593-034-5
DOI 10.1145/1076034.1076174

We present a novel algorithm, DragPushing, for automatic text classification. Using a training data set, the algorithm first calculates the prototype vectors, or centroids, for each of the available document classes. Using misclassified examples, it then iteratively refines these centroids; by dragging the centroid of a correct class towards a misclassified example and in the same time pushing the centroid of an incorrect class away from the misclassified example. The algorithm is simple to implement and is computationally very efficient. Evaluation experiments conducted on two benchmark collections show that its classification accuracy is comparable to that of more complex methods, such as support vector machines (SVM).

BibTEX file for the publication built & maintained by Ashok Argent-Katwala.