Extension of the Rocchio classification method to multi-modal categorization of documents in social media

Amin Mantrach; Jean Michel Renders

Conference ProceedingsOPEN ACCESS

Extension of the Rocchio classification method to multi-modal categorization of documents in social media

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2012) 7523 LNAI(PART 1) 130-142

DOI: 10.1007/978-3-642-33460-3_14

0Citations

3Readers

Abstract

Most of the approaches in multi-view categorization use early fusion, late fusion or co-training strategies. We propose here a novel classification method that is able to efficiently capture the interactions across the different modes. This method is a multi-modal extension of the Rocchio classification algorithm - very popular in the Information Retrieval community. The extension consists of simultaneously maintaining different "centroid" representations for each class, in particular "cross-media" centroids that correspond to pairs of modes. To classify new data points, different scores are derived from similarity measures between the new data point and these different centroids; a global classification score is finally obtained by suitably aggregating the individual scores. This method outperforms the multi-view logistic regression approach (using either the early fusion or the late fusion strategies) on a social media corpus - namely the ENRON email collection - on two very different categorization tasks (folder classification and recipient prediction). © 2012 Springer-Verlag.

Cite

CITATION STYLE

APA

Mantrach, A., & Renders, J. M. (2012). Extension of the Rocchio classification method to multi-modal categorization of documents in social media. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 7523 LNAI, pp. 130–142). https://doi.org/10.1007/978-3-642-33460-3_14

Extension of the Rocchio classification method to multi-modal categorization of documents in social media

Abstract

Cite

Register to see more suggestions