Boosting over groups and its application to acronym-expansion extraction

1Citations
Citations of this article
2Readers
Mendeley users who have this article in their library.
Get full text

Abstract

In many real-world classification applications, instances are generated from different 'groups'. Take webpage classification as an example, the webpages for training and testing can be naturally grouped by network domains, which often vary a lot from one to another in domain size or webpage template. The differences between 'groups' would result that the distribution of instances from different 'groups' also vary. Thus, it is not so reasonable to equally treat the instances as the independent elements during training and testing as in conventional classification algorithms. This paper addresses the classification problem where all the instances can be naturally grouped. Specifically, we give a formulation to this kind of problem and propose a simple but effective boosting approach, which is called AdaBoost.Group. The problem is demonstrated by the task of recognizing acronyms and their expansions from text, where all the instances are grouped by sentences. The experimental results show that our approach is more appropriate to this kind of problems than conventional classification approaches. © 2008 Springer-Verlag Berlin Heidelberg.

Cite

CITATION STYLE

APA

Ni, W., Huang, Y., Li, D., & Wang, Y. (2008). Boosting over groups and its application to acronym-expansion extraction. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 5139 LNAI, pp. 27–38). Springer Verlag. https://doi.org/10.1007/978-3-540-88192-6_5

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free