Boosting over groups and its application to acronym-expansion extraction

Weijian Ni; Yalou Huang; Dong Li; Yang Wang

Conference Proceedings

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2008) 5139 LNAI 27-38

DOI: 10.1007/978-3-540-88192-6_5

Abstract

In many real-world classification applications, instances are generated from different 'groups'. Take webpage classification as an example: the webpages used for training and testing can be naturally grouped by network domain, and domains often differ considerably in size and webpage template. Because of these differences, the distribution of instances also varies from group to group. It is therefore not reasonable to treat all instances equally as independent elements during training and testing, as conventional classification algorithms do. This paper addresses the classification problem in which all instances can be naturally grouped. Specifically, we formulate this kind of problem and propose a simple but effective boosting approach called AdaBoost.Group. The problem is demonstrated on the task of recognizing acronyms and their expansions in text, where the instances are grouped by sentence. The experimental results show that our approach is better suited to this kind of problem than conventional classification approaches. © 2008 Springer-Verlag Berlin Heidelberg.
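The abstract does not spell out the paper's formulation or the AdaBoost.Group algorithm, so the sketch below is not the authors' method. It only illustrates the motivating observation: when groups differ in size, an instance-level measure lets large groups dominate, while a group-level measure counts every group equally. The function names and the toy data are hypothetical.

```python
from collections import defaultdict


def instance_accuracy(y_true, y_pred):
    """Fraction of correctly classified instances, ignoring group membership."""
    correct = sum(t == p for t, p in zip(y_true, y_pred))
    return correct / len(y_true)


def group_accuracy(groups, y_true, y_pred):
    """Mean of per-group accuracies, so every group counts equally."""
    hits = defaultdict(int)
    sizes = defaultdict(int)
    for g, t, p in zip(groups, y_true, y_pred):
        sizes[g] += 1
        hits[g] += int(t == p)
    return sum(hits[g] / sizes[g] for g in sizes) / len(sizes)


# Hypothetical toy data: one large group ("a") and two single-instance groups.
groups = ["a"] * 8 + ["b"] + ["c"]
y_true = [1, 0, 1, 0, 1, 0, 1, 0, 1, 1]
y_pred = [1, 0, 1, 0, 1, 0, 1, 0, 0, 0]

inst = instance_accuracy(y_true, y_pred)      # 8 of 10 instances correct
grp = group_accuracy(groups, y_true, y_pred)  # only 1 of 3 groups handled well
```

In this toy case the classifier looks strong at the instance level but fails two of the three groups, which is the kind of discrepancy a group-aware formulation is meant to expose.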

Cite (APA)

Ni, W., Huang, Y., Li, D., & Wang, Y. (2008). Boosting over groups and its application to acronym-expansion extraction. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 5139 LNAI, pp. 27–38). Springer Verlag. https://doi.org/10.1007/978-3-540-88192-6_5
