Using non-lexical features to identify effective indexing terms for biomedical illustrations

Matthew Simpson; Dina Demner-Fushman; Charles Sneiderman; Sameer K. Antani; George R. Thoma

Conference Proceedings

EACL 2009 - 12th Conference of the European Chapter of the Association for Computational Linguistics, Proceedings (2009) 737-744

DOI: 10.3115/1609067.1609149

Abstract

Automatic image annotation is an attractive approach for enabling convenient access to images found in a variety of documents. Since image captions and relevant discussions found in the text can be useful for summarizing the content of images, it is also possible that this text can be used to generate salient indexing terms. Unfortunately, this problem is generally domain-specific because indexing terms that are useful in one domain can be ineffective in others. Thus, we present a supervised machine learning approach to image annotation utilizing non-lexical features extracted from image-related text to select useful terms. We apply this approach to several subdomains of the biomedical sciences and show that we are able to reduce the number of ineffective indexing terms. © 2009 Association for Computational Linguistics.

Cite

Simpson, M., Demner-Fushman, D., Sneiderman, C., Antani, S. K., & Thoma, G. R. (2009). Using non-lexical features to identify effective indexing terms for biomedical illustrations. In EACL 2009 - 12th Conference of the European Chapter of the Association for Computational Linguistics, Proceedings (pp. 737–744). Association for Computational Linguistics (ACL). https://doi.org/10.3115/1609067.1609149
