Abstract
This paper describes the Twitter lexical normalization system submitted by IHS R&D Belarus team for the ACL 2015 workshop on noisy user-generated text. The proposed system consists of two components: a CRFbased approach to identify possible normalization candidates, and a post-processing step in an attempt to normalize words that do not have normalization variants in the lexicon. Evaluation on the test data set showed that our unconstrained system achieved the F-measure of 0.8272 (rank 1 out of 5 submissions for the unconstrained mode, rank 2 out of all 11 submissions).
Cite
CITATION STYLE
Supranovich, D., & Patsepnia, V. (2015). IHS_RD: Lexical Normalization for English Tweets. In ACL-IJCNLP 2015 - Workshop on Noisy User-Generated Text, WNUT 2015 - Proceedings of the Workshop (pp. 78–81). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/w15-4311
Register to see more suggestions
Mendeley helps you to discover research relevant for your work.