Abstract
We propose a novel framework for improving a word segmenter using information acquired from symbol grounding. We generate a term dictionary in three steps: generating a pseudo-stochastically segmented corpus, building a symbol grounding model to enumerate word candidates, and filtering them according to the grounding scores. We applied our method to game records of Japanese chess with commentaries. The experimental results show that the accuracy of a word segmenter can be improved by incorporating the generated dictionary.
Cite
CITATION STYLE
Kameko, H., Mori, S., & Tsuruoka, Y. (2015). Can symbol grounding improve low-level NLP? Word segmentation as a case study. In Conference Proceedings - EMNLP 2015: Conference on Empirical Methods in Natural Language Processing (pp. 2298–2303). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/d15-1277
Register to see more suggestions
Mendeley helps you to discover research relevant for your work.