Recently, language acquisition aided by multi-modal information has drawn increasing attention. However, the semantic grounding of verbs has received less attention because of their complex semantic representation. This paper proposes a novel way to incorporate visual information into the semantic representation of Chinese verbs. Starting from the original representation of two constituents from Frame Semantics, the verb frame and the argument, both are linked with visual information to represent verb semantics. The discussion focuses on a categorization of arguments based on visual information. To achieve this, a collection of {video, text description} pairs is first built. After preprocessing on both sides, the correspondence between the arguments of verbs and related visual features is constructed based on SOM groups. A video description system has also been built to generate sentences for new videos. The evaluation of the description system shows the effectiveness of our visual semantic representation for Chinese verbs. © 2011 Springer-Verlag.
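The argument-to-visual-feature correspondence can be illustrated with a minimal sketch, assuming that SOM refers to a self-organizing map. The abstract does not specify the visual features, the map size, or the training schedule, so the two-dimensional feature vectors, the argument words "ball" and "door", and the helper names `bmu`, `train_som`, and `argument_groups` below are all hypothetical. The sketch trains a tiny one-dimensional map on feature vectors and then assigns each argument word to the map unit that its features most often activate.

```python
import math
from collections import Counter, defaultdict


def bmu(weights, x):
    """Return the index of the unit whose weight vector is closest to x."""
    return min(
        range(len(weights)),
        key=lambda j: sum((w - v) ** 2 for w, v in zip(weights[j], x)),
    )


def train_som(data, n_units=2, epochs=50, lr0=0.5, sigma0=1.0):
    """Train a 1-D self-organizing map; returns one weight vector per unit."""
    # Deterministic initialization: evenly spaced samples from the data.
    step = (len(data) - 1) / max(n_units - 1, 1)
    weights = [list(data[round(j * step)]) for j in range(n_units)]
    for t in range(epochs):
        frac = 1.0 - t / epochs           # linear decay schedule (assumption)
        lr = lr0 * frac                   # learning rate shrinks over time
        sigma = max(sigma0 * frac, 1e-3)  # neighborhood width shrinks too
        for x in data:
            b = bmu(weights, x)
            for j, w in enumerate(weights):
                # Gaussian neighborhood centered on the best-matching unit.
                h = math.exp(-((j - b) ** 2) / (2 * sigma ** 2))
                for d in range(len(w)):
                    w[d] += lr * h * (x[d] - w[d])
    return weights


def argument_groups(pairs, weights):
    """Map each argument word to the SOM unit its features hit most often."""
    votes = defaultdict(Counter)
    for word, feat in pairs:
        votes[word][bmu(weights, feat)] += 1
    return {word: c.most_common(1)[0][0] for word, c in votes.items()}


# Hypothetical (argument word, visual feature vector) pairs.
pairs = [
    ("ball", (0.0, 0.1)), ("ball", (0.1, 0.0)), ("ball", (0.05, 0.05)),
    ("door", (1.0, 0.9)), ("door", (0.9, 1.0)), ("door", (0.95, 0.95)),
]
weights = train_som([feat for _, feat in pairs])
groups = argument_groups(pairs, weights)
print(groups)
```

In this toy setting the two words end up in different groups because their features are well separated. A realistic system would need higher-dimensional features extracted from video and a larger map.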
CITATION STYLE
Liu, H. P., Wang, X. J., & Zhong, Y. X. (2011). Visual information based argument categorization for semantics of Chinese verb. In Communications in Computer and Information Science (Vol. 185 CCIS, pp. 206–213). https://doi.org/10.1007/978-3-642-22309-9_25