Typically, visually-grounded language learning systems only accept feature data about objects in the environment that are explicitly mentioned, whether through annotation labels or direct reference through natural language. We show that when objects are described ambiguously using natural language, a system can use a combination of the pragmatic principles of Contrast and Conventionality, and multiple-instance learning to learn from ambiguous examples in an online fashion. Applying child language learning strategies to visual learning enables more effective learning in real-time environments, which can lead to enhanced teaching interactions with robots or grounded systems in multi-object environments.
CITATION STYLE
Perera, I., & Allen, J. F. (2015). Quantity, contrast, and convention in cross-situated language comprehension. In CoNLL 2015 - 19th Conference on Computational Natural Language Learning, Proceedings (pp. 226–236). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/k15-1023
Mendeley helps you to discover research relevant for your work.