Abstract
Olfaction is a rather understudied sense compared to the other human senses. In NLP, however, there have been recent attempts to develop taxonomies and benchmarks specifically designed to capture smell-related information. In this work, we further extend this research line by presenting a supervised system for olfactory information extraction in English. We cast this problem as a token classification task and build a system that identifies smell words, smell sources and qualities. The classifier is then applied to a set of English historical corpora, covering different domains and written in a time period between the 15th and the 20th Century. A qualitative analysis of the extracted data shows that they can be used to infer interesting information about smelly items such as tea and tobacco from a diachronical perspective, supporting historical investigation with corpus-based evidence.
Cite
CITATION STYLE
Menini, S., Paccosi, T., Tekiroğlu, S. S., & Tonelli, S. (2023). Scent Mining: Extracting Olfactory Events, Smell Sources and Qualities. In EACL 2023 - 7th Joint SIGHUM Workshop on Computational Linguistics for Cultural Heritage, Social Sciences, Humanities and Literature, Proceedings of LaTeCH-CLfL 2023 (pp. 135–140). Association for Computational Linguistics. https://doi.org/10.18653/v1/2023.latechclfl-1.15
Register to see more suggestions
Mendeley helps you to discover research relevant for your work.