OpenBrand: Open Brand Value Extraction from Product Descriptions

Kassem Sabeh; Mouna Kacimi; Johann Gamper

Conference Proceedings

ECNLP 2022 - 5th Workshop on e-Commerce and NLP, Proceedings of the Workshop (2022) 161-170

DOI: 10.18653/v1/2022.ecnlp-1.19

Abstract

Extracting attribute-value information from unstructured product descriptions continues to be of vital importance in e-commerce applications. One of the most important product attributes is the brand, which strongly influences customers’ purchasing behaviour. Thus, it is crucial to extract brand information accurately while addressing the main challenge of discovering new brand names. Under the open world assumption, several approaches have adopted deep learning models to extract attribute values using the sequence tagging paradigm. However, they did not employ finer-grained data representations, such as character-level embeddings, which improve generalizability. In this paper, we introduce OpenBrand, a novel approach for discovering brand names. OpenBrand is a BiLSTM-CRF-Attention model with embeddings at different granularities. These embeddings are learned using CNN and LSTM architectures to provide more accurate representations. We further propose a new dataset for brand value extraction, which includes a very challenging zero-shot extraction task. We have tested our approach through extensive experiments and shown that it outperforms state-of-the-art models in brand name discovery.
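
To make the sequence tagging paradigm mentioned in the abstract concrete, the following is a minimal sketch of how per-token tags can be turned back into brand values. It does not reproduce the OpenBrand model: the B/I/O tag scheme, the function name extract_spans, and the product title with the brand "Acme Labs" are all assumptions made for illustration, and the tags are written by hand rather than predicted by a tagger.

```python
from typing import List


def extract_spans(tokens: List[str], tags: List[str]) -> List[str]:
    """Collect token spans tagged B (begin) / I (inside); O marks other tokens.

    Hypothetical helper: the tag scheme is an assumption, not taken from the paper.
    """
    if len(tokens) != len(tags):
        raise ValueError("tokens and tags must have the same length")

    spans: List[str] = []
    current: List[str] = []
    for token, tag in zip(tokens, tags):
        if tag == "B":
            # A new value starts; close any span that is still open.
            if current:
                spans.append(" ".join(current))
            current = [token]
        elif tag == "I" and current:
            # Continue the span that is currently open.
            current.append(token)
        else:
            # "O", or a stray "I" with no open span: close any open span.
            if current:
                spans.append(" ".join(current))
            current = []
    if current:
        spans.append(" ".join(current))
    return spans


# Invented product title; the tags stand in for a tagger's output.
tokens = "Acme Labs wireless noise cancelling headphones".split()
tags = ["B", "I", "O", "O", "O", "O"]
brands = extract_spans(tokens, tags)
print(brands)
```

Because the value is read off the tags rather than looked up in a fixed list, a brand name that never appeared in training can still be returned, which is the property the open world setting relies on.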

Cite

Sabeh, K., Kacimi, M., & Gamper, J. (2022). OpenBrand: Open Brand Value Extraction from Product Descriptions. In ECNLP 2022 - 5th Workshop on e-Commerce and NLP, Proceedings of the Workshop (pp. 161–170). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/2022.ecnlp-1.19
