Who wrote this book? a challenge for e-commerce

1Citations
Citations of this article
69Readers
Mendeley users who have this article in their library.

Abstract

Modern e-commerce catalogs contain millions of references, associated with textual and visual information that is of paramount importance for the products to be found via search or browsing. Of particular significance is the book category, where the author name(s) field poses a significant challenge. Indeed, books written by a given author might be listed with different authors' names due to abbreviations, spelling variants and mistakes, among others. To solve this problem at scale, we design a composite system involving open data sources for books, as well as deep learning components, such as approximate match with Siamese networks and name correction with sequence-tosequence networks. We evaluate this approach on product data from the e-commerce website Rakuten France, and find that the top proposal of the system is the normalized author name with 72% accuracy.

Cite

CITATION STYLE

APA

Dumont, B., Maggio, S., Said, G. S., & Au, Q. T. (2019). Who wrote this book? a challenge for e-commerce. In W-NUT@EMNLP 2019 - 5th Workshop on Noisy User-Generated Text, Proceedings (pp. 121–125). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/d19-5516

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free