Deep learning for classification of bi-lingual ads in online classifieds

0Citations
Citations of this article
1Readers
Mendeley users who have this article in their library.
Get full text

Abstract

Classification of ads in online classifieds is a domain well suited for applying deep learning for understanding images and text. Since many different items are in the same category, classification based on images alone is hard. Adding the title of the ad increases classification accuracy significantly. This paper describes a system developed for an online classifieds site in Thailand (kaidee.com), where titles are often a mixture of Thai and English. To achieve machine understanding of bi-lingual text, a character-level neural embedding was used. Both 1D convolution and bidirectional long short-term memory (BLSTM) were examined, with convolution being both more accurate and quicker to train. The Inception v3 model was used to extract visual features from images. Visual features and character embeddings are concatenated and fed into a classifier. The results show that this approach is better than classifying based on either image or text alone. A focus of this paper is the simplicity of the solution, yielding an accuracy of 86.0% applied on real-world data.

Cite

CITATION STYLE

APA

Tidemann, A. (2017). Deep learning for classification of bi-lingual ads in online classifieds. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 10630 LNAI, pp. 399–404). Springer Verlag. https://doi.org/10.1007/978-3-319-71078-5_33

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free