The paper is devoted to providing automated solutions to an actual problem of misclassification of goods in cross-border trade. In this paper, we introduce a hybrid approach to Harmonized System (HS) code assessment that combines the knowledge derived from textual descriptions of products, assigned to them HS codes and taxonomy of HS codes nomenclature. We use machine learning for providing HS code’s predictions and recommendations on the basis of a model learned from the textual descriptions of the products. In order to perform an assessment of misclassification of goods we present a novel combined similarity measure based on cosine similarity of texts and semantic similarity of HS codes based on HS code taxonomy (ontology). The method is evaluated on the real open source data set of Bill of Lading Summary 2017 [1] using Gensim Python library [4].
CITATION STYLE
Spichakova, M., & Haav, H. M. (2020). Using machine learning for automated assessment of misclassification of goods for fraud detection. In Communications in Computer and Information Science (Vol. 1243 CCIS, pp. 144–158). Springer. https://doi.org/10.1007/978-3-030-57672-1_12
Mendeley helps you to discover research relevant for your work.