Using machine learning for automated assessment of misclassification of goods for fraud detection

Margarita Spichakova; Hele Mai Haav

Conference Proceedings

Using machine learning for automated assessment of misclassification of goods for fraud detection

Communications in Computer and Information Science (2020) 1243 CCIS 144-158

DOI: 10.1007/978-3-030-57672-1_12

3Citations

16Readers

Get full text

Abstract

The paper is devoted to providing automated solutions to an actual problem of misclassification of goods in cross-border trade. In this paper, we introduce a hybrid approach to Harmonized System (HS) code assessment that combines the knowledge derived from textual descriptions of products, assigned to them HS codes and taxonomy of HS codes nomenclature. We use machine learning for providing HS code’s predictions and recommendations on the basis of a model learned from the textual descriptions of the products. In order to perform an assessment of misclassification of goods we present a novel combined similarity measure based on cosine similarity of texts and semantic similarity of HS codes based on HS code taxonomy (ontology). The method is evaluated on the real open source data set of Bill of Lading Summary 2017 [1] using Gensim Python library [4].

Author supplied keywords

Cite

CITATION STYLE

APA

Spichakova, M., & Haav, H. M. (2020). Using machine learning for automated assessment of misclassification of goods for fraud detection. In Communications in Computer and Information Science (Vol. 1243 CCIS, pp. 144–158). Springer. https://doi.org/10.1007/978-3-030-57672-1_12

Using machine learning for automated assessment of misclassification of goods for fraud detection

Abstract

Author supplied keywords

Cite

Register to see more suggestions