Identifying e-Commerce in Enterprises by means of Text Mining and Classification Algorithms

Gianpiero Bianchi; Renato Bruni; Francesco Scalfati

Journal ArticleOPEN ACCESS

Identifying e-Commerce in Enterprises by means of Text Mining and Classification Algorithms

Mathematical Problems in Engineering (2018) 2018

DOI: 10.1155/2018/7231920

14Citations

31Readers

Abstract

Monitoring specific features of the enterprises, for example, the adoption of e-commerce, is an important and basic task for several economic activities. This type of information is usually obtained by means of surveys, which are costly due to the amount of personnel involved in the task. An automatic detection of this information would allow consistent savings. This can actually be performed by relying on computer engineering, since in general this information is publicly available on-line through the corporate websites. This work describes how to convert the detection of e-commerce into a supervised classification problem, where each record is obtained from the automatic analysis of one corporate website, and the class is the presence or the absence of e-commerce facilities. The automatic generation of similar data records requires the use of several Text Mining phases; in particular we compare six strategies based on the selection of best words and best n-grams. After this, we classify the obtained dataset by means of four classification algorithms: Support Vector Machines; Random Forest; Statistical and Logical Analysis of Data; Logistic Classifier. This turns out to be a difficult case of classification problem. However, after a careful design and set-up of the whole procedure, the results on a practical case of Italian enterprises are encouraging.

Cite

CITATION STYLE

APA

Bianchi, G., Bruni, R., & Scalfati, F. (2018). Identifying e-Commerce in Enterprises by means of Text Mining and Classification Algorithms. Mathematical Problems in Engineering, 2018. https://doi.org/10.1155/2018/7231920

Identifying e-Commerce in Enterprises by means of Text Mining and Classification Algorithms

Abstract

Cite

Register to see more suggestions