Detection of electricity theft behavior based on improved synthetic minority oversampling technique and random forest classifier

Abstract

Effective detection of electricity theft is essential to maintaining power system reliability. With the development of smart grids, traditional electricity theft detection technologies have become ineffective at handling the increasingly complex data on the user side. To improve the auditing efficiency of grid enterprises, this paper proposes a new electricity theft detection method based on an improved synthetic minority oversampling technique (SMOTE) and an improved random forest (RF) method. The data of normal users and electricity theft users are labeled positive data (PD) and negative data (ND), respectively. In practice, ND are far fewer than PD, so a dataset composed of the two is unbalanced. An improved SMOTE based on the K-means clustering algorithm (K-SMOTE) is first presented to balance the dataset: the cluster centers of ND are determined by K-means, and new ND samples are then interpolated by SMOTE on the basis of those cluster centers until the dataset is balanced. Finally, the RF classifier is trained on the balanced dataset, with the optimal number of decision trees chosen according to the convergence of the out-of-bag (OOB) error. The trained RF classifier is then used to detect electricity theft behavior on the user side.
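
The balancing step described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not give the interpolation formula, so the sketch assumes each synthetic ND sample is drawn on the line segment between a real ND sample and the center of its K-means cluster. The function names `kmeans` and `k_smote`, the toy feature vectors, and the choice `k=2` are all hypothetical.

```python
import random


def kmeans(points, k, iters=50, seed=0):
    """Plain Lloyd's k-means on lists of floats; returns (centers, labels)."""
    rng = random.Random(seed)
    centers = [list(p) for p in rng.sample(points, k)]
    labels = [0] * len(points)
    for _ in range(iters):
        # Assign every point to its nearest center (squared Euclidean distance).
        labels = [
            min(range(k),
                key=lambda c: sum((a - b) ** 2 for a, b in zip(p, centers[c])))
            for p in points
        ]
        # Move each center to the mean of its members; empty clusters stay put.
        new_centers = []
        for c in range(k):
            members = [p for p, lab in zip(points, labels) if lab == c]
            if members:
                new_centers.append([sum(col) / len(members) for col in zip(*members)])
            else:
                new_centers.append(centers[c])
        if new_centers == centers:
            break
        centers = new_centers
    return centers, labels


def k_smote(minority, n_new, k=2, seed=0):
    """Assumed K-SMOTE variant: interpolate between a minority sample and
    the center of the cluster it belongs to."""
    rng = random.Random(seed)
    centers, labels = kmeans(minority, k, seed=seed)
    synthetic = []
    for _ in range(n_new):
        i = rng.randrange(len(minority))
        x, center = minority[i], centers[labels[i]]
        gap = rng.random()  # interpolation weight in [0, 1)
        synthetic.append([c + gap * (xi - c) for xi, c in zip(x, center)])
    return synthetic


# Toy data (made up): 12 normal users (PD) and 4 theft users (ND).
normal = [[5.0 + 0.1 * i, 5.0 - 0.1 * i] for i in range(12)]
theft = [[0.1, 0.2], [0.2, 0.1], [0.9, 1.0], [1.0, 0.9]]

# Generate just enough synthetic ND samples to match the PD count.
synthetic = k_smote(theft, n_new=len(normal) - len(theft), k=2)
balanced_theft = theft + synthetic
```

The balanced set would then be used to train the random forest. With scikit-learn, for example, `RandomForestClassifier(oob_score=True)` exposes `oob_score_` after fitting, so one possible way to pick the number of trees is to refit with increasing `n_estimators` and stop once `1 - oob_score_` levels off. That is one reading of "convergence of the OOB error", not necessarily the procedure used in the paper.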

Cite

Qu, Z., Li, H., Wang, Y., Zhang, J., Abu-Siada, A., & Yao, Y. (2020). Detection of electricity theft behavior based on improved synthetic minority oversampling technique and random forest classifier. Energies, 13(8). https://doi.org/10.3390/en13082039
