Comparing Outlier Detection Methods: An Application on Indonesian Air Quality Data

  • Fitrianto A
  • Kholifatunnisa A
  • Kurnia A

Abstract

There are many methods for detecting outliers, but only a few take the distribution of the data into account. This research compares outlier detection methods on univariate data with a skewed distribution: Tukey's boxplot, the adjusted boxplot, sequential fences, and adjusted sequential fences. The study uses Indonesian air quality index data and also identifies areas of concern due to poor air quality during the Implementation of Micro-Community Activity Restrictions. The methods are compared on the number of outliers detected, error rate, accuracy, precision, specificity, sensitivity, and robustness. The adjusted boxplot and adjusted sequential fences accurately detect the tails that contain outliers because the skewness coefficient makes them more robust. Tukey's boxplot and sequential fences perform poorly because they fail to identify the true outliers correctly. Based on these results, the adjusted boxplot is the best method. Areas that need attention due to poor air quality include South Sumatera, South Sulawesi, West Java, Riau, North Sumatera, Jambi, Jakarta, and East Java.
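To make the contrast between the symmetric and the skewness-adjusted fences concrete, the sketch below compares Tukey's boxplot with an adjusted boxplot on a small right-skewed sample. It is illustrative only and is not the authors' implementation. Several things are assumptions, since the abstract gives no formulas: the adjusted fences use the commonly cited medcouple-based form (exponents -4 and 3 for non-negative skew, mirrored for negative skew), the helper names (`quartiles`, `naive_medcouple`, `tukey_fences`, `adjusted_fences`, `outliers`) are hypothetical, the medcouple is a simplified O(n²) version that drops pairs tied at the median, and the sample values are made up rather than taken from the air quality data.

```python
import math
import statistics


def quartiles(values):
    """Return (Q1, Q3) using linear interpolation over the sorted sample."""
    q1, _, q3 = statistics.quantiles(values, n=4, method="inclusive")
    return q1, q3


def naive_medcouple(values):
    """Simplified O(n^2) medcouple (a robust skewness measure in [-1, 1]).

    Assumption: pairs where both points equal the median are skipped
    instead of using the special tie kernel of the full definition.
    """
    m = statistics.median(values)
    lower = [v for v in values if v <= m]
    upper = [v for v in values if v >= m]
    kernel = [
        ((xj - m) - (m - xi)) / (xj - xi)
        for xi in lower
        for xj in upper
        if xj != xi
    ]
    return statistics.median(kernel) if kernel else 0.0


def tukey_fences(values, k=1.5):
    """Classic symmetric fences: Q1 - k*IQR and Q3 + k*IQR."""
    q1, q3 = quartiles(values)
    iqr = q3 - q1
    return q1 - k * iqr, q3 + k * iqr


def adjusted_fences(values, k=1.5):
    """Skewness-adjusted fences (assumed medcouple-based form)."""
    q1, q3 = quartiles(values)
    iqr = q3 - q1
    mc = naive_medcouple(values)
    if mc >= 0:
        # Right skew: pull the lower fence in, push the upper fence out.
        return q1 - k * math.exp(-4 * mc) * iqr, q3 + k * math.exp(3 * mc) * iqr
    # Left skew: the mirror image.
    return q1 - k * math.exp(-3 * mc) * iqr, q3 + k * math.exp(4 * mc) * iqr


def outliers(values, low, high):
    """Return the sorted observations that fall outside [low, high]."""
    return sorted(v for v in values if v < low or v > high)


# Made-up right-skewed sample, NOT taken from the paper's data.
sample = [1, 2, 2, 3, 3, 3, 4, 4, 5, 6, 8, 12, 40]

tukey_flagged = outliers(sample, *tukey_fences(sample))
adjusted_flagged = outliers(sample, *adjusted_fences(sample))
```

On this sample the upper adjusted fence lies further out than Tukey's, so a moderately large value in the long right tail is no longer flagged, while the extreme value still is. A production analysis would more likely rely on a tested medcouple implementation than on this simplified version.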

Cite

Fitrianto, A., Kholifatunnisa, A., & Kurnia, A. (2024). Comparing Outlier Detection Methods: An Application on Indonesian Air Quality Data. CAUCHY: Jurnal Matematika Murni Dan Aplikasi, 9(2), 341–351. https://doi.org/10.18860/ca.v9i2.29434