Adversarial attack on machine learning models

Abstract

Machine Learning (ML) models are applied to a variety of tasks, such as network intrusion detection and malware classification. Yet these models are vulnerable to a class of malicious inputs known as adversarial examples: slightly perturbed inputs that the ML model classifies incorrectly. Mitigating such adversarial inputs remains an open problem. As a step towards understanding adversarial examples, we show that they are not drawn from the same distribution as the original data and can therefore be detected using statistical tests. Building on this observation, we introduce a complementary approach that identifies individual adversarial inputs. Specifically, we augment the ML model with an additional output class, on which the model is trained to classify all adversarial inputs.
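
The abstract describes two mechanisms. For the first, detecting that adversarial examples come from a different distribution, the following is a minimal sketch assuming a two-sample kernel MMD statistic with a permutation test; the abstract does not name a specific test, so the RBF kernel, its bandwidth, and the stand-in data are illustrative assumptions.

```python
# Sketch: two-sample test for "clean vs. adversarial drawn from the same
# distribution?" using a (biased) kernel MMD estimate and a permutation test.
import numpy as np

rng = np.random.default_rng(0)

def mmd2(X, Y, gamma=1.0):
    """Biased estimate of squared maximum mean discrepancy (RBF kernel)."""
    def k(A, B):
        sq = ((A[:, None, :] - B[None, :, :]) ** 2).sum(-1)
        return np.exp(-gamma * sq)
    return k(X, X).mean() + k(Y, Y).mean() - 2.0 * k(X, Y).mean()

def permutation_pvalue(X, Y, n_perm=200):
    """p-value for H0: X and Y are drawn from the same distribution."""
    observed = mmd2(X, Y)
    pooled = np.concatenate([X, Y])
    count = 0
    for _ in range(n_perm):
        perm = rng.permutation(len(pooled))
        count += mmd2(pooled[perm[: len(X)]], pooled[perm[len(X):]]) >= observed
    return (count + 1) / (n_perm + 1)

# Stand-in data: clean samples vs. slightly shifted "adversarial" samples.
clean = rng.normal(0.0, 1.0, size=(100, 5))
shifted = rng.normal(0.3, 1.0, size=(100, 5))
print(permutation_pvalue(clean, shifted))  # small p-value => distributions differ
```

For the second mechanism, the extra output class, below is a minimal sketch assuming a PyTorch classifier whose head has N+1 outputs. The abstract does not name the attack used to craft training inputs, so FGSM is used here as a stand-in; the model architecture, input shapes, and hyperparameters are likewise assumptions, not the paper's settings.

```python
# Sketch: augment a classifier with one extra "adversarial" output class and
# train it to route perturbed inputs to that class.
import torch
import torch.nn as nn
import torch.nn.functional as F

NUM_CLASSES = 10              # original classes (hypothetical)
OUTLIER_CLASS = NUM_CLASSES   # extra output reserved for adversarial inputs

# Hypothetical model: the classifier head has NUM_CLASSES + 1 outputs.
model = nn.Sequential(
    nn.Flatten(),
    nn.Linear(28 * 28, 128),
    nn.ReLU(),
    nn.Linear(128, NUM_CLASSES + 1),
)
optimizer = torch.optim.Adam(model.parameters(), lr=1e-3)

def fgsm(model, x, y, eps=0.1):
    """Craft adversarial examples with the fast gradient sign method."""
    x = x.clone().detach().requires_grad_(True)
    F.cross_entropy(model(x), y).backward()
    return (x + eps * x.grad.sign()).clamp(0, 1).detach()

def train_step(x, y):
    # Perturb a copy of the batch and relabel it as the outlier class,
    # so the model learns to assign adversarial inputs to output N+1.
    x_adv = fgsm(model, x, y)
    y_adv = torch.full_like(y, OUTLIER_CLASS)
    optimizer.zero_grad()
    loss = F.cross_entropy(model(torch.cat([x, x_adv])),
                           torch.cat([y, y_adv]))
    loss.backward()
    optimizer.step()
    return loss.item()

# Example usage with random stand-in data (real training would iterate
# over an actual dataset such as MNIST).
x = torch.rand(32, 1, 28, 28)
y = torch.randint(0, NUM_CLASSES, (32,))
print(train_step(x, y))
```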

Citation (APA)

Sahaya Sakila, V., Sandeep, M., & Praveen Hari Krishna, N. (2019). Adversarial attack on machine learning models. International Journal of Innovative Technology and Exploring Engineering, 8(6 Special Issue 4), 431–434. https://doi.org/10.35940/ijitee.F1088.0486S419
