A Deep Learning Approach for Viral DNA Sequence Classification using Genetic Algorithm

13Citations
Citations of this article
19Readers
Mendeley users who have this article in their library.

Abstract

DNA sequence classification is one of the major challenges in biological data processing. The identification and classification of novel viral genome sequences drastically help in reducing the dangers of a viral outbreak like COVID-19. The more accurate the classification of these viruses, the faster a vaccine can be produced to counter them. Thus, more accurate methods should be utilized to classify the viral DNA. This research proposes a hybrid deep learning model for efficient viral DNA sequence classification. A genetic algorithm (GA) was utilized for weight optimization with Convolutional Neural Networks (CNN) architecture. Furthermore, Long Short-Term Memory (LSTM) as well as Bidirectional CNN-LSTM model architectures are employed. Encoding methods are needed to transform the DNA into numeric format for the proposed model. Three different encoding methods to represent DNA sequences as input to the proposed model were experimented: k-mer, label encoding, and one hot vector encoding. Furthermore, an efficient oversampling method was applied to overcome the imbalanced dataset issues. The performance of the proposed GA optimized CNN hybrid model using label encoding achieved the highest classification accuracy of 94.88% compared with other encoding methods

Cite

CITATION STYLE

APA

El-Tohamy, A., Maghwary, H. A., & Badr, N. (2022). A Deep Learning Approach for Viral DNA Sequence Classification using Genetic Algorithm. International Journal of Advanced Computer Science and Applications, 13(8), 530–538. https://doi.org/10.14569/IJACSA.2022.0130861

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free