Deep polygenic neural network for predicting and identifying yield-associated genes in Indonesian rice accessions

Abstract

As the fourth most populous country in the world, Indonesia must increase its annual rice production rate to achieve national food security by 2050. One possible solution comes from the nanoscopic level: a genetic variant called a Single Nucleotide Polymorphism (SNP), which can point to significant yield-associated genes. The prior benchmark for this study was a statistical genetics model that used neither SNP position information nor an attention mechanism. Hence, we developed a novel deep polygenic neural network, named the NucleoNet model, to address these limitations. The NucleoNets combine several prominent components: positional SNP encoding, the context vector, wide models, Elastic Net, and Shannon's entropy loss. This polygenic modeling reached a Mean Squared Error (MSE) of 2.779 with a Symmetric Mean Absolute Percentage Error (SMAPE) of 47.156%, while revealing 15 new important SNPs. Furthermore, the NucleoNets reduced the MSE by up to 32.28% compared to the Ordinary Least Squares (OLS) model. Through the ablation study, we learned that combining Xavier-distributed weight initialization with Normal-distributed bias initialization yielded a more varied set of important SNPs across the 12 chromosomes. Our findings confirm that the NucleoNet model outperformed the OLS model and identified SNPs that are important to Indonesian rice yields.
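
The two error metrics and the relative improvement over OLS quoted in the abstract can be sketched as follows. This is a minimal illustration, not the authors' code: the function names are hypothetical, and the SMAPE variant shown (mean of 2|y - ŷ| / (|y| + |ŷ|), in percent) is one common definition. The abstract does not state which variant the paper uses.

```python
def mse(y_true, y_pred):
    """Mean Squared Error: average of squared residuals."""
    return sum((t - p) ** 2 for t, p in zip(y_true, y_pred)) / len(y_true)


def smape(y_true, y_pred):
    """Symmetric MAPE in percent (assumed variant, see lead-in)."""
    total = 0.0
    for t, p in zip(y_true, y_pred):
        denom = abs(t) + abs(p)
        if denom != 0:  # a 0/0 term is counted as zero error
            total += 2 * abs(t - p) / denom
    return 100 * total / len(y_true)


def relative_reduction(baseline, model):
    """Percentage by which `model` error is lower than `baseline` error."""
    return 100 * (baseline - model) / baseline


# Toy values for illustration only; they are not data from the study.
y_true = [1.0, 2.0, 3.0]
y_pred = [1.0, 2.0, 5.0]
print(mse(y_true, y_pred))
print(smape(y_true, y_pred))
print(relative_reduction(4.0, 3.0))
```

Under this reading, a statement such as "reduced the MSE by up to 32.28%" corresponds to `relative_reduction(mse_ols, mse_nucleonet)` evaluated on the same test set.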

Citation (APA)

Dominic, N., Cenggoro, T. W., Budiarto, A., & Pardamean, B. (2022). Deep polygenic neural network for predicting and identifying yield-associated genes in Indonesian rice accessions. Scientific Reports, 12(1). https://doi.org/10.1038/s41598-022-16075-9
