Natural insertions within the N-terminal region of the coat protein of Maize dwarf mosaic potyvirus (MDMV) have an effect on the RNA stability.
- PubMed: 19937270
Abstract
A 13 amino acid residue insertion was found in the N-terminal region of the coat protein of several Maize dwarf mosaic virus isolates (MDMV). These insertions seem to be the result of a direct duplication event, but differ in some positions. In order to evaluate the influence of the insertion on the RNA secondary structure and stability, the RNA secondary structures and minimum free energies (MFE) of all existing MDMV coat protein sequences were estimated using three different softwares, the Vienna RNA Package, NUPACK, and UNAFold, and compared to the secondary structure and MFE of various random sequence collections preserving the nucleotide distribution of MDMV. The bioinformatic analysis showed that the insertion stabilizes the RNA structure of the coat protein gene.
Natural insertions within the N-terminal region of the coat protein of Maize dwarf mosaic potyvirus (MDMV) have an effect on the RNA stability.
of Maize dwarf mosaic potyvirus (MDMV) have an effect
on the RNA stability
Kathrin Petrik • Endre Sebestye´n • Gyo¨ngyve´r Gell •
Ervin Bala´zs
Received: 8 July 2009 / Accepted: 10 November 2009 / Published online: 24 November 2009
Springer Science+Business Media, LLC 2009
Abstract A 13 amino acid residue insertion was found in
the N-terminal region of the coat protein of several Maize
dwarf mosaic virus isolates (MDMV). These insertions seem
to be the result of a direct duplication event, but differ in
some positions. In order to evaluate the influence of the
insertion on the RNA secondary structure and stability, the
RNA secondary structures and minimum free energies
(MFE) of all existing MDMV coat protein sequences were
estimated using three different softwares, the Vienna RNA
Package, NUPACK, and UNAFold, and compared to the
secondary structure and MFE of various random sequence
collections preserving the nucleotide distribution of
MDMV. The bioinformatic analysis showed that the inser-
tion stabilizes the RNA structure of the coat protein gene.
Keywords Potyvirus Coat protein
RNA secondary structure RNA stability Insertion
Potyviruses are single-stranded positive-sense RNA viru-
ses, allowing them to escape from the innate defence
mechanisms and acquired immune surveillance of the host,
due to the inherently error-prone replication, and to rapidly
adapt to new cell types, tissues or species [1]. The N-ter-
minal region of potyvirus coat proteins is one of the most
diverse regions within their genome and is important for the
adaptation of the virus to its host [2]. Maize dwarf mosaic
potyvirus (MDMV), which infects maize, sweet corn, mil-
let, and several sorghum species, is an economically
important virus disease worldwide. The morphological
structure is composed of approximately 2,000 copies of the
coat protein, the main functions of which are the encapsi-
dation of the viral RNA, virus replication, cell-to-cell and
systemic movement, and the regulation of viral RNA
amplification [3, 4]. The coat protein core and C-terminal
regions are mainly involved in the encapsidation and cell-
to-cell movement [5]. The more flexible N-terminal region
plays a role in long distance and systemic movement [6, 7]
and also contains the DAG motif, which is responsible for
the competence of aphid transmission. Differences in the
N-terminal region are important for potyvirus discrimina-
tion [8]. The N- and C-termini of the coat proteins are
surface-located and these termini constitute the most
immunodominant regions in the virus particle. To´bia´s and
Palkovics [9] described a 13 amino acid (39 bp) residue
insertion within the N-terminal region of the MDMV coat
protein, which appears to be the result of a direct duplica-
tion event, although the duplicated region has since accu-
mulated mutations. Earlier investigations showed that these
insertions are relatively frequent, and four other MDMV
isolates with an insertion were recovered [10].
In addition to a large number of functional studies on the
RNA secondary structure of the 50- and 30-terminal regions
of various virus species [11, 12] studies were also carried
out on genome-scale ordered RNA structures (GORS),
which show significant variability between genera [1, 13]. It
was shown that GORS are absent from potyviruses, sug-
gesting that the association with nucleocapsid proteins
limits or restricts the need for this type of viral structure [1].
In the present study, a minimum free energy (MFE)
analysis of the viral RNA is described to estimate the
influence of the 39-bp insertions within the N-terminal
region of MDMV on the RNA secondary structure and
stability.
K. Petrik (&) E. Sebestye´n G. Gell E. Bala´zs
Department of Applied Genomics, Agricultural Research
Institute of the Hungarian Academy of Sciences, Brunszvik u. 2,
2462 Martonva´sa´r, Hungary
e-mail: kathrin.petrik@gmail.com
123
Virus Genes (2010) 40:135–139
DOI 10.1007/s11262-009-0425-3
ab
le
1
T
he
av
er
ag
e
M
FE
,
st
an
da
rd
de
vi
at
io
n,
m
in
im
um
an
d
m
ax
im
um
of
th
e
or
ig
in
al
,
ra
nd
om
an
d
sh
uf
fle
d
se
qu
en
ce
s
va
lu
es
pr
ed
ic
te
d
by
V
ie
nn
a,
U
N
A
Fo
ld
an
d
N
U
PA
C
K
V
ie
nn
a
R
N
A
Pa
ck
ag
e
N
U
PA
C
K
U
N
A
Fo
ld
M
D
M
V
-0
70
2
M
D
M
V
-0
70
2
M
D
M
V
-0
70
2
A
ve
ra
ge
SD
M
in
.
M
ax
.
A
ve
ra
ge
SD
M
in
.
M
ax
.
A
ve
ra
ge
SD
M
in
.
M
ax
.
O
ri
gi
na
l
-
26
0.
95
N
A
N
A
N
A
O
ri
gi
na
l
-
20
8.
3
N
A
N
A
N
A
O
ri
gi
na
l
-
21
6.
4
N
A
N
A
N
A
R
an
do
m
-
25
3.
91
14
.3
6
-
30
4.
20
-
21
1.
20
R
an
do
m
-
19
6.
6
13
.3
-
23
9.
1
-
15
8.
4
R
an
do
m
-
20
0.
3
13
.6
-
24
2.
9
-
15
7.
7
Sh
uf
fle
d
-
25
6.
43
8.
08
-
28
3.
30
-
23
1.
57
Sh
uf
fle
d
-
19
7.
2
8.
2
-
22
8.
7
-
17
6.
1
Sh
uf
fle
d
-
20
1.
4
8.
2
-
23
6.
4
-
17
8.
6
R
an
do
m
su
rr
ou
nd
-
25
3.
65
14
.2
2
-
30
0.
60
-
20
4.
44
R
an
do
m
su
rr
ou
nd
-
19
6.
5
13
-
24
4.
9
-
15
2.
4
R
an
do
m
su
rr
ou
nd
-
20
0.
1
13
.3
-
25
0.
3
-
15
7.
8
R
an
do
m
in
se
rt
-
26
2.
81
3.
66
-
27
6.
36
-
25
3.
60
R
an
do
m
in
se
rt
-
21
1.
7
3.
2
-
22
2.
3
-
20
2.
9
R
an
do
m
in
se
rt
-
21
8.
2
3.
3
-
22
9.
1
-
20
7.
8
M
D
M
V
-0
80
1
M
D
M
V
-0
80
1
M
D
M
V
-0
80
1
A
ve
ra
ge
SD
M
in
.
M
ax
.
A
ve
ra
ge
SD
M
in
.
M
ax
.
A
ve
ra
ge
SD
M
in
.
M
ax
.
O
ri
gi
na
l
-
27
6.
02
N
A
N
A
N
A
O
ri
gi
na
l
-
22
1.
2
N
A
N
A
N
A
O
ri
gi
na
l
-
22
1.
4
N
A
N
A
N
A
R
an
do
m
-
25
4.
13
14
.5
9
-
29
8.
49
-
20
6.
90
R
an
do
m
R
an
do
m
Sh
uf
fle
d
-
25
9.
31
8.
04
-
28
6.
02
-
23
5.
16
Sh
uf
fle
d
-
20
0.
1
8.
2
-
22
9.
3
-
17
2.
4
Sh
uf
fle
d
-
20
4.
7
8.
2
-
23
1
-
17
6.
7
R
an
do
m
su
rr
ou
nd
-
25
6.
30
14
.2
2
-
29
8.
81
-
21
0.
24
R
an
do
m
su
rr
ou
nd
-
19
9.
4
12
.9
-
24
6.
1
-
15
6.
1
R
an
do
m
su
rr
ou
nd
-
20
2.
8
13
.3
-
25
1.
8
-
16
3.
2
R
an
do
m
in
se
rt
-
27
5.
61
3.
37
-
28
8.
00
-
26
6.
90
R
an
do
m
in
se
rt
-
21
8.
6
3.
3
-
22
9.
1
-
20
9.
9
R
an
do
m
in
se
rt
-
22
2.
5
3.
3
-
23
4.
6
-
21
4.
2
M
D
M
V
-0
81
4
M
D
M
V
-0
81
4
M
D
M
V
-0
81
4
A
ve
ra
ge
SD
M
in
.
M
ax
.
A
ve
ra
ge
SD
M
in
.
M
ax
.
A
ve
ra
ge
SD
M
in
.
M
ax
.
O
ri
gi
na
l
-
27
7.
60
N
A
N
A
N
A
O
ri
gi
na
l
-
21
0.
3
N
A
N
A
N
A
O
ri
gi
na
l
-
21
4.
6
N
A
N
A
N
A
R
an
do
m
-
25
3.
91
14
.8
3
-
30
2.
20
-
20
5.
60
R
an
do
m
R
an
do
m
Sh
uf
fle
d
-
25
5.
15
8.
04
-
28
2.
74
-
23
4.
02
Sh
uf
fle
d
-
19
7.
3
7.
9
-
22
0.
9
-
17
2.
4
Sh
uf
fle
d
-
20
1.
7
8.
1
-
22
7.
5
-
17
8
R
an
do
m
su
rr
ou
nd
-
25
3.
35
14
.2
0
-
29
9.
52
-
20
6.
54
R
an
do
m
su
rr
ou
nd
-
19
6.
3
13
.1
-
24
0.
7
-
15
2.
6
R
an
do
m
su
rr
ou
nd
-
19
9.
8
13
.4
-
24
8
-
15
9.
5
R
an
do
m
in
se
rt
-
27
8.
23
3.
46
-
29
0.
30
-
26
9.
90
R
an
do
m
in
se
rt
-
21
0.
7
3.
3
-
22
2.
2
-
20
2.
2
R
an
do
m
in
se
rt
-
21
4.
7
3.
4
-
22
9.
3
-
20
5.
8
A
ve
ra
ge
SD
A
ve
ra
ge
SD
A
ve
ra
ge
SD
A
ll
M
D
M
V
is
ol
at
es
w
it
ho
ut
in
se
rt
io
n
-
25
3.
86
80
6.
45
07
A
ll
M
D
M
V
is
ol
at
es
w
it
ho
ut
in
se
rt
io
n
-
19
5.
25
66
7.
17
01
A
ll
M
D
M
V
is
ol
at
es
w
it
ho
ut
in
se
rt
io
n
-
20
0.
51
59
7.
17
01
M
D
M
V
is
ol
at
es
fr
om
H
un
ga
ry
-
25
3.
90
86
6.
59
22
M
D
M
V
is
ol
at
es
fr
om
H
un
ga
ry
-
19
3.
50
89
6.
74
45
M
D
M
V
is
ol
at
es
fr
om
H
un
ga
ry
-
19
9.
11
85
6.
74
45
M
D
M
V
is
ol
at
es
fr
om
B
ul
ga
ri
a
-
25
7.
92
40
10
.2
32
2
M
D
M
V
is
ol
at
es
fr
om
B
ul
ga
ri
a
-
20
4.
10
00
9.
81
10
M
D
M
V
is
ol
at
es
fr
om
B
ul
ga
ri
a
-
20
8.
40
00
9.
81
10
R
an
do
m
1
0
0
0
87
6
bp
-
24
2.
48
96
14
.1
28
7
R
an
do
m
1
0
0
0
87
6
bp
-
18
8.
31
11
13
.2
93
4
R
an
do
m
1
0
0
0
87
6
bp
-
19
1.
70
57
13
.0
01
2
M
D
M
V
is
ol
at
es
fr
om
Sp
ai
n
-
25
3.
57
47
3.
97
93
M
D
M
V
is
ol
at
es
fr
om
Sp
ai
n
-
19
9.
73
33
4.
17
49
M
D
M
V
is
ol
at
es
fr
om
Sp
ai
n
-
20
4.
14
67
4.
17
49
R
an
do
m
1
0
0
0
86
7
-
23
9.
64
75
13
.7
49
8
R
an
do
m
1
0
0
0
86
7
-
18
6.
30
96
12
.5
00
6
R
an
do
m
1
0
0
0
86
7
-
18
9.
66
78
12
.3
63
9
M
D
M
V
is
ol
at
es
w
it
h
in
se
rt
io
n
-
27
3.
48
20
8.
48
57
M
D
M
V
is
ol
at
es
w
it
h
in
se
rt
io
n
-
21
5.
50
60
5.
92
13
M
D
M
V
is
ol
at
es
w
it
h
in
se
rt
io
n
-
21
8.
56
00
5.
92
13
R
an
do
m
1
0
0
0
91
5
bp
-
25
3.
90
88
14
.3
61
4
R
an
do
m
1
0
0
0
91
5
bp
-
19
6.
64
70
13
.5
71
8
R
an
do
m
1
0
0
0
91
5
bp
-
20
0.
27
24
13
.3
01
0
T
he
av
er
ag
e
va
lu
es
,
th
e
st
an
da
rd
de
vi
at
io
n,
m
in
im
um
an
d
m
ax
im
um
va
lu
es
w
er
e
ba
se
d
on
th
e
or
ig
in
al
se
qu
en
ce
or
a
gi
ve
n
se
qu
en
ce
da
ta
se
t,
co
nt
ai
ni
ng
a
1,
00
0
se
qu
en
ce
s
(s
ee
te
xt
)
N
A
:
no
t
av
ai
la
bl
e
Virus Genes (2010) 40:135–139 137
123
inspected by comparing the MFE of the RNA of 93
MDMV coat protein sequences to various types of random
and shuffled sequences. The dataset used in the compari-
sons contained a collection of 1,000 random sequences
with the same trinucleotide distribution as the MDMV coat
protein sequence RNA. Sequences were generated for three
different lengths: random1000867 bp for Spanish MDMV
isolates (AJ416633-AJ416645, AJ416648) missing the first
three amino acids, random1000915 bp for MDMV isolates
with an insertion (AJ563725, FM883164, FM883177,
FM883200, and DQ973169) and random1000876 bp for all
other MDMV isolates (FM883165-FM883176, FM883178-
FM883199, FM883201-FM883228, AJ563726, A34974,
AF395135, AJ001691, AJ542536, AJ563724, AM490848,
AM490849, AM110758, AM491607, MDU07216, and
NC003377). The average MFE values and the standard
deviation of a given random sequence dataset, containing
1,000 sequences, were calculated and compared with the
MFE values of the respective MDMV isolates.
The comparison of the random sequence datasets with the
MDMV sequence MFE values generated by the Vienna RNA
Package showed that the MFE of MDMV coat protein RNA
without the 39-bp insertion was 5.4% lower on average than
the MFE of the corresponding random sequence collection
(random1000867 bp and random1000876 bp). Isolates with
insertions had 7.7% lower MFE values on average than the
appropriate random sequences (random1000915 bp), while
the MFE average of all MDMV sequences with an insertion
was 7.2 % lower than the MFE average of all MDMV
sequences without an insertion (Fig. 1a). In the following
step, three sequences (FM883164: Mv0801, FM883177:
Mv0814, and FM883200: Mv0702) containing the insertion
were compared to four different types of shuffled or random
sequences (Fig. 1b). Every sequence type was generated for
a 1,000 variations. The first sequence type was identical to
the previously mentioned 915 bp random sequences. The
second sequence type contained the original bases of a given
sequence but in shuffled positions. The third sequence type
consisted of sequences generated with the original insertion
sequence and a random surrounding sequence with the same
trinucleotide distribution as the MDMV isolate. The fourth
sequence type consisted of a 39-bp length random insert
sequences surrounded by the original sequence of the isolate
(Fig. 1c).
The results of UNAFold and NUPACK showed similar
trends, although the exact minimum free energy values are
varying, due to the differences in the algorithms. The
average MFE, the standard deviation, and minimum and
maximum values predicted by the various programs of
these random, shuffled and original sequences are descri-
bed in Table 1.
After comparing the MFE values of the MDMV
sequences and the different generated sequences, a one-way
ANOVA test was carried out on the results of the Vienna
RNA Package to determine the statistically significant dif-
ferences as follows. The MFE values of the above-men-
tioned insert-containing sequences and the average MFE
values of the 915-bp length random sequences, shuffled
isolates, 876 bp original insert containing random sequen-
ces and original isolates with random inserts were selected
(Table 1). The result of the test (F = 8.024722019,
Fcrit = 3.478049691, LSD = 9.327616604) showed that
most of the differences between the groups were statisti-
cally significant at the 95% level. The only exception was
the difference between the original sequences and the
sequences with the sequence type 4 (random insertion and
original surrounding sequence). Significant differences
between the isolate and the various types of shuffled or
random sequences are marked with an asterisk (*).
Insertions within the N-terminal region of the coat protein
are a frequent feature of potyviruses and were associated
with virus adaptation to outside influences and immuno-
dominant features. In general, it can be assumed that pot-
yviruses have a mutational hotspot in the N-terminal region
of the coat protein resulting in insertions and deletions. In
this study, the influences of the insertion of different MDMV
isolates on the basis of the different MFE datasets were
investigated. The positive effect of the insertion on the RNA
stability of MDMV isolates was shown. However, this sta-
bilizing effect does not depend on the composition of the
sequence, but only on the presence of it. The increased RNA
stability could be a further reason for the location specific
integration of short RNA segments in the MDMV genome.
Since the RNA secondary structure is known to play a fun-
damental role in RNA viral replication [1], the decrease in
the MFE values of the RNA sequence with the insertions
raises also the possibility that the insertion may have an
influence of the virus replication. This is the first bioinfor-
matic report on the possibly positive influence of an insertion
within the RNA sequence of the coat protein of MDMV.
Acknowledgments This study was supported by grants from the
German Academic Exchange Service (DAAD), the Hungarian Sci-
entific Research Fund (OTKA NI 61023) and the EU-FP7-REGPOT
2007-1 (AGRISAFE 203288) project. The authors are grateful to
B. Harasztos for the English revision of the manuscript.
References
1. P. Simmonds, A. Tuplin, D.J. Evans, RNA 10, 1337–1351
(2004). doi:10.1261/rna.7640104
2. Z. Ullah, B. Chai, S. Hammar, B. Raccah, A. Gal-On, R. Grumat,
Physiol. Mol. Plant Pathol. 63, 129–139 (2003). doi:10.1016/
j.pmpp.2003.11.001
138 Virus Genes (2010) 40:135–139
123
4. S. Urcuqui-Inchima, A.L. Haenni, F. Bernardi, Virus Res. 74,
157–175 (2001). doi:10.1016/S0168-1702(01)00220-9
5. D.D. Shukla, C.W. Ward, Adv. Virus Res. 36, 273–314
(1989)
6. C.W. Ward, D.D. Shukla, Intervirology 32, 269–296 (1991)
7. V.V. Dolja, R. Haldeman, N.L. Robertson, W.G. Dougherty,
J.C. Carrington, EMBO J. 13, 1482–1491 (1994)
8. Y. Zhong, A. Guo, C. Li, B. Zhuang, M. Lai, C. Wei, J. Luo, Y. Li,
Virus Genes 30, 75–83 (2005). doi:10.1007/s11262-004-4584-y
9. I. To´bia´s, L. Palkovics, J. Phytopathol. 152, 445–447 (2004). doi:
10.1111/j.1439-0434.2004.00863.x
10. G. Gell, K. Petrik, E. Bala´zs, Virus Genes, under submission
11. K. Thivierge, V. Nicaise, P.J. Dufresne, S. Cotton, J.-F. Laliberte´,
O. Le Gall, M.G. Fortin, Plant Physiol. 138, 1822–1827 (2005).
doi:10.1104/pp.105.064105
12. J.C. McCormack, X. Yuan, Y.G. Yingling, R.E. Zamora, B.A.
Shapiro, A.E. Simon, J. Virol. 82, 8706–8720 (2008). doi:
10.1128/JVI.00416-08
13. M. Davis, S.M. Sagan, J.P. Pezacki, D.J. Evans, P. Simmonds,
J. Virol. 82, 11824–11836 (2008). doi:10.1128/JVI.01078-08
14. L. Hofacker, W. Fontana, P.F. Stadler, L.S. Bonhoeffer,
M. Tacker, P. Schuster, Monatshefte fu¨r Chemie/Chemical
Monthly 125, 167–188 (1994)
15. N.R. Markham, M. Zuker, Methods Mol. Biol. 453, 3–31 (2008)
16. R.M. Dirks, N.A. Pierce, J. Comput. Chem. 25, 1295–1304
(2004). doi:10.1002/jcc.20057
17. M. Zuker, P. Stiegler Nucl. Acids Res. 9, 133–148 (1981)
18. S.J. Schroeder, J. Virol. 83, 6326–6334 (2008). doi:10.1128/
JVI.00251-09
19. P. Rice, I. Longden, A. Bleasby, EMBOSS: Trends Genetics 16,
276–277 (2000). doi:10.1016/S0168-9525(00)02024-2
Virus Genes (2010) 40:135–139 139
123
Sign up today - FREE
Mendeley saves you time finding and organizing research. Learn more
- All your research in one place
- Add and import papers easily
- Access it anywhere, anytime


