We describe a new method for identifying the sequences that signal the start of translation, and the boundaries between exons and introns (donor and acceptor sites) in human mRNA. According to the mandatory keyword, ORGANISM, and feature key, CDS, a large set of standard data for each signal site was extracted from the ASCII flat file, gbpri.seq, in the GenBank release 108.0. This was used to generate the scoring matrices, which summarize the sequence information for each signal site. The scoring matrices take into account the independent nucleotide frequencies between adjacent bases in each position within the signal site regions, and the relative weight on each nucleotide in proportion to their probabilities in the known signal sites. Using a scoring scheme that is based on the nucleotide scoring matrices, the method has great sensitivity and specificity when used to locate signals in uncharacterized human genomic DNA. These matrices are especially effective at distinguishing true and false sites. © BSRK & Springer-Verlag 2002.
CITATION STYLE
Kim, K. B., Park, K., & Eun, B. K. (2002). A method for identifying splice sites and translation start sites in human genomic sequences. Journal of Biochemistry and Molecular Biology, 35(5), 513–517. https://doi.org/10.5483/bmbrep.2002.35.5.513
Mendeley helps you to discover research relevant for your work.