A customized program for the identification of conserved protein sequence motifs

1Citations
Citations of this article
14Readers
Mendeley users who have this article in their library.

This article is free to access.

Abstract

We searched for viral protein sequences that could be important for tissue tropism. To achieve this goal, human pathogenic viruses were classified according to the tissue they infect (e.g., pulmonary), irrespective of whether they were enveloped or non-enveloped RNA or DNA viruses. Next, we developed an amino acid sequence alignment program and identified the conserved amino acid motif, VAIVLGG, in alphaviruses. The VAIVLGG sequence is located on the structural capsid protein of the chikungunya virus, a mosquito-borne arthrogenic member of the alphaviruses. Capsid protein translocation onto the host cell membrane is a required step for virion budding. Our identified VAIVLGG consensus sequence might potentially be used for developing a pan-vaccine effective against alphaviruses. METHOD SUMMARY Viral protein sequences are fed into a battery of rolling hashes of 6-14 length, and amino acid subsequences are performed with a time complexity of O(n). The hashes are the keys in a HashMap with values of the sequence ID and index; the space complexity is O(n). A normal alignment is done on 14 length matches to discover longer matches. The upper bound on the time complexity for the alignment is O(k 2*m), where k is the number of viruses containing a matching sequence and mis the length of the longest matching sequence.

Cite

CITATION STYLE

APA

Mian, M., Talada, J., Klobas, A., Torres, S., Rasheed, Y., Javed, H., … Forough, R. (2019). A customized program for the identification of conserved protein sequence motifs. BioTechniques, 68(1), 45–47. https://doi.org/10.2144/btn-2019-0039

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free