Abstract
We searched for viral protein sequences that could be important for tissue tropism. To this end, human pathogenic viruses were classified according to the tissue they infect (e.g., pulmonary), irrespective of whether they were enveloped or non-enveloped RNA or DNA viruses. Next, we developed an amino acid sequence alignment program and identified a conserved amino acid motif, VAIVLGG, in alphaviruses. The VAIVLGG sequence is located on the structural capsid protein of the chikungunya virus, a mosquito-borne arthrogenic member of the alphaviruses. Capsid protein translocation onto the host cell membrane is a required step for virion budding. The VAIVLGG consensus sequence we identified might be used for developing a pan-vaccine effective against alphaviruses.
METHOD SUMMARY
Viral protein sequences are fed into a battery of rolling hashes with window lengths of 6 to 14, so that amino acid subsequences of each length are hashed with a time complexity of O(n). The hashes are the keys in a HashMap whose values are the sequence ID and index; the space complexity is O(n). A normal alignment is then performed on the length-14 matches to discover longer matches. The upper bound on the time complexity for the alignment is O(k^2 * m), where k is the number of viruses containing a matching sequence and m is the length of the longest matching sequence.
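The hashing step in the method summary can be illustrated with a minimal Python sketch. It is not the authors' program: the function names, the hash base and modulus, and the toy sequences are all assumptions made for illustration, and the follow-up alignment that extends length-14 matches is not shown. The sketch hashes every window of a given length in O(n) with a polynomial rolling hash, stores (sequence ID, index) pairs under each hash, and reports motifs present in every input sequence.

```python
from collections import defaultdict

# Assumed hash parameters; the source does not state which rolling hash is used.
BASE = 257
MOD = (1 << 61) - 1


def rolling_hashes(seq, k):
    """Yield (start_index, hash) for every length-k window of seq in O(len(seq))."""
    if len(seq) < k:
        return
    high = pow(BASE, k - 1, MOD)  # weight of the residue leaving the window
    h = 0
    for ch in seq[:k]:
        h = (h * BASE + ord(ch)) % MOD
    yield 0, h
    for i in range(k, len(seq)):
        h = (h - ord(seq[i - k]) * high) % MOD  # drop the leftmost residue
        h = (h * BASE + ord(seq[i])) % MOD      # append the new residue
        yield i - k + 1, h


def build_index(sequences, k):
    """Map each window hash to the (sequence ID, start index) pairs where it occurs."""
    index = defaultdict(list)
    for seq_id, seq in sequences.items():
        for start, h in rolling_hashes(seq, k):
            index[h].append((seq_id, start))
    return index


def shared_motifs(sequences, k):
    """Return {motif: sorted sequence IDs} for length-k motifs found in every sequence."""
    found = defaultdict(set)
    for hits in build_index(sequences, k).values():
        for seq_id, start in hits:
            # Re-read the substring so a hash collision cannot merge distinct motifs.
            found[sequences[seq_id][start:start + k]].add(seq_id)
    return {m: sorted(ids) for m, ids in found.items() if len(ids) == len(sequences)}


# Toy sequences (hypothetical, not real capsid proteins) that share VAIVLGG.
toy = {
    "virus_a": "MKTPVAIVLGGANEQ",
    "virus_b": "GGSVAIVLGGTRW",
    "virus_c": "AVAIVLGGKKP",
}
for k in range(6, 15):  # window lengths 6 to 14, as in the method summary
    motifs = shared_motifs(toy, k)
    if motifs:
        print(k, motifs)
```

Keeping the sequence ID and index as the stored values means each hit can be traced back to its position, which is what a later alignment step would need in order to extend a match in both directions.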
Mian, M., Talada, J., Klobas, A., Torres, S., Rasheed, Y., Javed, H., … Forough, R. (2019). A customized program for the identification of conserved protein sequence motifs. BioTechniques, 68(1), 45–47. https://doi.org/10.2144/btn-2019-0039