Application of canonical correlation analysis for identifying viral integration preferences

16Citations
Citations of this article
26Readers
Mendeley users who have this article in their library.

Abstract

Motivation: Gene therapy aims at using viral vectors for attaching helpful genetic code to target genes. Therefore, it is of great importance to develop methods that can discover significant patterns around viral integration sites. Canonical correlation analysis is an unsupervised statistical tool that is used to describe the relations between two related views of the same semantic object, which fits well for identifying such salient patterns.Results: Proposed method is demonstrated on a sequence dataset obtained from a study on HIV-1 preferred integration regions. The subsequences on the left and right sides of the integration points are given to the method as the two views, and statistically significant relations are found between sequence-driven features derived from these two views, which suggest that the viral preference must be the factor responsible for this correlation. We found that there are significant correlations at x=5 indicating a palindromic behavior surrounding the viral integration site, which complies with the previously reported results. © The Author 2012. Published by Oxford University Press. All rights reserved.

Cite

CITATION STYLE

APA

Gumus, E., Kursun, O., Sertbas, A., & Ustek, D. (2012). Application of canonical correlation analysis for identifying viral integration preferences. Bioinformatics, 28(5), 651–655. https://doi.org/10.1093/bioinformatics/bts027

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free