Motivation: Gene fusions resulting from chromosomal aberrations are an important cause of cancer. The complexity of genomic changes in certain cancer types has hampered the identification of gene fusions by molecular cytogenetic methods, especially in carcinomas. This is changing with the advent of next-generation sequencing, which is detecting a substantial number of new fusion transcripts in individual cancer genomes. However, this poses the challenge of identifying those fusions with greater oncogenic potential amid a background of 'passenger' fusion sequences. Results: In the present work, we have used some recently identified genomic hallmarks of oncogenic fusion genes to develop a pipeline for the classification of fusion sequences, namely, Oncofuse. The pipeline predicts the oncogenic potential of novel fusion genes, calculating the probability that a fusion sequence behaves as 'driver' of the oncogenic process based on features present in known oncogenic fusions. Cross-validation and extensive validation tests on independent datasets suggest a robust behavior with good precision and recall rates. We believe that Oncofuse could become a useful tool to guide experimental validation studies of novel fusion sequences found during next-generation sequencing analysis of cancer transcriptomes. Availability and implementation: Oncofuse is a naive Bayes Network Classifier trained and tested using Weka machine learning package. The pipeline is executed by running a Java/Groovy script, available for download at www.unav.es/genetica/oncofuse.html. Contact: fnovo@unav.es Supplementary information: Supplementary data are available at Bioinformatics online. © The Author 2013. Published by Oxford University Press.
CITATION STYLE
Shugay, M., De Mendíbil, I. O., Vizmanos, J. L., & Novo, F. J. (2013). Oncofuse: A computational framework for the prediction of the oncogenic potential of gene fusions. Bioinformatics, 29(20), 2539–2546. https://doi.org/10.1093/bioinformatics/btt445
Mendeley helps you to discover research relevant for your work.