Chromosomal rearrangements that create novel gene products, termed fusion genes, have been identified as driving events in the development of multiple types of cancer. Because these gene products typically do not exist in normal cells, they represent valuable prognostic and therapeutic targets. Advances in next-generation sequencing and computational approaches have greatly improved our ability to detect and identify fusion genes. Nevertheless, these approaches require significant computational resources. Here we describe an approach that leverages cloud computing technologies to perform fusion gene detection from RNA sequencing data at any scale. We also highlight methods for enhancing the reproducibility of bioinformatics analyses; these methods may be applied to any next-generation sequencing experiment.
Arsenijevic, V., & Davis-Dusenbery, B. N. (2016). Reproducible, scalable fusion gene detection from RNA-seq. In Methods in Molecular Biology (Vol. 1381, pp. 223–237). Humana Press Inc. https://doi.org/10.1007/978-1-4939-3204-7_13