Towards heterogeneous network alignment: Design and implementation of a large-scale data processing framework

Marianna Milano; Pierangelo Veltri; Mario Cannataro; Pietro H. Guzzi

Conference ProceedingsOPEN ACCESS

Towards heterogeneous network alignment: Design and implementation of a large-scale data processing framework

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2019) 11339 LNCS 692-703

DOI: 10.1007/978-3-030-10549-5_54

2Citations

5Readers

Abstract

The importance of the use of networks to model and analyse biological data and the interplay of bio-molecules is widely recognised. Consequently, many algorithms for the analysis and the comparison of networks (such as alignment algorithms) have been developed in the past. Recently, many different approaches tried to integrate into a single model the interplay of different molecules, such as genes, transcription factors and microRNAs. A possible formalism to model such scenario comes from node coloured networks (or heterogeneous networks) implemented as node/ edge-coloured graphs. Consequently, the need for the introduction of alignment algorithms able to analyse heterogeneous networks arises. To the best of our knowledge, all the existing algorithms are not able to mine heterogeneous networks. We propose a two-step alignment strategy that receives as input two heterogeneous networks (node-coloured graphs) and a similarity function among nodes of two networks extending the previous formulations. We first build a single alignment graph. Then we mine this graph extracting relevant subgraphs. Despite this simple approach, the analysis of such networks relies on graph and subgraph isomorphism and the size of the data is still growing. Therefore the use of high-performance data analytics framework is needed. We here present HetNetAligner a framework built on top of Apache Spark. We also implemented our algorithm, and we tested it on some selected heterogeneous biological networks. Preliminary results confirm that our method may extract relevant knowledge from biological data reducing the computational time.

Author supplied keywords

Cite

CITATION STYLE

APA

Milano, M., Veltri, P., Cannataro, M., & Guzzi, P. H. (2019). Towards heterogeneous network alignment: Design and implementation of a large-scale data processing framework. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 11339 LNCS, pp. 692–703). Springer Verlag. https://doi.org/10.1007/978-3-030-10549-5_54

Towards heterogeneous network alignment: Design and implementation of a large-scale data processing framework

Abstract

Author supplied keywords

Cite

Register to see more suggestions