Modern computing technologies are increasingly getting data-centric, addressing a variety of challenges in storing, accessing, processing, and streaming massive amounts of structured and unstructured data effectively. An important analytical task in a number of scientific and technological domains is to retrieve information from all these data, aiming to get a deeper insight into the content represented by the data in order to obtain some useful, often not explicitly stated knowledge and facts, related to a particular domain of interest. The major issue is the size, structural complexity, and frequency of the analyzed data' updates (i.e., the 'big data' aspect), which makes the use of traditional analysis techniques, tools, and infrastructures ineffective. We introduce an innovative approach to parallelise data-centric applications based on the Message-Passing Interface. In contrast to other known parallelisation technologies, our approach enables a very high-utilization rate and thus low costs of using productional high-performance computing and Cloud computing infrastructures. The advantages of the technique are demonstrated on a challenging Semantic Web application that is performing web-scale reasoning.
CITATION STYLE
Cheptsov, A., & Koller, B. (2015). Leveraging High-Performance Computing Infrastructures to Web Data Analytic Applications by Means of Message-Passing Interface. In Modeling and Optimization in Science and Technologies (Vol. 4, pp. 167–185). Springer Verlag. https://doi.org/10.1007/978-3-319-09177-8_7
Mendeley helps you to discover research relevant for your work.