Divide-and-conquer parallelism for learning mixture models

Takaya Kawakatsu; Akira Kinoshita; Atsuhiro Takasu; Jun Adachi

Conference Proceedings

Divide-and-conquer parallelism for learning mixture models

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2016) 9940 LNCS 23-47

DOI: 10.1007/978-3-662-53455-7_2

1Citations

5Readers

Get full text

Abstract

From the viewpoint of load balancing among processors, the acceleration of machine-learning algorithms by using parallel loops is not realistic for some models involving hierarchical parameter estimation. There are also other serious issues such as memory access speed and race conditions. Some approaches to the race condition problem, such as mutual exclusion and atomic operations, degrade the memory access performance. Another issue is that the first-in-first-out (FIFO) scheduler supported by frameworks such as Hadoop can waste considerable time on queuing and this will also affect the learning speed. In this paper, we propose a recursive divide-and-conquer-based parallelization method for high-speed machine learning. Our approach exploits a tree structure for recursive tasks, which enables effective load balancing. Race conditions are also avoided, without slowing down the memory access, by separating the variables for summation. We have applied our approach to tasks that involve learning mixture models. Our experimental results show scalability superior to FIFO scheduling with an atomic-based solution to race conditions and robustness against load imbalance.

Author supplied keywords

Cite

CITATION STYLE

APA

Kawakatsu, T., Kinoshita, A., Takasu, A., & Adachi, J. (2016). Divide-and-conquer parallelism for learning mixture models. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 9940 LNCS, pp. 23–47). Springer Verlag. https://doi.org/10.1007/978-3-662-53455-7_2

Divide-and-conquer parallelism for learning mixture models

Abstract

Author supplied keywords

Cite

Register to see more suggestions