Deep reinforcement learning-based balancing and sequencing approach for mixed model assembly lines

4Citations
Citations of this article
11Readers
Mendeley users who have this article in their library.

Abstract

A multi-agent iterative optimisation method based on deep reinforcement learning is proposed for the balancing and sequencing problem in mixed model assembly lines. Based on the Markov decision process model for balancing and sequencing, a balancing agent using a deep deterministic policy gradient algorithm, a sequencing agent using an Actor–Critic algorithm, as well as an iterative interaction mechanism between these agents' output solutions are designed for realising the global optimisation of mixed model assembly lines. The exchange of solution information including assembly time and station workload in the iterative interaction realises the coordination of the worker assignment policy at the balancing stage and the production arrangement policy at the sequencing stage for the minimisation of work overload and idle time at stations. Through the comparative experiments with heuristic rules, genetic algorithms, and the original deep reinforcement learning algorithm, the effectiveness of the proposed method is demonstrated and discussed for small-scale instances as well as large-scale ones.

Cite

CITATION STYLE

APA

Lv, Y., Tan, Y., Zhong, R., Zhang, P., Wang, J., & Zhang, J. (2022). Deep reinforcement learning-based balancing and sequencing approach for mixed model assembly lines. IET Collaborative Intelligent Manufacturing, 4(3), 181–193. https://doi.org/10.1049/cim2.12061

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free