Load Balancing of Two-Sided Assembly Line Based on Deep Reinforcement Learning
Abstract
:1. Introduction
2. Literature Review
2.1. Two-Sided Assembly Line Balance Oriented to Load Balancing
2.2. Deep Reinforcement Learning
3. Mathematical Model for Load Balancing-Oriented TALBP
3.1. Assumptions
3.2. Parameters
3.3. Mathematical Model
4. Deep Reinforcement Learning Algorithm Based on DPPO and CNN (DPPO–CNN)
4.1. DPPO–CNN Agent
4.2. Task Assignment State of TAL
4.3. Decision of Selecting Tasks
4.4. Task Assignment
4.5. Reward Function
4.6. Overall Flow of DPPO–CNN
Algorithm 1 Load balancing-oriented TALBP based on DPPO–CNN |
1: Initializes the Actor–Critic network parameters , for the main process, maximum iteration number, experience pool capacity buffer size, maximum experience pool capacity max buffer size, batch data size, number of network updates of each round epoch; Subprocess Actor–Critic network parameters , . 2: for each episode do: 3: t = 1. 4: The main process empties the experience pool, sends its Actor–Critic network parameters , to each child process; each sub-process bilateral assembly line environment 1, 2, 3, …, m is initialized, generates states , , , …, ; each agent 1, 2, 3, …, m loads the network parameters of the main process. 5: while buffer size < max buffer size do: 6: while all tasks of m’s respective bilateral assembly lines have not been allocated do: 7: Agent 1, 2, 3, …, m observes the environment status , , , …, , respectively, and takes the tasks , , , …, to be assigned according to strategy . 8: Environment 1, 2, 3, …, m allocates tasks , , , …, separately, and feedback reward , , , …, . 9: . 10: Environment 1, 2, 3, …, m update states , , , …, . 11: end while 12: Agent 1, 2, 3, …, m stores the interactive trajectory (solving experience) , , , …, , respectively, in the experience pool and packages it as training data. 13: end while 14: for epoch in {1, 2, …, epochs} do: 15: Randomly extract the training data with the size of batch size from the experience pool. 16: Calculate the network loss function of the actor strategy of the master process; evaluate the critic loss function of the master process. 17: Update the main process Actor’s policy network . 18: Update the main process Critic’s evaluation network . 19: end for 20: . 21: end for |
5. Experimental Verification and Discussions
5.1. Implementation of the DPPO–CNN
5.2. Verification of DPPO–CNN in Term of Model Training
5.3. Verification of DPPO–CNN in Term of Solutions
6. Conclusions and Future Work Avenues
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Acknowledgments
Conflicts of Interest
Appendix A
NO. | Time | Side | Immediate Sucessors | NO. | Time | Side | Immediate Sucessors |
---|---|---|---|---|---|---|---|
1 | 16 | E | 5, 6, 7, 8 | 75 | 101 | E | 88, 97 |
2 | 30 | E | 3 | 76 | 5 | E | 77 |
3 | 7 | E | 4, 5, 6, 7 | 77 | 28 | E | 78 |
4 | 47 | E | 8 | 78 | 8 | E | 79 |
5 | 29 | E | 14 | 79 | 111 | E | 80 |
6 | 8 | E | 9 | 80 | 7 | E | 81 |
7 | 39 | E | 14 | 81 | 26 | E | 106 |
8 | 37 | E | 10 | 82 | 10 | E | 83, 89, 143, 146 |
9 | 32 | E | 14 | 83 | 21 | E | |
10 | 29 | E | 14 | 84 | 26 | E | 85 |
11 | 17 | E | 12 | 85 | 20 | E | |
12 | 11 | E | 13 | 86 | 21 | E | |
13 | 32 | E | 87 | 47 | E | ||
14 | 15 | E | 15, 16 | 88 | 23 | E | 111 |
15 | 53 | L | 17 | 89 | 13 | E | 90 |
16 | 53 | R | 17 | 90 | 19 | E | 79 |
17 | 8 | E | 18, 19 | 91 | 115 | E | 105 |
18 | 24 | L | 20 | 92 | 35 | E | 135 |
19 | 24 | R | 20 | 93 | 26 | L | |
20 | 8 | E | 21, 22, 23, 24 | 94 | 46 | E | |
21 | 7 | R | 25, 26, 27, 28 | 95 | 20 | E | 101 |
22 | 8 | L | 25, 26, 27, 28 | 96 | 31 | E | 104 |
23 | 14 | L | 25, 26, 27, 28 | 97 | 19 | E | |
24 | 13 | R | 25, 26, 27, 28 | 98 | 34 | E | 101 |
25 | 10 | R | 29 | 99 | 51 | E | 100 |
26 | 25 | R | 29 | 100 | 39 | E | 101 |
27 | 11 | L | 29 | 101 | 30 | E | 102, 103 |
28 | 25 | L | 29 | 102 | 26 | E | 127 |
29 | 11 | E | 31 | 103 | 13 | E | 127 |
30 | 29 | R | 104 | 45 | E | ||
31 | 25 | E | 36 | 105 | 58 | E | 119 |
32 | 10 | L | 34 | 106 | 28 | E | 107 |
33 | 14 | R | 35 | 107 | 8 | E | 108 |
34 | 41 | L | 36 | 108 | 43 | E | 109 |
35 | 42 | R | 36 | 109 | 40 | E | 110 |
36 | 47 | R | 37 | 110 | 34 | E | |
37 | 7 | R | 38, 45 | 111 | 23 | E | 112 |
38 | 80 | R | 39 | 112 | 162 | L | 113 |
39 | 7 | R | 40 | 113 | 11 | L | 114, 116, 120, 123, 128 |
40 | 41 | R | 41, 48, 55 | 114 | 19 | E | 115 |
41 | 47 | R | 115 | 14 | E | 125 | |
42 | 16 | L | 43 | 116 | 31 | E | 117 |
43 | 32 | L | 44 | 117 | 32 | E | 118 |
44 | 66 | L | 118 | 26 | E | 126 | |
45 | 80 | L | 46 | 119 | 55 | E | |
46 | 7 | L | 47 | 120 | 31 | E | 121 |
47 | 41 | L | 48, 49, 55 | 121 | 32 | E | 122 |
48 | 13 | E | 122 | 26 | E | 126 | |
49 | 47 | L | 123 | 19 | E | 124 | |
50 | 33 | E | 51 | 124 | 14 | E | 125 |
51 | 34 | L | 53,69 | 125 | 19 | E | |
52 | 11 | L | 53 | 126 | 48 | E | |
53 | 118 | L | 127 | 55 | E | ||
54 | 25 | L | 133 | 128 | 8 | L | 129 |
55 | 7 | R | 54, 72, 76, 87, 88 | 129 | 11 | L | 130 |
56 | 28 | E | 73 | 130 | 27 | L | 131, 137 |
57 | 12 | L | 79 | 131 | 18 | L | |
58 | 52 | L | 84, 86 | 132 | 36 | E | 135 |
59 | 14 | E | 75, 87 | 133 | 23 | L | 135 |
60 | 3 | E | 134 | 20 | R | 135 | |
61 | 3 | E | 62 | 135 | 46 | E | 136 |
62 | 8 | E | 63 | 136 | 64 | E | |
63 | 16 | E | 67 | 137 | 22 | L | |
64 | 33 | R | 65, 71, 72 | 138 | 15 | E | 139 |
65 | 8 | E | 66,99 | 139 | 34 | E | 140 |
66 | 18 | E | 67 | 140 | 22 | E | |
67 | 10 | E | 68 | 141 | 151 | L | 142 |
68 | 14 | E | 95,98 | 142 | 148 | R | 143, 146, 147, 148 |
69 | 28 | R | 82 | 143 | 64 | L | |
70 | 11 | R | 71 | 144 | 170 | L | 145 |
71 | 118 | R | 145 | 137 | R | 147, 148 | |
72 | 25 | R | 134 | 146 | 64 | R | |
73 | 40 | E | 84, 86, 87, 88, 96 | 147 | 78 | L | |
74 | 40 | E | 75 | 148 | 78 | R |
NO. | Time | Side | Immediate Sucessors | NO. | Time | Side | Immediate Sucessors |
---|---|---|---|---|---|---|---|
1 | 692 | E | 36 | 104 | 68 | R | 113 |
2 | 42 | E | 3, 4 | 105 | 232 | L | 106, 107 |
3 | 261 | R | 5 | 106 | 122 | L | 108 |
4 | 261 | L | 5 | 107 | 151 | E | 108 |
5 | 157 | E | 7, 13 | 108 | 31 | L | 113 |
6 | 90 | E | 36 | 109 | 97 | E | 113 |
7 | 54 | R | 8 | 110 | 308 | R | 113 |
8 | 67 | R | 9 | 111 | 116 | L | 113 |
9 | 30 | R | 10 | 112 | 312 | R | 113 |
10 | 106 | R | 11 | 113 | 34 | E | 114, 115, 116, 117, 118, 119, 120, 121, 122, 123, 124, 161, 162, 163, 169, 171, 174, 203, 204, 205 |
11 | 32 | R | 12 | 114 | 128 | L | 160 |
12 | 62 | R | 36 | 115 | 54 | E | 160 |
13 | 54 | L | 14 | 116 | 175 | R | 160 |
14 | 67 | L | 15 | 117 | 55 | E | 160 |
15 | 30 | L | 16 | 118 | 306 | E | 126 |
16 | 106 | L | 17 | 119 | 59 | E | 126 |
17 | 32 | L | 18 | 120 | 59 | E | 126 |
18 | 62 | L | 36 | 121 | 66 | E | 126 |
19 | 56 | E | 36 | 122 | 66 | E | 126 |
20 | 67 | E | 22 | 123 | 23 | E | 126 |
21 | 86 | E | 22 | 124 | 244 | E | 125 |
22 | 37 | E | 23 | 125 | 54 | E | 126 |
23 | 41 | E | 24, 34 | 126 | 294 | R | 127, 128, 129 |
24 | 72 | E | 26, 27, 28 | 127 | 84 | E | 135 |
25 | 86 | R | 28 | 128 | 61 | E | 135 |
26 | 16 | L | 35 | 129 | 57 | E | 135 |
27 | 51 | R | 35 | 130 | 38 | R | 136 |
28 | 66 | R | 29 | 131 | 944 | E | 132 |
29 | 41 | R | 30, 33 | 132 | 511 | R | 133 |
30 | 72 | R | 31, 32 | 133 | 625 | R | 189 |
31 | 51 | R | 35 | 134 | 445 | R | 189 |
32 | 16 | R | 35 | 135 | 68 | L | 136, 137, 138, 139, 140, 141, 142, 144, 145, 147, 148, 149, 150, 151, 152, 153, 158 |
33 | 15 | R | 35 | 136 | 53 | L | 189 |
34 | 15 | L | 35 | 137 | 49 | E | 160 |
35 | 85 | E | 36 | 138 | 92 | E | 160 |
36 | 59 | E | 37, 40, 41, 42, 62, 69, 72, 75, 83, 110, 111, 112 | 139 | 236 | E | 160 |
37 | 23 | L | 38 | 140 | 116 | L | 143 |
38 | 13 | L | 39 | 141 | 265 | L | 143 |
39 | 19 | L | 45 | 142 | 149 | L | 143 |
40 | 108 | E | 43, 54 | 143 | 74 | L | 160 |
41 | 214 | E | 92 | 144 | 332 | E | 160 |
42 | 80 | E | 43, 54 | 145 | 324 | E | 146 |
43 | 37 | L | 44 | 146 | 104 | L | 160 |
44 | 84 | L | 45 | 147 | 51 | L | 160 |
45 | 18 | L | 46, 48, 51, 53 | 148 | 58 | R | 160 |
46 | 12 | L | 47 | 149 | 67 | R | 160 |
47 | 29 | L | 92 | 150 | 49 | R | 160 |
48 | 37 | L | 49 | 151 | 107 | E | 160 |
49 | 13 | L | 50 | 152 | 38 | L | 160 |
50 | 70 | L | 92 | 153 | 27 | L | 154 |
51 | 217 | L | 52 | 154 | 68 | E | 155 |
52 | 72 | L | 92 | 155 | 207 | E | 156 |
53 | 85 | L | 92 | 156 | 202 | E | 157 |
54 | 43 | R | 55 | 157 | 83 | E | 189 |
55 | 97 | R | 56, 59, 61 | 158 | 35 | R | 159 |
56 | 37 | R | 57 | 159 | 58 | R | 189 |
57 | 13 | R | 58 | 160 | 42 | E | 164, 170, 178, 179, 184 |
58 | 35 | R | 92 | 161 | 68 | R | 167 |
59 | 217 | R | 60 | 162 | 68 | R | 165 |
60 | 72 | R | 92 | 163 | 68 | R | 164 |
61 | 85 | R | 92 | 164 | 103 | R | 165 |
62 | 25 | E | 63 | 165 | 103 | R | 166 |
63 | 37 | E | 64 | 166 | 103 | R | 167 |
64 | 37 | E | 65, 68 | 167 | 103 | R | 168 |
65 | 103 | E | 66 | 168 | 103 | R | 177 |
66 | 140 | E | 67 | 169 | 68 | L | 170 |
67 | 49 | E | 80 | 170 | 103 | L | 172 |
68 | 35 | E | 80 | 171 | 68 | L | 172 |
69 | 51 | E | 70 | 172 | 103 | L | 173 |
70 | 88 | E | 71 | 173 | 103 | L | 175 |
71 | 53 | E | 73 | 174 | 68 | L | 175 |
72 | 144 | E | 73 | 175 | 103 | L | 176 |
73 | 337 | E | 74 | 176 | 103 | L | 177 |
74 | 107 | E | 76 | 177 | 10 | E | 185, 186, 187, 188, 194, 195 |
75 | 371 | E | 92 | 178 | 187 | E | 180 |
76 | 97 | E | 77, 78, 79 | 179 | 134 | L | 180 |
77 | 166 | E | 80, 82 | 180 | 89 | L | 181, 183 |
78 | 92 | L | 80 | 181 | 58 | L | 182 |
79 | 92 | R | 80 | 182 | 49 | L | |
80 | 106 | E | 81 | 183 | 134 | L | |
81 | 49 | E | 84 | 184 | 53 | L | |
82 | 92 | E | 92 | 185 | 334 | E | 189 |
83 | 371 | E | 92 | 186 | 24 | R | 189 |
84 | 87 | E | 85 | 187 | 76 | R | 189 |
85 | 162 | E | 86, 88, 90 | 188 | 76 | L | 189 |
86 | 96 | E | 87 | 189 | 192 | E | 190, 191, 193 |
87 | 79 | E | 92 | 190 | 98 | E | |
88 | 96 | E | 89 | 191 | 258 | R | 192 |
89 | 42 | E | 92 | 192 | 165 | E | |
90 | 88 | R | 91 | 193 | 38 | R | |
91 | 90 | R | 92 | 194 | 115 | E | 197 |
92 | 97 | R | 93, 94, 95, 96, 97, 98, 99 | 195 | 83 | L | 196 |
93 | 270 | R | 135 | 196 | 56 | R | 197 |
94 | 452 | E | 135 | 197 | 29 | R | 198, 199, 201 |
95 | 48 | R | 113 | 198 | 303 | R | |
96 | 338 | E | 113 | 199 | 18 | R | 200 |
97 | 34 | E | 100 | 200 | 29 | R | |
98 | 65 | E | 100 | 201 | 154 | L | 202 |
99 | 50 | E | 100 | 202 | 90 | L | |
100 | 112 | E | 101, 103, 105, 109, 130, 131, 134 | 203 | 93 | L | |
101 | 48 | E | 102 | 204 | 94 | E | |
102 | 117 | E | 113 | 205 | 165 | E | |
103 | 50 | E | 104 |
References
- Zhang, Y.; Hu, X.; Wu, C. A modified multi-objective genetic algorithm for two-sided assembly line re-balancing problem of a shovel loader. Int. J. Prod. Res. 2018, 56, 3043–3063. [Google Scholar] [CrossRef]
- Zhang, Y.; Hu, X.; Wu, C. Improved imperialist competitive algorithms for rebalancing multi objective two-sided assembly lines with space and resource constraints. Int. J. Prod. Res. 2020, 58, 3589–3617. [Google Scholar] [CrossRef]
- Hu, X.; Wu, E.; Bao, J.; Jin, Y. A Branch-and-bound Algorithm to Minimize the Line Length of a Two-sided Assembly Line. Eur. J. Oper. Res. 2010, 206, 703–707. [Google Scholar]
- Hu, X. Two-Sided Assembly Line Balancing Algorithm and Its Application, 1st ed.; Science Press: Beijing, China, 2015; pp. 3–10. [Google Scholar]
- Bartholdi, J.J. Balancing Two-sided Assembly Lines: A Case Study. Int. J. Prod. Res. 1993, 31, 2447–2461. [Google Scholar] [CrossRef]
- Li, J.; Pang, D.; Zheng, Y.; Le, X. Digital Twin Enhanced Assembly Based on Deep Reinforcement Learning. In Proceedings of the 11th International Conference on Information Science and Technology (ICIST), Chengdu, China, 21–23 May 2021; pp. 432–437. [Google Scholar]
- Lv, Y.; Tan, Y.; Zhong, R.; Zhang, P.; Wang, J.; Zhang, J. Deep reinforcement learning-based balancing and sequencing approach for mixed model assembly lines. IET Coll. Intell. Manuf. 2022, 4, 181–193. [Google Scholar] [CrossRef]
- Lee, T.O.; Kim, Y.; Kim, Y.K. Two-sided assembly line balancing to maximize work relatedness and slackness. Comput. Ind. Eng. 2001, 40, 273–292. [Google Scholar] [CrossRef]
- Hu, X.; Wu, E.; Jin, Y. A station-oriented enumerative algorithm for two-sided assembly line balancing. Eur. J. Oper. Res. 2008, 186, 435–440. [Google Scholar] [CrossRef]
- Wei, N.; Liu, S.; Chen, C.; Xu, Y.; Shih, Y. An Integrated Method for Solving the Two-Sided Assembly Line Balancing Problems. J. Adv. Manuf. Syst. 2023, 22, 181–203. [Google Scholar] [CrossRef]
- Li, Z.; Tang, Q.; Zhang, L. Two-sided assembly line balancing problem of type I: Improvements, a simple algorithm and a comprehensive study. Comput. Oper. Res. 2016, 79, 78–93. [Google Scholar] [CrossRef]
- Álvarez-Miranda, E.; Pereira, J.; Vargas, C.; Vilà, M. Variable-depth local search heuristic for assembly line balancing problems. Int. J. Prod. Res. 2023, 61, 3102–3120. [Google Scholar] [CrossRef]
- Yılmaz, D.; Emel, K.A.; Uğur, Ö.; Mehmet, S.İ. A modified particle swarm optimization algorithm to mixed-model two-sided assembly line balancing. J. Intell. Manuf. 2017, 28, 23–36. [Google Scholar]
- Huang, D.; Mao, Z.; Fang, K.; Yuan, B. Combinatorial Benders decomposition for mixed-model two-sided assembly line balancing problem. Int. J. Prod. Res. 2022, 60, 2598–2624. [Google Scholar] [CrossRef]
- Kim, Y.K.; Song, W.S.; Kim, J.H. A Mathematical Model and a Genetic Algorithm for Two-sided Assembly Line Balancing. Comput. Oper. Res. 2009, 36, 853–865. [Google Scholar] [CrossRef]
- Lei, D.; Guo, X. Variable neighborhood search for the second type of two-sided assembly line balancing problem. Comput. Oper. Res. 2016, 72, 183–188. [Google Scholar] [CrossRef]
- Kang, H.; Lee, A.H.I. An evolutionary genetic algorithm for a multi-objective two-sided assembly line balancing problem: A case study of automotive manufacturing operations. Qual. Technol. Quant. Manag. 2023, 20, 66–88. [Google Scholar] [CrossRef]
- Azizoglu, M.; Imat, S. Workload smoothing in simple assembly line balancing. Comput. Oper. Res. 2018, 89, 51–57. [Google Scholar] [CrossRef]
- Walter, R.; Schulze, P. On the performance of task-oriented branch-and-bound algorithms for workload smoothing in simple assembly line balancing. Int. J. Prod. Res. 2022, 60, 4654–4667. [Google Scholar] [CrossRef]
- Uğur, Ö.; Bilal, T. A tabu search algorithm for two-sided assembly line balancing. Int. J. Adv. Manuf. Technol. 2009, 43, 822–829. [Google Scholar]
- Lan, C.H.; Yee, M.S. Construct an INLP Mathematical Model to solve the Two-sided Assembly Line Balancing problem of Type-3. Adv. Mater. Res. 2012, 383–390, 4302–4305. [Google Scholar] [CrossRef]
- Purnomo, H.D.; Wee, H.M. Maximizing production rate and workload balancing in a two-sided assembly line using Harmony Search. Comput. Ind. Eng. 2014, 76, 222–230. [Google Scholar] [CrossRef]
- Li, D.; Zhang, C.; Shao, X.; Lin, W. A multi-objective TLBO algorithm for balancing two-sided assembly line with multiple constraints. J. Intell. Manuf. 2016, 7, 725–739. [Google Scholar] [CrossRef]
- Li, Z.; Tang, Q.; Zhang, L.; Hu, J. Balancing two-sided assembly line with a simple and effective iterated local search algorithm. ICIC Express Lett. 2015, 9, 2695–2701. [Google Scholar]
- Wu, Y.; Tang, Q.; Zhang, L. A hybrid gravitational search algorithm for two-sided assembly line balancing problem with zoning constraints. ICIC Express Lett. Part B Appl. 2016, 7, 2633–2640. [Google Scholar]
- Buyukozkan, K.; Kucukkoc, I.; Satoglu, S.I.; Zhang, D.Z. Lexicographic bottleneck mixed-model assembly line balancing problem-Artificial bee colony and tabu search approaches with optimised parameters. Expert Syst. Appl. 2016, 50, 151–166. [Google Scholar] [CrossRef]
- Yadav, A.; Agrawal, S. Minimize idle time in two sided assembly line balancing using exact search approach. In Proceedings of the 2019 International Conference on Management Science and Industrial Engineering, Phuket, Thailand, 24–26 May 2019; pp. 220–227. [Google Scholar]
- Yadav, A.; Verma, P.; Agrawal, S. Mixed model two sided assembly line balancing problem: An exact solution approach. Int. J. Syst. Assur. Eng. 2020, 11, 335–348. [Google Scholar] [CrossRef]
- Abdullah Make, M.R.; Ab Rashid, M.F.F. Optimization of two-sided assembly line balancing with resource constraints using modified particle swarm optimisation. Sci. Iran. 2022, 29, 2084–2098. [Google Scholar]
- Chen, X. Research on Scheduling and Navigation Strategies Based on Reinforcement Learning. Master’s Thesis, Zhejiang University, Hangzhou, China, 2021. [Google Scholar]
- Tercan, H.; Meisen, T. Machine learning and deep learning based predictive quality in manufacturing: A systematic review. J. Intell. Manuf. 2022, 33, 1879–1905. [Google Scholar] [CrossRef]
- Li, C.; Zheng, P.; Yin, Y.; Wang, B.; Wang, L. Deep reinforcement learning in smart manufacturing: A review and prospects. CIRP J. Manuf. Sci. Technol. 2023, 40, 75–101. [Google Scholar] [CrossRef]
- Mnih, V.; Kavukcuoglu, K.; Silver, D.; Rusu, A.A.; Veness, J.; Bellemare, M.G.; Graves, A.; Riedmiller, M.; Fidjeland, A.K.; Ostrovski, G.; et al. Human-level control through deep reinforcement learning. Nature 2015, 518, 529–533. [Google Scholar] [CrossRef]
- Chen, X.; Yao, L.; McAuley, J.; Zhou, G.; Wang, X. Deep reinforcement learning in recommender systems: A survey and new perspectives. Knowl.-Based Syst. 2023, 264, 110335. [Google Scholar] [CrossRef]
- Goby, N.; Brandt, T.; Neumann, D. Deep reinforcement learning with combinatorial actions spaces: An application to prescriptive maintenance. Comput. Ind. Eng. 2023, 179, 109165. [Google Scholar] [CrossRef]
- Kallestad, J.; Hasibi, R.; Hemmati, A.; Sorensen, K. A general deep reinforcement learning hyper heuristic framework for solving combinatorial optimization problems. Eur. J. Oper. Res. 2023, 309, 446–468. [Google Scholar] [CrossRef]
- Zhang, Z.; Liu, H.; Zhou, M.; Wang, J. Solving Dynamic Traveling Salesman Problems With Deep Reinforcement Learning. IEEE Trans. Neural Netw. Learn. 2023, 34, 2119–2132. [Google Scholar] [CrossRef] [PubMed]
- De Sirisuriya, S.C.M.S.; Fernando, T.G.I.; Ariyaratne, M.K.A. Algorithms for path optimizations: A short survey. Computing 2023, 105, 293–319. [Google Scholar] [CrossRef]
- Jiang, Y.; Cao, Z.; Zhang, J. Learning to Solve 3-D Bin Packing Problem via Deep Reinforcement Learning and Constraint Programming. IEEE Trans. Cybern. 2023, 53, 2864–2875. [Google Scholar] [CrossRef]
- Dai, H.; Khalil, E.B.; Zhang, Y.; Dilkina, B.; Song, L. Learning combinatorial optimization algorithms over graphs. Adv. Neural Inf. Process. Syst. 2017, 30, 6349–6359. [Google Scholar]
- Zhang, C.; Wu, Y.; Ma, Y.; Song, W.; Le, Z.; Cao, Z.; Zhang, J. A review on learning to solve combinatorial optimisation problems in manufacturing. IET Collab. Intell. Manuf. 2023, 5, e12072. [Google Scholar] [CrossRef]
- Kim, Y.K.; Kim, Y.; Kim, Y.J. Two-sided assembly line balancing: A genetic algorithm approach. Prod. Plan. Control 2000, 11, 44–53. [Google Scholar] [CrossRef]
- Wang, K.; Li, X.; Gao, L. A multi-objective discrete flower pollination algorithm for stochastic two-sided partial disassembly line balancing problem. Comput. Ind. Eng. 2019, 130, 634–649. [Google Scholar] [CrossRef]
- Raju Bahubalendruni, M.V.A.; Biswal, B.B. An efficient stable subassembly identification method towards assembly sequence generation. Natl. Acad. Sci. Lett. 2018, 41, 375–378. [Google Scholar] [CrossRef]
No. | Features | Description |
---|---|---|
1 | PTime | A task with the longest operation time in the set of tasks without precedence tasks |
2 | MFlow | A task with the largest spare time caused by matched task in the set of tasks without precedence tasks |
3 | SucNum | A task with the largest number of successor tasks in the set of tasks without precedence tasks |
4 | AllSucNum | A task with the largest number of successor tasks |
5 | MTNum | A task with the smallest number of matched tasks in the set of tasks without precedence tasks |
6 | Side | The operating orientation of a task with the smallest sequence number in the set of tasks without precedence tasks (0 is E type of tasks; 1 is non-E type of tasks) |
7 | FAN | The number of tasks can be selected at present |
8 | NPN | The number of tasks without predecessors at present |
9 | NER | The number of remaining E type of tasks |
10 | NR | The number of remaining non-E type of tasks |
11 | LRDiffoverAE | The load difference of remaining non-E type of tasks/the average time of remaining E type of tasks |
12 | RToverAveptO | Remaining spare time of current station/the average time of tasks of current station |
13 | RToverAveptT | Remaining spare time of matched station/the average time of tasks of matched station |
14 | OPToverRemain | The operation time/remaining time of tasks |
15 | ARPW | Improved position weight |
16 | RRPW | Reverse position weight |
17 | PreNum | A task with the largest number of immediate successor tasks in the set of tasks without precedence tasks |
18 | AllPreNum | A task with the smallest number of precedence tasks |
No. | Features | State Feature Value (Task Assignment Scheme I) | State Feature Value (Task Assignment Scheme II) |
---|---|---|---|
1 | PTime | 1 | 1 |
2 | MFlow | 1 | 1 |
3 | SucNum | 1 | 1 |
4 | AllSucNum | 1 | 1 |
5 | MTNum | 1 | 1 |
6 | ARPW | 1 | 1 |
7 | RRPW | 1 | 1 |
8 | PreNum | 1 | 1 |
9 | AllPreNum | 1 | 1 |
10 | Side | 0 | 0 |
11 | FAN | 2 | 1 |
12 | NPN | 2 | 2 |
13 | NER | 10 | 9 |
14 | NR | 6 | 6 |
15 | LRDiffoverAE | 1 | 1 |
16 | RToverAveptO | 3 | 3 |
17 | RToverAveptT | 3 | 2 |
18 | OPToverRemain | 1 | 1 |
Parameter | Value | Parameter | Value |
---|---|---|---|
Number of hidden layers on the Actor and Critic network | 256 | Learning rate of the Actor network | 10−4 |
Activation function of hidden layers on the Actor and Critic network | Leaky Relu | Learning rate of the Critic network | 2 × 10−4 |
Activation function of output layers on the Actor network | Softmax | Sample size | 256 |
Activation function of output layers on the Critic network | Leaky Relu | Maximum capacity of the experience pool | 4096 |
Convolution kernel of convolution layers on the Actor and Critic network | (1, 3) | Study times per round | 8 |
Initialize network parameters | Orthogonal initialization | Return discount factor | 0.99 |
Optimizer | Adam | Cutting coefficient | 0.2 |
Instance | ct | ns | PPO-CNN | DPPO–CNN | ||||||
---|---|---|---|---|---|---|---|---|---|---|
LE | SI | CSI | Time(s) | LE | SI | CSI | Time(s) | |||
P9 | 3 | 3 | 94.44% | 0.41 | 0.41 | 0.004 | 94.44% | 0.41 | 0.41 | 0.001 |
4 | 3 | 85.00% | 1.87 | 1.87 | 0.004 | 94.44% | 0.41 | 0.41 | 0.001 | |
5 | 2 | 85.00% | 1.12 | 1.12 | 0.004 | 85.00% | 1.12 | 1.12 | 0.001 | |
6 | 2 | 70.83% | 2.50 | 1.58 | 0.005 | 85.00% | 1.12 | 1.12 | 0001 | |
7 | 2 | 60.71% | 3.57 | 3.57 | 0.005 | 85.00% | 1.12 | 1.12 | 0.001 | |
P12 | 4 | 4 | 78.13% | 1.37 | 1.37 | 0.007 | 78.13% | 1.37 | 1.37 | 0.002 |
5 | 3 | 83.33% | 1.23 | 0.91 | 0.007 | 83.33% | 1.23 | 0.91 | 0.002 | |
6 | 3 | 83.33% | 2.97 | 2.97 | 0.008 | 83.33% | 1.23 | 0.91 | 0.002 | |
7 | 2 | 89.29% | 1.12 | 1.12 | 0.008 | 89.29% | 1.12 | 1.12 | 0.003 | |
8 | 2 | 78.13% | 1.94 | 1.23 | 0.008 | 89.29% | 1.12 | 1.12 | 0.003 | |
9 | 2 | 69.44% | 3.35 | 1.87 | 0.009 | 89.29% | 1.12 | 1.12 | 0.003 | |
P16 | 15 | 4 | 78.095% | 7.18 | 7.09 | 0.009 | 78.095% | 7.18 | 7.09 | 0.003 |
16 | 3 | 85.42% | 2.94 | 2.52 | 0.009 | 85.42% | 2.94 | 2.52 | 0.004 | |
18 | 3 | 75.93% | 4.97 | 2.55 | 0.010 | 85.42% | 2.94 | 2.52 | 0.004 | |
19 | 3 | 71.73% | 5.86 | 2.38 | 0.011 | 85.42% | 2.94 | 2.52 | 0.006 | |
20 | 3 | 68.33% | 6.81 | 2.83 | 0.011 | 85.42% | 2.94 | 2.52 | 0.006 | |
21 | 3 | 65.08% | 8.56 | 6.18 | 0.012 | 85.42% | 2.94 | 2.52 | 0.007 | |
22 | 2 | 93.18% | 1.87 | 1.58 | 0.013 | 93.18% | 1.87 | 1.58 | 0.007 | |
P24 | 18 | 4 | 97.22% | 0.71 | 0.61 | 0.014 | 97.22% | 0.71 | 0.61 | 0.009 |
20 | 4 | 87.5% | 2.92 | 2.03 | 0.015 | 97.22% | 0.71 | 0.61 | 0.009 | |
24 | 3 | 97.22% | 1.00 | 0.71 | 0.017 | 97.22% | 1.00 | 0.71 | 0.010 | |
25 | 3 | 95.33% | 2.52 | 2.52 | 0.017 | 97.22% | 1.00 | 0.71 | 0.010 | |
30 | 3 | 77.78% | 7.92 | 5.21 | 0.020 | 97.22% | 1.00 | 0.71 | 0.013 | |
35 | 2 | 100% | 0.00 | 0.00 | 0.020 | 100% | 0.00 | 0.00 | 0.014 | |
40 | 2 | 87.50% | 6.82 | 2.55 | 0.020 | 100% | 0.00 | 0.00 | 0.014 | |
P65 | 326 | 8 | 97.76% | 9.99 | 9.94 | 0.076 | 98.36% | 8.79 | 8.79 | 0.024 |
381 | 7 | 95.59% | 27.42 | 25.03 | 0.079 | 98.98% | 6.10 | 6.10 | 0.027 | |
435 | 6 | 97.68% | 12.37 | 11.58 | 0.083 | 99.28% | 4.59 | 4.59 | 0.030 | |
490 | 6 | 86.72% | 102.11 | 40.07 | 0.087 | 99.28% | 4.59 | 4.59 | 0.032 | |
512 | 5 | 99.59% | 2.70 | 2.70 | 0.092 | 99.59% | 2.70 | 2.70 | 0.037 | |
544 | 5 | 93.73% | 43.76 | 36.76 | 0.100 | 99.59% | 2.70 | 2.70 | 0.038 | |
P148 | 204 | 13 | 96.61% | 9.52 | 9.52 | 0.175 | 99.53% | 1.66 | 1.66 | 0.094 |
228 | 12 | 93.64% | 22.61 | 17.38 | 0.181 | 99.30% | 2.43 | 2.43 | 0.097 | |
255 | 11 | 91.34% | 34.17 | 26.78 | 0.193 | 99.53% | 1.98 | 1.98 | 0.101 | |
306 | 9 | 93.03% | 32.58 | 20.76 | 0.211 | 99.35% | 2.08 | 2.08 | 0.113 | |
357 | 8 | 89.71% | 57.34 | 52.67 | 0.242 | 99.46% | 2.18 | 2.18 | 0.118 | |
378 | 7 | 96.83% | 20.44 | 5.09 | 0.256 | 99.73% | 2.00 | 2.00 | 0.123 | |
408 | 7 | 89.71% | 72.79 | 31.40 | 0.263 | 99.73% | 2.00 | 2.00 | 0.127 | |
454 | 6 | 94.05% | 40.96 | 13.82 | 0.271 | 99.77% | 1.58 | 1.58 | 0.136 | |
459 | 6 | 93.03% | 41.95 | 33.69 | 0.281 | 99.77% | 1.58 | 1.58 | 0.144 | |
510 | 6 | 83.73% | 133.08 | 128.98 | 0.297 | 99.77% | 1.58 | 1.58 | 0.149 | |
P205 | 1133 | 11 | 93.66% | 103.255 | 76.98 | 0.761 | 98.44% | 24.44 | 16.49 | 0.391 |
1275 | 10 | 91.55% | 157.90 | 108.27 | 0.783 | 98.34% | 26.75 | 19.20 | 0.420 | |
1322 | 9 | 98.11% | 35.00 | 29.71 | 0.789 | 98.70% | 29.50 | 12.09 | 0.438 | |
1455 | 9 | 89.14% | 202.97 | 119.14 | 0.821 | 98.70% | 29.50 | 12.09 | 0.445 | |
1510 | 8 | 96.63% | 63.64 | 34.84 | 0.843 | 98.85% | 25.94 | 17.83 | 0.461 | |
1650 | 8 | 88.43% | 265.00 | 96.17 | 0.855 | 98.85% | 25.94 | 17.83 | 0.474 | |
1699 | 7 | 98.15% | 46.30 | 19.52 | 0.857 | 98.79% | 30.93 | 13.99 | 0.480 | |
1888 | 7 | 88.32% | 374.85 | 253.90 | 0.864 | 98.79% | 30.93 | 13.99 | 0.499 | |
1920 | 7 | 86.85% | 447.20 | 371.80 | 0.869 | 98.79% | 30.93 | 13.99 | 0.502 | |
2077 | 6 | 93.67% | 197.45 | 121.67 | 0.888 | 98.90% | 36.08 | 11.42 | 0.518 | |
2100 | 6 | 92.64% | 205.35 | 157.83 | 0.891 | 98.90% | 36.08 | 11.42 | 0.523 | |
2266 | 6 | 85.85% | 459.64 | 395.98 | 0.907 | 98.90% | 36.08 | 11.42 | 0.542 | |
2300 | 6 | 84.58% | 516.95 | 456.03 | 0.912 | 98.90% | 36.08 | 11.42 | 0.549 | |
2454 | 5 | 95.13% | 219.16 | 63.68 | 0.920 | 99.08% | 34.69 | 11.02 | 0.558 | |
2500 | 5 | 93.38% | 251.86 | 112.88 | 0.944 | 99.08% | 34.69 | 11.02 | 0.565 | |
2643 | 5 | 88.33% | 419.17 | 277.99 | 0.952 | 99.08% | 34.69 | 11.02 | 0.577 | |
2800 | 5 | 83.38% | 683.82 | 659.15 | 0.981 | 99.08% | 34.69 | 11.02 | 0.582 | |
2832 | 5 | 82.43% | 735.43 | 711.63 | 0.992 | 99.08% | 34.69 | 11.02 | 0.589 |
Instance | Training Time (h) | |
---|---|---|
DPPO–CNN | PPO-CNN | |
P9 | 0.15 | 0.10 |
P12 | 0.20 | 0.12 |
P16 | 0.35 | 0.20 |
P24 | 0.50 | 0.35 |
P65 | 1.00 | 0.68 |
P148 | 1.50 | 0.95 |
P205 | 4.00 | 2.34 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Jia, G.; Zhang, Y.; Shen, S.; Liu, B.; Hu, X.; Wu, C. Load Balancing of Two-Sided Assembly Line Based on Deep Reinforcement Learning. Appl. Sci. 2023, 13, 7439. https://0-doi-org.brum.beds.ac.uk/10.3390/app13137439
Jia G, Zhang Y, Shen S, Liu B, Hu X, Wu C. Load Balancing of Two-Sided Assembly Line Based on Deep Reinforcement Learning. Applied Sciences. 2023; 13(13):7439. https://0-doi-org.brum.beds.ac.uk/10.3390/app13137439
Chicago/Turabian StyleJia, Guangpeng, Yahui Zhang, Shuqi Shen, Bozu Liu, Xiaofeng Hu, and Chuanxun Wu. 2023. "Load Balancing of Two-Sided Assembly Line Based on Deep Reinforcement Learning" Applied Sciences 13, no. 13: 7439. https://0-doi-org.brum.beds.ac.uk/10.3390/app13137439