Next Article in Journal
A Memetic Algorithm for an External Depot Production Routing Problem
Previous Article in Journal
Advanced Construction of the Dynamic Matrix in Numerically Efficient Fuzzy MPC Algorithms
Open AccessArticle

Crowd Evacuation Guidance Based on Combined Action Reinforcement Learning

School of Computer Science and Technology, Harbin Institute of Technology, Harbin 150001, China
*
Author to whom correspondence should be addressed.
Received: 28 December 2020 / Revised: 12 January 2021 / Accepted: 15 January 2021 / Published: 18 January 2021
Existing crowd evacuation guidance systems require the manual design of models and input parameters, incurring a significant workload and a potential for errors. This paper proposed an end-to-end intelligent evacuation guidance method based on deep reinforcement learning, and designed an interactive simulation environment based on the social force model. The agent could automatically learn a scene model and path planning strategy with only scene images as input, and directly output dynamic signage information. Aiming to solve the “dimension disaster” phenomenon of the deep Q network (DQN) algorithm in crowd evacuation, this paper proposed a combined action-space DQN (CA-DQN) algorithm that grouped Q network output layer nodes according to action dimensions, which significantly reduced the network complexity and improved system practicality in complex scenes. In this paper, the evacuation guidance system is defined as a reinforcement learning agent and implemented by the CA-DQN method, which provides a novel approach for the evacuation guidance problem. The experiments demonstrate that the proposed method is superior to the static guidance method, and on par with the manually designed model method. View Full-Text
Keywords: evacuation guidance; crowd simulation; deep Q network; reinforcement learning evacuation guidance; crowd simulation; deep Q network; reinforcement learning
Show Figures

Figure 1

MDPI and ACS Style

Xue, Y.; Wu, R.; Liu, J.; Tang, X. Crowd Evacuation Guidance Based on Combined Action Reinforcement Learning. Algorithms 2021, 14, 26. https://0-doi-org.brum.beds.ac.uk/10.3390/a14010026

AMA Style

Xue Y, Wu R, Liu J, Tang X. Crowd Evacuation Guidance Based on Combined Action Reinforcement Learning. Algorithms. 2021; 14(1):26. https://0-doi-org.brum.beds.ac.uk/10.3390/a14010026

Chicago/Turabian Style

Xue, Yiran; Wu, Rui; Liu, Jiafeng; Tang, Xianglong. 2021. "Crowd Evacuation Guidance Based on Combined Action Reinforcement Learning" Algorithms 14, no. 1: 26. https://0-doi-org.brum.beds.ac.uk/10.3390/a14010026

Find Other Styles
Note that from the first issue of 2016, MDPI journals use article numbers instead of page numbers. See further details here.

Article Access Map by Country/Region

1
Search more from Scilit
 
Search
Back to TopTop