Advances in Reinforcement Learning

A special issue of Machine Learning and Knowledge Extraction (ISSN 2504-4990). This special issue belongs to the section "Learning".

Deadline for manuscript submissions: closed (15 March 2022) | Viewed by 39395

Special Issue Editor


Prof. Dr. Ausif Mahmood
Guest Editor
Chair, Department of Computer Science and Engineering; Professor of Computer Science and Engineering and of Electrical Engineering, University of Bridgeport, Bridgeport, CT 06604, USA
Interests: deep learning; reinforcement learning; computer vision; NLP; optimization

Special Issue Information

Dear Colleagues,

Reinforcement Learning (RL), in which an agent learns by interacting with its environment, is one of the most exciting areas of Artificial Intelligence. Unlike the supervised and unsupervised learning paradigms, RL requires no predisposed intuition, labeled data, or supervision. Starting from the foundational work on the Bellman equations, many RL algorithms have been proposed in recent years, and the success of RL has been demonstrated in practical applications in robotics, autonomous vehicles, communication systems, game playing, finance, healthcare, and adaptive decision control, among other fields. Although considerable work on algorithmic and mathematical formulations for single-agent RL systems has led to impressive results across domains, Multi-Agent RL (MARL) is still in its infancy. Challenges yet to be resolved, for both single- and multi-agent RL systems, include real-time adaptation to nonstationary or stochastic environments, high-dimensional continuous state and action spaces, adversarial RL (both attacks and defenses), partial observability of the environment, RL under interference or in noisy environments, and safe control. To address these challenging problems, we propose this Special Issue on “Advances in Reinforcement Learning” and invite papers on both theoretical and applied research related to RL and MARL. We believe this Special Issue will help advance the state of the art in reinforcement learning.
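As a concrete illustration of the Bellman foundations mentioned above, the sketch below runs value iteration on a small MDP in Python. It is a minimal sketch only: the two-state transition table and the discount factor are illustrative assumptions, not taken from any submission to this Special Issue.

```python
# Minimal value-iteration sketch for a tiny, hypothetical MDP.
# P[s][a] is a list of (probability, next_state, reward) triples; the
# two-state, two-action environment below is purely illustrative.
P = {
    0: {0: [(1.0, 0, 0.0)], 1: [(0.8, 1, 1.0), (0.2, 0, 0.0)]},
    1: {0: [(1.0, 0, 0.0)], 1: [(1.0, 1, 2.0)]},
}
gamma = 0.9  # discount factor (assumed value)

V = {s: 0.0 for s in P}  # initial value estimates
for _ in range(1000):
    delta = 0.0
    for s in P:
        # Bellman optimality backup:
        # V(s) <- max_a sum_{s'} p(s'|s,a) * (r + gamma * V(s'))
        v_new = max(
            sum(p * (r + gamma * V[s2]) for p, s2, r in P[s][a])
            for a in P[s]
        )
        delta = max(delta, abs(v_new - V[s]))
        V[s] = v_new
    if delta < 1e-8:  # stop once the values have converged
        break

print(V)  # optimal state values for the toy MDP
```

Each sweep applies the Bellman optimality backup until the value estimates stop changing; this fixed-point view is what modern deep RL methods approximate at scale.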

Prof. Dr. Ausif Mahmood
Guest Editor

Manuscript Submission Information

Manuscripts should be submitted online at www.mdpi.com by registering and logging in to this website. Once you are registered, go to the submission form. Manuscripts can be submitted until the deadline. All submissions that pass pre-check are peer-reviewed. Accepted papers will be published continuously in the journal (as soon as accepted) and will be listed together on the Special Issue website. Research articles, review articles, and short communications are invited. For planned papers, a title and short abstract (about 100 words) can be sent to the Editorial Office for announcement on this website.

Submitted manuscripts should not have been published previously, nor be under consideration for publication elsewhere (except conference proceedings papers). All manuscripts are thoroughly refereed through a single-blind peer-review process. A guide for authors and other relevant information for submission of manuscripts is available on the Instructions for Authors page. Machine Learning and Knowledge Extraction is an international peer-reviewed open access quarterly journal published by MDPI.

Please visit the Instructions for Authors page before submitting a manuscript. The Article Processing Charge (APC) for publication in this open access journal is 1800 CHF (Swiss Francs). Submitted papers should be well formatted and written in good English. Authors may use MDPI's English editing service prior to publication or during author revisions.

Keywords

  • reinforcement learning
  • multi-agent reinforcement learning
  • Markov decision process
  • value iteration
  • policy gradients
  • learning to learn
  • deep Q learning
  • Markov game
  • deep reinforcement learning

Published Papers (3 papers)


40 pages, 1371 KiB  
Review
Robust Reinforcement Learning: A Review of Foundations and Recent Advances
by Janosch Moos, Kay Hansel, Hany Abdulsamad, Svenja Stark, Debora Clever and Jan Peters
Mach. Learn. Knowl. Extr. 2022, 4(1), 276-315; https://doi.org/10.3390/make4010013 - 19 Mar 2022
Cited by 31 | Viewed by 11501
Abstract
Reinforcement learning (RL) has become a highly successful framework for learning in Markov decision processes (MDP). Due to the adoption of RL in realistic and complex environments, solution robustness becomes an increasingly important aspect of RL deployment. Nevertheless, current RL algorithms struggle with robustness to uncertainty, disturbances, or structural changes in the environment. We survey the literature on robust approaches to reinforcement learning and categorize these methods in four different ways: (i) Transition robust designs account for uncertainties in the system dynamics by manipulating the transition probabilities between states; (ii) Disturbance robust designs leverage external forces to model uncertainty in the system behavior; (iii) Action robust designs redirect transitions of the system by corrupting an agent’s output; (iv) Observation robust designs exploit or distort the perceived system state of the policy. Each of these robust designs alters a different aspect of the MDP. Additionally, we address the connection of robustness to the risk-based and entropy-regularized RL formulations. The resulting survey covers all fundamental concepts underlying the approaches to robust reinforcement learning and their recent advances.
(This article belongs to the Special Issue Advances in Reinforcement Learning)
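To make category (iii) of the survey's taxonomy concrete, the hypothetical, Gym-style wrapper below corrupts the agent's chosen action with probability alpha before the environment executes it; training against such a wrapper encourages policies that stay performant under action perturbations. The wrapper, its alpha parameter, and the step/reset API are assumptions for illustration, not code from the survey.

```python
import random

class ActionRobustWrapper:
    """Illustrative action-robust perturbation: with probability alpha,
    the agent's action is replaced by a random one before the environment
    sees it (hypothetical sketch, not taken from the survey)."""

    def __init__(self, env, alpha=0.1):
        self.env = env      # any Gym-style env with step/reset/action_space
        self.alpha = alpha  # corruption probability (assumed value)

    def reset(self):
        return self.env.reset()

    def step(self, action):
        # Corrupt the agent's output to redirect the transition.
        if random.random() < self.alpha:
            action = self.env.action_space.sample()
        return self.env.step(action)
```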

50 pages, 1345 KiB  
Review
Hierarchical Reinforcement Learning: A Survey and Open Research Challenges
by Matthias Hutsebaut-Buysse, Kevin Mets and Steven Latré
Mach. Learn. Knowl. Extr. 2022, 4(1), 172-221; https://doi.org/10.3390/make4010009 - 17 Feb 2022
Cited by 21 | Viewed by 17260
Abstract
Reinforcement learning (RL) allows an agent to solve sequential decision-making problems by interacting with an environment in a trial-and-error fashion. When these environments are very complex, pure random exploration of possible solutions often fails, or is very sample inefficient, requiring an unreasonable amount of interaction with the environment. Hierarchical reinforcement learning (HRL) utilizes forms of temporal and state abstraction to tackle these challenges, while simultaneously paving the road for behavior reuse and increased interpretability of RL systems. In this survey paper, we first introduce a selection of problem-specific approaches, which provide insight into how to utilize often handcrafted abstractions in specific task settings. We then introduce the Options framework, which provides a more generic approach, allowing abstractions to be discovered and learned semi-automatically. Afterwards, we introduce the goal-conditional approach, which allows sub-behaviors to be embedded in a continuous space. In order to further advance the development of HRL agents capable of simultaneously learning abstractions and how to use them, solely from interaction with complex high-dimensional environments, we also identify a set of promising research directions.
(This article belongs to the Special Issue Advances in Reinforcement Learning)
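The Options framework referenced in the abstract formalizes a temporally extended behavior as a triple of an initiation set I, an intra-option policy pi, and a termination condition beta (Sutton, Precup, and Singh). The following minimal Python sketch shows that structure and an execution loop; the field names and the Gym-style env.step API are illustrative assumptions.

```python
from dataclasses import dataclass
from typing import Callable

@dataclass
class Option:
    """An option in the sense of Sutton, Precup, and Singh:
    can_start  -- initiation set I (states where the option may begin)
    policy     -- intra-option policy pi(s) -> action
    terminates -- termination condition beta(s) -> probability of stopping
    (Field names are chosen here for illustration.)"""
    can_start: Callable[[object], bool]
    policy: Callable[[object], object]
    terminates: Callable[[object], float]

def run_option(env, state, option, rng):
    """Execute one option until its termination condition fires
    (assumes a Gym-style env.step returning obs, reward, done, info)."""
    assert option.can_start(state)
    total_reward, done = 0.0, False
    while not done:
        state, reward, done, _ = env.step(option.policy(state))
        total_reward += reward
        if rng.random() < option.terminates(state):
            break
    return state, total_reward, done
```

A higher-level policy then chooses among such options instead of primitive actions, which is the temporal abstraction the survey builds on.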

28 pages, 2084 KiB  
Review
Recent Advances in Deep Reinforcement Learning Applications for Solving Partially Observable Markov Decision Processes (POMDP) Problems: Part 1—Fundamentals and Applications in Games, Robotics and Natural Language Processing
by Xuanchen Xiang and Simon Foo
Mach. Learn. Knowl. Extr. 2021, 3(3), 554-581; https://doi.org/10.3390/make3030029 - 15 Jul 2021
Cited by 28 | Viewed by 9214
Abstract
This first part of a two-part series of papers provides a survey of recent advances in Deep Reinforcement Learning (DRL) applications for solving partially observable Markov decision process (POMDP) problems. Reinforcement Learning (RL) is an approach that simulates the natural human learning process, the key of which is to let the agent learn by interacting with a stochastic environment. Because the agent requires only limited access to information about the environment, RL can be applied efficiently in most fields that require self-learning. Although efficient algorithms are already widely used, an organized investigation is essential so that good comparisons can be made and the best structures or algorithms chosen when applying DRL to various applications. In this overview, we introduce Markov decision process (MDP) problems, Reinforcement Learning, and applications of DRL for solving POMDP problems in games, robotics, and natural language processing. A follow-up paper will cover applications in transportation, communications and networking, and industry.
(This article belongs to the Special Issue Advances in Reinforcement Learning)
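The defining feature of a POMDP, as the abstract notes, is that the agent observes the environment only indirectly and must therefore act on a belief, i.e., a probability distribution over hidden states updated by Bayes' rule. A minimal sketch of that belief update for a hypothetical discrete POMDP follows; the transition model T and observation model O are assumed placeholders, not models from the paper.

```python
def belief_update(belief, action, observation, T, O, states):
    """Bayes-filter belief update for a discrete POMDP (illustrative).
    belief[s]        -- prior probability of hidden state s
    T[s][action][s2] -- transition probability p(s2 | s, action)
    O[s2][action][o] -- observation probability p(o | s2, action)
    All models here are hypothetical placeholders."""
    new_belief = {}
    for s2 in states:
        # Predict: propagate the belief through the transition model,
        # then correct with the likelihood of the received observation.
        predicted = sum(belief[s] * T[s][action][s2] for s in states)
        new_belief[s2] = O[s2][action][observation] * predicted
    total = sum(new_belief.values())
    if total == 0:
        raise ValueError("observation has zero probability under the model")
    return {s2: p / total for s2, p in new_belief.items()}
```

Deep RL methods for POMDPs effectively approximate this update implicitly, for example with recurrent networks that summarize the observation history.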
