4.6 Article

Enhanced DQN Framework for Selecting Actions and Updating Replay Memory Considering Massive Non-Executable Actions

Journal

APPLIED SCIENCES-BASEL
Volume 11, Issue 23, Pages -

Publisher

MDPI
DOI: 10.3390/app112311162

Keywords

Gomoku; game artificial intelligence; replay memory; Deep-Q-Network; reinforcement learning

Funding

  1. Dongguk University Research Fund
  2. Basic Science Research Program through National Research Foundation of Korea (NRF)
  3. Ministry of Education [2018R1D1A1B07049990]

Ask authors/readers for more resources

The study introduces an enhanced DQN framework to address the batch size issue and reduce the learning time of a DQN in an environment with numerous non-executable actions. By filtering out non-executable actions to reduce the number of selectable actions, it aims to identify the optimal action for the current state.
A Deep-Q-Network (DQN) controls a virtual agent as the level of a player using only screenshots as inputs. Replay memory selects a limited number of experience replays according to an arbitrary batch size and updates them using the associated Q-function. Hence, relatively fewer experience replays of different states are utilized when the number of states is fixed and the state of the randomly selected transitions becomes identical or similar. The DQN may not be applicable in some environments where it is necessary to perform the learning process using more experience replays than is required by the limited batch size. In addition, because it is unknown whether each action can be executed, a problem of an increasing amount of repetitive learning occurs as more non-executable actions are selected. In this study, an enhanced DQN framework is proposed to resolve the batch size problem and reduce the learning time of a DQN in an environment with numerous non-executable actions. In the proposed framework, non-executable actions are filtered to reduce the number of selectable actions to identify the optimal action for the current state. The proposed method was validated in Gomoku, a strategy board game, in which the application of a traditional DQN would be difficult.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.6
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available