4.6 Article

Cooperative Multi-Agent Interaction and Evaluation Framework Considering Competitive Networks with Dynamic Topology Changes

期刊

APPLIED SCIENCES-BASEL
卷 10, 期 17, 页码 -

出版社

MDPI
DOI: 10.3390/app10175828

关键词

adaptive algorithm; competitive network agent; dynamically changes networks; human-machine-agent interaction; reinforcement learning

资金

  1. Priority Research Centers Program through the National Research Foundation of Korea (NRF) - Ministry of Education, Science, and Technology [2018R1A6A1A03024003]

向作者/读者索取更多资源

Featured Application The proposed multi-agent framework can be applied to cooperative tasks between human and machines, such as human-robot interaction, autonomous car driving, and artificial intelligence-based industrial tasks. In recent years, the problem of reinforcement learning has become increasingly complex, and the computational demands with respect to such processes have increased. Accordingly, various methods for effective learning have been proposed. With the help of humans, the learning object can learn more accurately and quickly to maximize the reward. However, the rewards calculated by the system and via human intervention (that make up the learning environment) differ and must be used accordingly. In this paper, we propose a framework for learning the problems of competitive network topologies, wherein the environment dynamically changes agent, by computing the rewards via the system and via human evaluation. The proposed method is adaptively updated with the rewards calculated via human evaluation, making it more stable and reducing the penalty incurred while learning. It also ensures learning accuracy, including rewards generated from complex network topology consisting of multiple agents. The proposed framework contributes to fast training process using multi-agent cooperation. By implementing these methods as software programs, this study performs numerical analysis to demonstrate the effectiveness of the adaptive evaluation framework applied to the competitive network problem depicting the dynamic environmental topology changes proposed herein. As per the numerical experiments, the greater is the human intervention, the better is the learning performance with the proposed framework.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.6
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据