Article

Faster Learning and Adaptation in Security Games by Exploiting Information Asymmetry

Journal

IEEE TRANSACTIONS ON SIGNAL PROCESSING
Volume 64, Issue 13, Pages 3429-3443

Publisher

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
DOI: 10.1109/TSP.2016.2548987

Keywords

Cloud computing; cognitive radio; energy harvesting; jamming; reinforcement learning; security; stochastic game

Funding

  1. National Science Foundation [CNS-1016260, ECCS-1307949, EARS-1444009]
  2. National Science Foundation, Directorate for Engineering, Division of Electrical, Communications and Cyber Systems [1444009]
  3. National Science Foundation, Directorate for Engineering, Division of Electrical, Communications and Cyber Systems [1307949]


With the advancement of modern technologies, the security battle between a legitimate system (LS) and an adversary is becoming increasingly sophisticated, involving complex interactions in unknown dynamic environments. Stochastic game (SG), together with multi-agent reinforcement learning (MARL), offers a systematic framework for the study of information warfare in current and emerging cyber-physical systems. In practical security games, each player usually has only incomplete information about the opponent, which induces information asymmetry. This paper exploits information asymmetry from a new angle, considering how to exploit information unknown to the opponent to the player's advantage. Two new MARL algorithms, termed minimax post-decision state (minimax-PDS) and Win-or-Learn Fast post-decision state (WoLF-PDS), are proposed, which enable the LS to learn and adapt faster in dynamic environments by exploiting its information advantage. The proposed algorithms are provably convergent and rational, respectively. Also, numerical results are presented to show their effectiveness through three important applications.
