[an error occurred while processing this directive]

Journal of Systems Engineering and Electronics ›› 2023, Vol. 34 ›› Issue (2): 270-288.doi: 10.23919/JSEE.2023.000012

• • 上一篇    

  

  • 收稿日期:2021-03-12 出版日期:2023-04-18 发布日期:2023-04-18

Recognition and interfere deceptive behavior based on inverse reinforcement learning and game theory

Yunxiu ZENG(), Kai XU()   

  1. 1 College of Systems Engineering, National University of Defense Technology, Changsha 410073, China
  • Received:2021-03-12 Online:2023-04-18 Published:2023-04-18
  • Contact: Yunxiu ZENG E-mail:yuunxiuzeng@hotmail.com;xukai09@nudt.edu.cn
  • About author:
    ZENG Yunxiu was born in 1994. She received her M.S. degree from National University of Defense Technology (NUDT), Changsha, China, in 2018. She is currently a lecturer with NUDT. Her research interests include human behavior modeling and goal recognition. E-mail: yuunxiuzeng@hotmail.com

    XU Kai was born in 1990. He received his M.S. and Ph.D. degrees from National University of Defense Technology (NUDT), Changsha, China, in 2016 and 2020 respectively. He is currently a lecturer with NUDT. His research interests include human behavior modeling and goal recognition. E-mail: xukai09@nudt.edu.cn

Abstract:

In real-time strategy (RTS) games, the ability of recognizing other players’ goals is important for creating artifical intelligence (AI) players. However, most current goal recognition methods do not take the player ’s deceptive behavior into account which often occurs in RTS game scenarios, resulting in poor recognition results. In order to solve this problem, this paper proposes goal recognition for deceptive agent, which is an extended goal recognition method applying the deductive reason method (from general to special) to model the deceptive agent’s behavioral strategy. First of all, the general deceptive behavior model is proposed to abstract features of deception, and then these features are applied to construct a behavior strategy that best matches the deceiver’s historical behavior data by the inverse reinforcement learning (IRL) method. Final, to interfere with the deceptive behavior implementation, we construct a game model to describe the confrontation scenario and the most effective interference measures.

Key words: deceptive path planning, inverse reinforcement learning (IRL), game theory, goal recognition