Key points are not available for this paper at this time.
Abstract Decision-making often overlooks the feedback between agents and the environment. Reinforcement learning is widely employed through exploratory experimentation to address problems related to states, actions, rewards, decision-making in various contexts. This work considers a new perspective, where individuals continually update their policies based on interactions with the spatial environment, aiming to maximize cumulative rewards and learn the optimal strategy. Specifically, we utilize the Q-learning algorithm to study the emergence of cooperation in a spatial population playing the donation game. Each individual has a Q-table that guides their decision-making in the game. Interestingly, we find that cooperation emerges within this introspective learning framework, and a smaller learning rate and higher discount factor make cooperation more likely to occur. Through the analysis of Q-table evolution, we disclose the underlying mechanism for cooperation, which may provide some insights to the emergence of cooperation in the real-world systems.
Building similarity graph...
Analyzing shared references across papers
Loading...
Jing Zhang
Zhihai Rong
Guozhong Zheng
Journal of Physics Complexity
Donghua University
Shaanxi Normal University
Ningxia University
Building similarity graph...
Analyzing shared references across papers
Loading...
Zhang et al. (Tue,) studied this question.
www.synapsesocial.com/papers/68e6ee11b6db643587668a82 — DOI: https://doi.org/10.1088/2632-072x/ad3f65