A single interaction is often nonbeneficial,so repeated interaction strategy is given.
重复交互可以使Agent彼此共享信息,通过惩罚来达到系统的平衡,惩罚是通过忽视被惩罚的Agent的询问来实现的。