Evolving opponent models for Texas Hold 'Em

Opponent models allow software agents to assess a multi-agent environment more accurately and therefore improve the agent's performance. This paper makes use of coarse approximations to game-theoretic player representations to improve the performance of software players in Limit Texas Hold 'Em poker. A 10-parameter model, intended to model a combination, or mixture, of various strategies is developed to represent the opponent. A dasiamixture identifierpsila is then evolved using the NEAT neuroevolution method to estimate values of these parameters for arbitrary opponents. To evaluate this approach, two poker players, represented as neural networks, were evolved under the same conditions, one with the mixture identifier, and one without. The player trained with access to the identifier achieved consistently higher and more stable fitness during evolution compared with the player without the identifier. Further, the player with the identifier outplays the other in a heads-up match after training, winning on average 60% of the money at the table. These results demonstrate that opponent modeling is effective even with low-dimensional models and conveys an advantage to players trained to use these models.