Evolutionary Game Theory
Basic Symmetric Model with Stochastic Strategies
This section is a summary of Chapter 29 of the book “Algorithmic Game Theory”1.
- Agents (organisms)
- The number of agents is infinite.
- Pure strategies (or action sets) are the same: $A.$
- Mixed strategies: $\Pi: \Delta(A).$
- In analysis, it is often assumed that there are only two mixed strategies.
- The smaller group is referred to as
mutants
, with a population of $\epsilon.$ - The larger group is known as
incumbents
, with a population of $1-\epsilon.$
- The smaller group is referred to as
- Env: Evolutionary game
- Uniformly sample 2 agents ($i$ and $j$) to play a symmetric
stage game
.- Each agent just plays its current mixed strategies, $\pi^i$ and $\pi^j.$
- Fitness function (or reward function), the organisms ability to reproduce: $F: \Delta(A)\times\Delta(A) \to \mathbb{R}.$
- Agent $i$ gets a reward of $F(\pi^i\mid\pi^j)$ and agent $j$ gets $F(\pi^j\mid\pi^i).$
- Dynamics: Replication or Imitation.
- Replication: “Each organism reproduces asexually, and spawns a number of offspring proportional to its fitness.”
- Imitation: “each agent initially plays some pure strategy. After interactions, if one agents sees the other agent earned a higher payoff, the agent with the lower payoff may adopt, or imitate, the strategy of the agent who earned the higher payoff.”
- Uniformly sample 2 agents ($i$ and $j$) to play a symmetric
Two Main Characteristics
- “The population is infinite.”
- “Players adopt a very simple, local dynamic, such as replication or imitation, for choosing and updating their strategies.”
The Goal
To find resilient strategies for a given environment.
Evolutionarily Stable Strategy (ESS)
Description
“An evolutionarily stable strategy (ESS) is a strategy such that if all the members of a population adopt it, then no mutant strategy could overrun the population.”
Expected Fitness
Incumbents and mutants are denoted as $i$ and $j,$ respectively.
- An incumbent $i$
- Meets an incumbemt: $(1-\epsilon)\cdot F(\pi^i\mid\pi^i)$
- Meets an mutant: $\epsilon \cdot F(\pi^i\mid\pi^j)$
- $\mathbb{E}_{\epsilon, \boldsymbol{\pi}}\left[f^i\right] = (1-\epsilon)\cdot F(\pi^i\mid\pi^i) + \epsilon \cdot F(\pi^i\mid\pi^j)$
- A mutant $j$
- Meets an incumbemt: $(1-\epsilon)\cdot F(\pi^j\mid\pi^i)$
- Meets an mutant: $\epsilon \cdot F(\pi^j\mid\pi^j)$
- $\mathbb{E}_{\epsilon, \boldsymbol{\pi}}\left[f^j\right] = (1-\epsilon)\cdot F(\pi^j\mid\pi^i) + \epsilon \cdot F(\pi^j\mid\pi^j)$
Definition
Definition 29.11 A strategy s is an evolutionarily stable strategy (ESS) for the 2-player, symmetric game given by fitness function F , if for every strategy $\pi^j\ne \pi^i,$ there exists an $\epsilon’$ such that for all $0<\epsilon<\epsilon’,$ $\mathbb{E}_{\epsilon, \boldsymbol{\pi}}\left[f^i\right] > \mathbb{E}_{\epsilon, \boldsymbol{\pi}}\left[f^j\right],$ i.e., $(1-\epsilon)\cdot F(\pi^i\mid\pi^i) + \epsilon \cdot F(\pi^i\mid\pi^j) > (1-\epsilon)\cdot F(\pi^j\mid\pi^i) + \epsilon \cdot F(\pi^j\mid\pi^j).$
Definition 29.1 holds iff. either of 2 conditions on $\pi^i$ is satisfied $\forall\pi^j\ne\pi^i$
- $F(\pi^i\mid\pi^i) > F(\pi^j\mid \pi^i),$ or
- $F(\pi^i\mid\pi^i) = F(\pi^j\mid\pi^i)$ and $F(\pi^i\mid\pi^j) > F(\pi^j\mid\pi^j).$
Because $\epsilon$ can be approaching to $0.$
Corollaries
For $\pi^i$ to be an ESS, it must be the case that $F(\pi^i\mid\pi^i) \ge F(\pi^j\mid \pi^i), \forall \pi^j.$
It means:
- $\pi^i$ must be a best response to itself.
- $\pi^i$ is an ESS $\to$ The strategy $(\pi^i, \pi^i)$ is a Nash equilibrium. (Equivalent to the previous one.)
Theorem 29.21(An alternative definition of ESS) A strategy s is an ESS for the 2-player, symmetric game given by fitness function F , iff. $(\pi^i,\pi^i)$ is a Nash equilibrium of $F,$ and for every best response $\pi^j$ to $\pi^i$ where $\pi^j\ne\pi^i,$ $F(\pi^i\mid\pi^j) > F(\pi^j\mid \pi^j).$
Where
- “Profile $(\pi^i,\pi^i)$ is a Nash equilibrium of $F$” means “$F(\pi^i\mid\pi^i) \ge F(\pi^j\mid \pi^i), \forall \pi^j.$”
- “For every best response $\pi^j$ to $\pi^i$” means “$F(\pi^i\mid\pi^i) = F(\pi^j\mid\pi^i).$”
In the long run, the fraction of mutants in the population will tend to $0$.
Memos
- “ESS are a refinement of Nash equilibria. ESS is more restrictive than Nash equilibrium.”
- “Not all 2-player, symmetric games have an ESS.”
- “A continuous time analysis of the replicator dynamics shows that every ESS is asymptotically stable.”
An Example: Hawks and Doves
The fitness matrix of the stage game is:
Player1\Player2 | Hawk | Dove |
---|---|---|
Hawk | $(V-C)/2,(V-C)/2$ | $V,0$ |
Dove | $0,V$ | $V/2,V/2$ |
Where $V$ means value, and $C$ means conflict.
The strategy profile $(D,D)$ is not a Nash equilibrium because one player can unilaterally deviate and play $H$ and increase its payoff from $V/2$ to $V.$
The strategy profile $(D,D)$ is not a Nash equilibrium $\to$ $D$ is not an ESS. (ESS are a refinement of Nash equilibria.)
Let $\pi$ denotes any policy with probability $0<p<1$ of playing $H.$
If $V > C$ then $H$ is an ESS: $F(\pi\mid H) = p\cdot (V-C) / 2 + (1-p)\cdot 0 < F(H\mid H) = (V-C)/2, \forall \pi\ne H.$
If $V\le C$ then $p=V/C$ is an ESS. The condition is, still, $F(p\mid p*) < F(p^*\mid p^*), \forall p\ne p^*.$
Replicator Dynamics
It requires the strategy set to be finite.
Replicator dynamics is a mathematical model that describes how the proportion of individuals using certain strategies changes over time within a population. This dynamics is particularly important for studying Evolutionarily Stable Strategies (ESS) and the dynamic stability in game theory.
Definition and Mathematical Formulation
Assume there is a set of strategies $\set{1, 2, \dots, n}$, where each strategy $i$ has a proportion $x_i$ in the population, with the constraint $\sum_{i=1}^n x_i = 1$. The fitness (or payoff) of each strategy $i$ is denoted as $f_i(x)$, which typically depends on the current distribution of all strategies $x = (x_1, x_2, \dots, x_n)$.
In replicator dynamics, the rate of change in the proportion of strategy $i$ over time is proportional to the deviation of its fitness from the average fitness. Specifically, the dynamics of strategy $i$ can be expressed as:
\[\dot{x}_i = x_i (f_i(x) - \bar{f}(x))\]where $\bar{f}(x) = \sum_{j=1}^n x_j f_j(x)$ represents the expected fitness of the population.
Intuitive Explanation
Intuitively, if a strategy has a fitness above the average, individuals adopting this strategy will reproduce at a faster rate, causing the proportion of this strategy in the population to increase. Conversely, strategies with a fitness below the average will decrease in proportion.
Applications
Replicator dynamics is applied across various fields including ecology, economics, and social sciences, helping researchers understand how individual strategies evolve over time in competitive and cooperative environments. In evolutionary game theory, it serves as a key tool for analyzing the stability of strategies and the dynamics of group behavior.
Other Dynamics
Neutral Drift
Neutral drift is a genetic concept applied within evolutionary game theory to describe random changes in strategy frequencies when there are no significant adaptive advantages or disadvantages among the choices.
If a strategy is adaptively equivalent to others (i.e., the differences in fitness between them are negligible), then their relative proportions may be determined solely by random factors.
This phenomenon can theoretically be described using neutral drift models, which do not rely on fitness differences between strategies but emphasize the role of randomness and chance in evolutionary processes.
Risk Dominance
In game theory, the concept of “risk dominance” is used to analyze how players choose strategies in games that have multiple equilibria. Risk dominance is based on the idea of minimizing the worst-case outcome—essentially minimizing potential losses—assuming opponents may choose different strategies.
Simply put, if in a game, strategy A is risk dominant over strategy B, it means that choosing strategy A ensures a higher minimum payoff compared to the minimum payoff of choosing strategy B, given the uncertainty of the opponent’s strategy. This is a conservative decision-making approach under uncertainty.
A strategy $s_i$ is said to risk dominate another strategy $s_i’$ and can be mathematically expressed as follows:
Consider a two-player game where each player has two strategy options, $s_i$ and $s_i’$. For player $i$, the condition for strategy $s_i$ to risk dominate strategy $s_i’$ is:
\[\min_{s_{-i} \in S_{-i}} u_i(s_i, s_{-i}) > \min_{s_{-i} \in S_{-i}} u_i(s_i', s_{-i}),\]where:
- $u_i(s_i, s_{-i})$ is the utility (or payoff) of player $i$ when they choose strategy $s_i$ and other players choose strategy $s_{-i}$.
- $S_{-i}$ represents the strategy set of all other players except player $i$.
This expression states that if player $i$ chooses strategy $s_i$, considering all possible strategies of the other players, the worst-case utility of $s_i$ is still higher than the worst-case utility of $s_i’$.
This criterion is a conservative strategy.