Game theory: Fundamental Principle of Game Theory and Reduction by Dominance/Principio fundamental de la teoria de juegos y reducir por predominio

Recall the following from Parts A and B of this tutorial:

For every (pure or mixed) strategy one player uses, the other player has a best pure counterstrategy.
A player using the minimax criterion will choose an optimal strategy that minimizes the negative effect of the other player's best pure counterstrategy.
To solve the game (using the minimax criterion criterion) means to find each player's optimal strategy using the minimax criterion.

At this point we only know how to solve a game if it has a saddle point or happens to reduce to a 1×1 game by dominance. Any two-person zero-sum game can be solved using linear programming as described in Section 4.5 of or . In this tutorial we discuss a graphical method to solve a 2×2 game or any game that reduces to a 2×2 one.

To best illustrate how this procedure works, we look at a specific example: Recuerda el siguiente de Partes A y B de este tutorial:

Para cada estrategia (pura o mixta) que usa un jugador, su contrincante tiene un mejor contraestrategia pura.
Un jugador quien usa el criterio minimax escogerá una estrategia óptima que minimiza el efecto negativo del mejor contraestrategia pura de su contrincante.
Solucionar el juego (a través del criterio minimax) significa hallar la estrategia óptima de ambos jugadores por usar el criterio minimax.

En este momento solo podemos solucionar un juego si tiene un punto de silla o, por causalidad, reduce por dominio a un juego 1×1. Se puede solucionar cualquier juego bipersonal de suma cero por usar la programación lineal, descrito en la sección 4.5 de y . En este tutorial estudiamos un método grafical para solucionar cualquier juego 2×2 o cualquier juego que reduce por dominio a un juego 2×2.

Ilustramos este método por un ejemplo específico:

Notice that this game cannot reduce to a smaller game by dominance. Observa que este juego no puede ser reducido por dominio a un juego más puqeuño.

Finding the row player's optimal strategy Hallar la estrategia óptima para el jugador renglón

The row player's optimal strategy is a mixed strategy that minimizes the worst damage that can be caused by a pure strategy used by the column player. Right now we don't know what the optimal row strategy is, so let us call it La estrategia óptima del jugador renglón es aquella estrategia que minimiza el daño peor que puede resultar por una estrategia pura por parte del jugador columna. En este momento no sabemos que será la estrategia óptima. Por lo tanto, la llamemos

We used $1-x$ for the second entry because the entries have to add up to 1. Hemos usado $1-x$ por la segunda entrada porque las entradas deben sumar 1.

Our job is to find the value or values of $x$ that minimize the worst damage the column player can inflict using a pure strategy. Now there are only two possible pure strategies the column player can use: Nuestra tarea es hallar el valor (o valores) de $x$ que minimiza el daño peor que resulta por una estrategia pura por parte del jugador columna. Ahora, hay solo dos estrategias puras disponibles al jugador columna:

	$C = \mat[6]{\[ 1 \]! 0 }$	Always choose the first move. Siempre elegir la primera acción.
or o
	$C = \mat[6]{\[ 0 \]! 1 }$	Always choose the second move. Siempre elegir la segunda acción.

Case 1. Caso 1. $C = \mat[6]{\[ 1 \]! 0 }$
To measure the effect of this pure strategy, we calculate the expected payoff that results: Medimos el efecto de esta estrategia pura por calcular el pago esperado que resulta:

Of course, this answer depends on $x$. Por supuesto, este resultado depende de $x$.

Case 2. Caso 2. $C = \mat[6]{\[ 0 \]! 1 }$
Again, we calculate the expected payoff that results from this strategy: Calculamos de nuevo el pago esperado que resulta de esta estrategia:

Here are the graphs corresponding to the two cases above: Aquí están las gráficas que coresponde a los dos casos más arriba:

$e$

This graph and the next will appear only when you have entered both answers above. Esta gráfica y la proxima aparecereán solo cuando has ingresado las dos respuestas más arriba.

As we see (when the graphs appear), for each value of $x$ there are (usually) two values of $e$ corresponding to the two pure strategies the column player can use; one lower (worse for the row player) than the other.

Now the fundamental principle of game theory tells us that, for every value of $x$ contemplated by the row player (for instance $x = 0.8$) the column player can be expected to choose the column strategy that does the most harm to the row player (in this instance, the Case strategy). Thus the column player can be expected to choose the strategy that leads to the lower of the two $e$-values. The same is true for every choice of $x$. Below we see the graph again with these choices shown in red.: Como vimos (cuando aparecen las gráficas), para cada valor de $x$ hay (usualmente) dos valores de $e$ que coresponde a los dos estrategias puras del jugador columna; una más baja (peor para el jugador renglón) que la otra.

Ahora, el principio fundamental de la teoría de juegos nos dice que, para cada valor de $x$ contemplado por el jugador renglón (por ejemplo $x = 0.8$) se espera que el jugador columna escogerá la estrategia columna que hace el daño peor al jugador renglón (cuando $x = 0.8$, la estrategia de Caso ).Por lo tanto, se espera que el jugador columna escoja la estrategia que resulta en el valor más bajo de $e$. Lo mismo se aplica a cada elección de $x$. Más bajo vimos de nuevo la gráfica con estas elecciones mostradas en rojo:

$e$

For each choice of $x$, the red part of the graph gives the corresponding worse outcome for the row player. Para cada elección de $x$, la parte rojo pesada de la gráfica de el resultado peor para el jugador renglón.

&7 In view of this graph, which $x$ should the row player use? En vista de esta gráfica, ¿cuál valor de $x$ debe escoger el jugador renglón?
&8 The row player is seeking to maximize the expected payoff assuming the worst; that is, assuming the expected payoff is given by the red portion of the graph. Thus, all we need to do is determine the value of $x$ corresponding to the highest point of the red portion of the graph. In this case, it occurs at the point of intersection of the two lines. El jugador renglón intenta maximizar el pago esperado mientras suponiendo el peor; es decir, suponiendo que el pago esperado se da por la porción roja de la gráfica. Por lo tanto, todo que falta hacer es determinar el valor de $x$ que corresponde al punto más alto de la porción roja de la gráfica. En este caso, ocurre al punto de intersección le las dos rectas.

The value of $x$ and associated expected payoff $e$ are given by: El valor de $x$ y el pago esperado $e$ que resulta son:

Since we now know $x$, we also know the row player's optimal mixed strategy: Ya que sabemos el valor de $x$, también sabemos la estrategia mixta óptima para el jugador renglón:

$R = \mat[6]{\[x , 1-x\]} = $

Further, if the row player uses that strategy, the expected value of the game will be Además, si el jugador renglón usa aquella estrategia, el valor esperado del juego será $e = $ ??.

Finding the column player's optimal strategy Hallar la estrategia óptima para el jugador columna

We have solved the game for the row player, and we now need to solve the game for the column player. We will follow the same procedure as above. Here again is the payoff matrix: Hemos solucionado el juego para el jugador renglón, y ahora queda solucionar el juego para el jugador columna. Seguiremos el mismo procedimiento que más arriba. Aquí de nuevo está la matriz de pagos:

Let us take the optimal strategy of the column player to be Tomemos, para la estrategia óptima del jugador columna,

$C = \mat[6]{\[x \]!1-x}$ The row player's best pure counterstrategy could be either $R = \mat[6]{\[1 , 0\]}$ or $R = \mat[6]{\[0 , 1\]}$. El jugador renglón tiene dos opciones para us mejor contraestrategia: $R = \mat[6]{\[1 , 0\]}$ o $R = \mat[6]{\[0 , 1\]}$.

Case 1. Caso 1. $R = \mat[6]{\[1 , 0\]}$
As before, we calculate the expected payoff that results: Como más arriba, calculamos el pago esperado que resulta:

Case 2. Caso 2. $R = \mat[6]{\[0 , 1\]}$
In this case, the expected payoff is: En este caso, el pago esperado es:

Here are the graphs corresponding to the two cases above: Aquí están las gráficas que coresponde a los dos casos más arriba:

$e$

This graph and the next will appear only when you have entered both answers above. Esta gráfica y la proxima aparecereán solo cuando has ingresado las dos respuestas más arriba.

This time, for each value of $x$ there are (usually) two values of $e$ corresponding to the two pure strategies the row player can use. Now, the higher one is worse for the column player than the other.

Thus, by the fundamental principle of game theory, for every value of $x$ contemplated by the column player (for instance $x = 0.8$) the row player can be expected to choose the row strategy that does the most harm to the column player (in this instance, the Case strategy). Thus the row player can be expected to choose the strategy that leads to the higher of the two $e$-values. The same is true for every choice of $x$. Below we see the graph again with these choices shown in red.: Esta vez, para cada valor de $x$ hay (usualmente) dos valores de $e$ que coresponde a los dos estrategias puras del jugador columna. Ahora, el valor mayor es peor para el jugador columna que la otra.

Por lo tanto, el principio fundamental de la teoría de juegos nos dice que, para cada valor de $x$ contemplado por el jugador columna (por ejemplo $x = 0.8$) se espera que el jugador renglón escogerá la estrategia columna que hace el daño peor al jugador columna (cuando $x = 0.8$, la estrategia de Caso ).Por lo tanto, se espera que el jugador row escoja la estrategia que resulta en el valor más alto de $e$. Lo mismo se aplica a cada elección de $x$. Más bajo vimos de nuevo la gráfica con estas elecciones mostradas en rojo:

$e$

For each choice of $x$, the red part of the graph gives the corresponding worse outcome for the column player. Para cada elección de $x$, la parte rojo pesada de la gráfica de el resultado peor para el jugador columna.

The column player wants to minimize the expected payoff assuming the worst&emdash;that the expected payoff is given by the red portion of the graph. The value of $x$ corresponding to the lowest point of the red portion of the graph occurs (again) at the point of intersection of the two lines. El jugador columna intenta minimzar el pago esperado mientras suponiendo el peor&emdash;que el pago esperado se da por la porción roja de la gráfica. El valor de $x$ que corresponde al punto más bajo de la porción roja de la gráfica ocurre (otra vez) al punto de intersección le las dos rectas.

The value of $x$ and associated expected payoff $e$ are given by: El valor de $x$ y el pago esperado $e$ que resulta son:

Since we now know $x$, we also know the column player's optimal mixed strategy: Ya que sabemos el valor de $x$, también sabemos la estrategia mixta óptima para el jugador columna:

$C = \mat[6]{\[ x \]! 1-x } = $

Further, if the column player uses that strategy, the expected value of the game will be Además, si el jugador columna usa aquella estrategia, el valor esperado del juego será $e = $ ??.

Notice that the expected payoff is the same if either player uses their optimal mixed strategy. You can also check that, if both players use their optimal mixed strategies, then the expected value of the game is still the same: Observa que es pago esperado es lo mismo si cualquiera de los dos jugadores usa su estrategia mixta óptima. Puedes comprobar taqmbién que, si ambos jugadores usan sus estrategias mixtas óptimas, entonces será lo mismo el valor esperado del juego:

...

This value is the value of the of the game. We have in fact solved the game: Each player's optimal strategy mixed is the one shown above. Este valor es el valor esperado del juego. De hecho, hemos solucionado el juego: El estrategia óptima de cada jugador es la estrategia mixta mostrada arriba.

Now try some of the exercises in Section 3.4 of or , or try the topic true false quiz. Ahora prueba algunos ejercicios en la sección 3.4 de o , o bien el concurso verdadero falso de este tema.

var i = 0; if (theLanguage == "es") i = 1; var theTitle = parent.theTut.titles[i]; document.writeln(theTitle); if (parent.playingGame) document.writeln('<i><font color = #C93344>' + gameVersion + '</font></i>')

writeText("Goodies","Cosas buenas")

Finding the row player's optimal strategy Hallar la estrategia óptima para el jugador renglón

Finding the column player's optimal strategy Hallar la estrategia óptima para el jugador columna