Use this skill for any strategic interaction where players move one at a time, each player can observe what the previous player did, and the game ends in a finite number of moves.
The core principle: Rule 1 — Look forward and reason backward. Anticipate where your initial decisions will ultimately lead and use that information to calculate your best current choice. This rule applies whether the game lasts two moves or two hundred.
The key insight backward reasoning delivers: a future action that "lies in the future does not mean it is uncertain." If you can deduce what a rational opponent will choose at a later node — because you know their preferences — you can treat that future action as a known fact today when computing your own best move now.
This skill applies when all of the following hold:
This skill does NOT apply to:
-> Ask: "Walk me through who acts and in what order."
-> Ask: "At each turn, what are the available actions?"
-> Ask: "At each possible ending, how do you rank the outcomes? Best to worst is enough if exact numbers are unavailable."
-> Ask: "Does each player see what the other did before choosing their own move?"
Why: Backward induction is the correct tool only for sequential games with observable moves. Applying it to simultaneous games produces wrong answers. Spending thirty seconds on classification prevents wasted analysis.
1a. Is it sequential or simultaneous?
1b. Is information perfect?
1c. Is it finite?
Why: The game tree is the visual logic of the game. It makes the sequence of decisions explicit and prevents the common error of optimizing the first branch without considering what happens downstream. A game tree and a decision tree differ in one critical way: at decision-tree nodes only one person chooses; at game-tree nodes, different players may choose at different nodes, each optimizing for their own preferences.
Structure of a game tree:
Construction procedure:
Critical rule: Even if you know that certain branches will never be reached (because a player will not rationally choose them), you must still resolve what each player would do at every conceivable node. The reason: a player at an earlier node makes choices based on what the other player would do if that branch were taken. Pruning a branch without resolving it first is a mistake.
Why: Forward reasoning — picking the branch that looks best at the first move — fails because it ignores what rational opponents will do in response. The correct procedure is the reverse: start at the end, where outcomes are fully known, and fold optimal decisions back toward the start. This converts every future branch into a known consequence, making the first move's true value calculable.
Backward induction procedure (step by step):
Worked illustration (Charlie Brown / Fredo Investment):
The same tree structure covers both cases:
Player A
├── Action 1 → Player B
│ ├── B's preferred action → Outcome: [A: bad, B: good]
│ └── B's other action → Outcome: [A: good, B: okay]
└── Action 2 → Outcome: [A: neutral, B: neutral]
Backward induction at B's node: B prefers "B's preferred action." Fold back: if A takes Action 1, the realized outcome is [A: bad, B: good]. Compare to Action 2: [A: neutral]. A prefers neutral to bad → A takes Action 2. Equilibrium: Action 2 without ever reaching B's node.
Common error to avoid: Do not prune B's node without resolving it. A must compute what B would do if reached — even if A ends up not taking that branch.
Why: For games where players alternately remove objects from a pile (or similar combinatorial structures), drawing the full tree is impractical for large games. The pattern in the terminal logic propagates backward into a closed-form formula that immediately identifies winning and losing positions.
The k+1 formula:
In a game where each player may take 1 to k objects per turn, and the player who takes the last object wins:
21-flags example (k=3, so k+1=4):
The "hot potato" variant (player who takes last loses):
The same formula applies but the parity flips: losing positions are still multiples of (k+1), but now the positions are interpreted in reverse — check whether facing a multiple means you are forced to take the last item.
Generalizing: Any time a combinatorial game has a cycle length derivable from the rules, apply backward induction to the smallest cases (1, 2, 3, ... objects) to identify the pattern, then project it forward.
Why: When a player needs multiple unlikely successes to reach a goal, and can attempt them in any order, the correct sequence is to attempt the harder or riskier action first while fallback options remain open. Attempting the easier action first and failing the harder one second results in a forced loss that the reversed sequence would have avoided.
The Orange Bowl principle (Dixit and Nalebuff):
Nebraska needed two touchdowns plus net extra points to win. The coach kicked the safe one-point conversion after the first touchdown. When the second touchdown was scored, he was forced to attempt the two-point conversion with no margin. The correct strategy: attempt the two-point conversion first. If it succeeds, the subsequent one-point kick covers the margin. If it fails, there is still a chance to tie via the one-point kick after the second touchdown. Attempting the risky action first keeps more options alive.
Generalizable rule: If you need both A (risky, ~50%) and B (safe, ~90%), and either order is feasible:
Attempt the riskier action first. This applies to: new product launches before marketing commitments are locked in, career pivots before exhausting current options, negotiation concessions where the harder ask should come before the easier one is spent.
Why: Backward induction gives the correct answer only given its assumptions. Violating those assumptions without acknowledging it produces overconfident, wrong predictions. Identifying limits is part of delivering a sound analysis.
Three conditions that limit backward induction:
Paradox of expanded options: More choices or greater freedom can hurt a player in a sequential game. Adding options to one player (like a presidential line-item veto) changes what the other player anticipates and may cause the other player to respond in a way that leaves the first player worse off. Backward induction reveals these paradoxes; intuition does not.
Structure your output as:
Optimal opening move: [Specific action for the first player, stated plainly]
Full contingent strategy: [Complete if-then plan: "If opponent does X, do Y; if opponent does Z, do W"]
Predicted equilibrium outcome: [The terminal state that backward induction leads to, and each player's payoff there]
Key reasoning: [The one or two backward induction steps that most determine the outcome — the nodes where the game is effectively decided]
Applicability notes: [Any assumptions made about opponent preferences or probability estimates, and how violations would change the answer]
Rule 1 is the foundation. Look forward and reason backward. All other steps implement this rule.
Future actions are predictable, not uncertain. In a sequential game, a rational opponent's future choice follows from their preferences — it is deducible, not random. Treating predictable future actions as "uncertain" is hope over analysis.
Resolve every node, even unreachable ones. A player at node A chooses based on what would happen at node B if A chose to go there. If node B is unresolved, node A cannot be correctly decided.
Game trees and decision trees differ in kind. A decision tree has one optimizer throughout. A game tree has multiple optimizers, each at their own nodes, each optimizing for themselves. Do not collapse a game tree into a single-optimizer problem.
More options can hurt. In sequential games, gaining additional choices can signal different intentions to opponents and cause their responses to shift unfavorably. Backward reasoning reveals this paradox; forward intuition misses it.
Attempt risky actions while fallback options remain. Risk sequencing is a corollary of backward induction: work backward from what you need to achieve, and structure the order of attempts so that failure of the hardest step leaves the most options open.
The first-mover advantage is derivable, not assumed. Whether moving first helps or hurts depends entirely on the structure of the game tree — backward induction tells you which.
Setup: 21 flags; players alternate; each turn take 1, 2, or 3; player who takes the last flag(s) wins.
Apply k+1 formula (k=3, so k+1=4):
Full contingent strategy for first player: Take 1. Whatever opponent takes (n), take 4-n. After your turn the count is always a multiple of 4. Opponent is trapped.
Why Sook Jai lost: They took 2 on the opening move, leaving 19 — not a multiple of 4. This handed the initiative to Chuay Gahn. The tribe needed to reason all the way back to the opening move; recognizing the 4-flag trap mid-game is too late.
Setup: Charlie considers investing $100K with Fredo. If Charlie invests, Fredo can honor the contract (Charlie nets $150K, Fredo nets $250K) or abscond (Charlie loses $100K, Fredo keeps $500K). If Charlie does not invest, both get $0.
Game tree:
Charlie
├── Invest → Fredo
│ ├── Abscond → [Charlie: -$100K, Fredo: +$500K]
│ └── Honor → [Charlie: +$150K, Fredo: +$250K]
└── Don't → [Charlie: $0, Fredo: $0]
Backward induction at Fredo's node: Fredo prefers $500K over $250K → will Abscond.
Fold back: If Charlie invests, realized outcome is [Charlie: -$100K]. Compare to Don't: [Charlie: $0]. Charlie prefers $0 → Don't invest.
Equilibrium: No investment; both get $0. The mutually beneficial outcome ($150K/$250K) is blocked by the inability to commit credibly.
Caveat: This changes if the game is repeated or if Fredo has other US-dependent business interests — these create an ongoing game with reputation effects that can sustain cooperation.
Setup: Nebraska needs 2 extra points net across two touchdowns. Option A: two-point conversion attempt (~50% success). Option B: one-point kick (~95% success). Osborne's order: B then A. Alternative: A then B.
Backward induction on Osborne's order (B first):
Backward induction on alternative order (A first):
Key insight: The only scenario where order matters is when exactly one attempt fails. If A fails first, the fallback (B on the second touchdown) still yields a tie. If B fails first (Osborne's plan), the second touchdown forces a must-make two-pointer with no margin. Attempt the risky action first.
references/game-tree-construction.md — Detailed node-labeling conventions, tree notation for complex games, handling chance nodesreferences/combinatorial-game-formulas.md — k+1 formula derivations, hot-potato variants, multi-pile Nim, worked tables for games up to 100 objectsreferences/applicability-checklist.md — Diagnostic questions for classifying a game (sequential vs. simultaneous, perfect vs. imperfect information, finite vs. infinite)references/risk-sequencing-patterns.md — Generalization of the Orange Bowl principle to business, negotiation, and career scenarios; worked examples with probability treesThis skill is licensed under CC-BY-SA-4.0.
Source: BookForge — The Art of Strategy by Avinash K. Dixit, Barry J. Nalebuff.
This skill is standalone. Browse more BookForge skills: bookforge-skills
共 1 个版本
暂无安全检测报告