Heterogeneously Perceived Incentives in Dynamic Environments:
Rationalization, Robustness and Unique Selections††thanks: August 12, 2025. Contact: evan.piermont@rhul.ac.uk (Piermont), p.zuazogarin@hse.ru (Zuazo-Garin)
Abstract
In dynamic settings each economic agent’s choices can be revealing of her private information. This elicitation via the rationalization of observable behavior depends each agent’s perception of which payoff-relevant contingencies other agents persistently deem as impossible. We formalize the potential heterogeneity of these perceptions as disagreements at higher-orders about the set of payoff states of a dynamic game. We find that apparently negligible disagreements greatly affect how agents interpret information and assess the optimality of subsequent behavior: When knowledge of the state space is only ‘almost common’, strategic uncertainty may be greater when choices are rationalized than when they are not— forward and backward induction predictions, respectively, and while backward induction predictions are robust to small disagreements about the state space, forward induction predictions are not. We also prove that forward induction predictions always admit unique selections à la Weinstein and Yildiz (2007) (also for spaces not satisfying richness) and backward induction predictions do not.
Keywords: Dynamic Games, Incomplete Information, Rationality, Robustness, Model Misspecification, Forward and Backward Induction, Common Knowledge, Unique Selections
JEL Classification: C72, D82, D83
Introduction
On October 26th, 1597, 13 vessels commanded by Joseon Kingdom admiral Yi Sun-sin stood in front of a Japanese fleet of over 120 warships in the Myeongnyang Strait, on the southwestern coast of the current Republic of Korea. Aware of the disparity of forces, Yi planned the location of the battle carefully and Myeongnyang was chosen due to its narrowness, which would preclude his small fleet to be vastly outnumbered once attacked, and its unique tidal conditions, with currents that strongly flow first to the North and abruptly switch southward in three-hour intervals. With both fleets at Myeongnyang, the favorable current and their formidable numerical superiority led the Japanese to attack. Yi’s army stood in the Northern end of the strait, where the northward flow was already calmer. Together with the narrowness of the strait, this enabled the Joseon fleet to repeal the attack until the direction of the currents switched. The unexpected reversion left the Japanese warships unable to maneuver, drifting backward and colliding among themselves, and the initial violent impulse of the new current in the Northern end helped the Joseon vessels strike their shaky enemy and sink 30 warships, eventually leading to the retreat of the Japanese fleet and the collapse of their campaign.
The tidal configuration of the strait, that is, the state (of, literally, nature), was a key determinant of the outcome of the conflict. Joseon’s commanders were aware of it, Japan’s were not; Joseon’s army based their strategic decisions on their knowledge of the state, Japan’s army based their strategic decisions persistently ignoring the state: They did not conceive the possibility of such an disadvantageous contingency, not even after observing that their enemy, who they knew to be familiar with the terrain and led by a commander of extraordinary skill, decided to fight in so extremely unbalanced conditions. The history relies thus on two elements every economist is familiar with. On the one hand, (nonstrategic) contingencies surrounding the interaction can affect outcomes, making economic agents eager to anticipate them; on the other, agents can update their initial assessments about these contingencies by eliciting others’ private information via the rationalization of observed behavior. For example, the gains from bargaining depend on the valuation parties assign to each agreement, and the initial sequence of offers is often aimed at outguessing the valuation of the other party and hiding one’s own valuation; the optimality of the price a financial trader offers depends on the risk-costs that govern other traders’ supply and demand distributions, and the trader can make inferences about these costs by making sense of others’ trading patterns.111See Kennan (2001) and Sannikov and Skrzypack (2016), respectively. Like at Myeongnyang, the success of rationalization relies on correctly assessing the possible contingencies others have in mind. The possible heterogeneity is incommensurable, but standard economic modeling systematically assumes that, despite different initial probabilistic assessments of each contingency, agents commonly agree on the set of all the possible contingencies, so that any state, even if initially ignored, can be considered upon observed behavior.
This paper introduces a general, tractable theory of persistent disagreements about the possible payoff-relevant, nonstrategic contingencies involved in a dynamic interaction. In it, we show that these disagreements, no matter how apparently negligible, can have a drastic impact on the way information is inferred from observed behavior and, in consequence, can revert our insights about some key features of standard economic predictions. Our findings cast doubts on the methodological validity of equilibrium refinements based on forward induction arguments (i.e., the idea that observed behavior is informative of private information and future plans): Arbitrarily small discrepancies regarding the set of possible contingencies can lead to an erratic interpretation of behavior that results in ‘anything goes’. Predictions based on forward induction are extremely sensitive to modeling details, to the extent that arbitrarily small misspecifications about what agents agree on can lead to unexpected predictions ignored in the benchmark model. This lack of robustness vanishes if, instead, predictions are based on backward induction; that is, if agents are assumed to render observed erratic behavior as a mistake and not as a product of unexpected private information and a signal of future intentions. Finally, we present an contagion argument à la global games that allows for creating unique selections for every forward induction outcome.
More specifically, our first contribution is methodological and consists in formalizing the idea that the set of payoff states of a dynamic game might not be commonly known. This is attained by endowing each player with a subjective payoff structure, which is a hierarchical construction whose first component is the set of states considered by the player, whose second component is the set of states the player understands her opponents to consider, and so on.
Unlike initial belief-hierarchies about the set of states, these subjective payoff structures are assumed to remain invariant in the player’s mind as the game unfolds, and place limits to how the player’s initial belief-hierarchy can be updated in response to observations throughout the game. We refer to a pair consisting of a subjective payoff structure and an initial belief-hierarchy as a subjective model, and to the profiles of subjective models in which players share common knowledge of a set of states, as a standard model.222Thus, in a sense, we reinterpret the set of states as a persistent belief instead of an objective, material object. This way, a model captures the dichotomy between the beliefs that can be updated, the initial ones, and the ones that will never be abandoned. In this respect, our definition of a subjective payoff structure goes hand by hand with the idea of -restrictions à la Battigalli and Siniscalchi (2003), with the novelty that these restrictions may be not commonly known. In this sense, and in line with the ‘Wilson doctrine’, our approach consists in tearing apart common knowledge assumptions and increasing the level of subjectivity in the formalization of a game:333As recalled by Chung and Ely (2007), Wilson (1987) writes: “Game theory has a great advantage in explicitly analyzing the consequences of trading rules that presumably are really common knowledge; it is deficient to the extent it assumes other features to be common knowledge, such as one agent’s probability assessment about another’s preferences or information. […] I foresee the progress of game theory as depending on successive reduction in the base of common knowledge required to conduct useful analyses of practical problems. Only by repeated weakening of common knowledge assumptions will the theory approximate reality.” While we maintain common knowledge of the extensive form of the game (i.e., the rules: Players, turns, available actions, information feedback, etc.), the payoff structure (i.e, the information regarding preferences) is completely subjective, possibly representing delusion, and envisioned as a collection of element of a universal construction which is trivially commonly known, namely, the set of all possible utility functions. We would like to be emphatic about how natural this separation of rules and stakes is: While the possible outcomes of an economic interaction can be objectively described, there is nothing intrinsically objective in the notion of utility; the allocation of resources that results from each combination of reported preferences can be commonly known, the market shares that correspond to each different combination of pricing strategies can be commonly known, etc. But each individual agent’s preference over material outcomes is a personal trait of hers. In consequence, it seems reasonable to remove the set of states, or possible utilities, from the objective (in the sense of ‘commonly believed’) part of the game.444Of course, the subjectivization of payoff-functions is standard practice since Harsanyi (1967–1968); the formalization in this paper goes beyond that: The state space being subjectively perceived implies that the ways in which beliefs can be updated is subjective as well. By doing so, our approach allows for great heterogeneity regarding players’ interactive perception, to the extent that the standard case of a commonly known set of states turns out to be a knife-edge situation, and important established facts and understandings concluded from this kind of modeling, extremely fragile.
The first of such understandings is the insight that the outcomes of forward induction reasoning, usually formalized by extensive form rationalizability (Pearce, 1984, Battigalli, 1997), refine those of backward induction. This is known to be the case for games with complete information, both for the case of perfect information (Battigalli, 1997, Heifetz and Perea, 2015 and Perea, 2018a) and imperfect information (Chen and Micali, 2013, Perea, 2018c and Catonini, 2020). Whether an analogous result holds for the case of incomplete information remains an open question (backward induction being formalized here as an interim version of Penta’s (2011; 2015) backward rationalizability); however, our counterexamples establish that the answer is negative when the set of payoff-states of the dynamic game is not commonly known, no matter how small players’ discrepancies are.555While we did not attempt to prove this generalization, we conjecture that the refinement result should hold for dynamic games with commonly known sets of payoff-states. In our counterexamples for the case of not commonly known state spaces, it is precisely the lack of common knowledge what seems the key driving force. The key idea that triggers the observation is that when the state space is not commonly known, it can sometimes be possible for a player to choose rationally an action the other player cannot rationalize (the action may be undominated given the state space the first player considers, but not in the state space the first player thinks the second player considers). Thus, the second player may entertain any arbitrary conjecture about the future behavior of the first one, and this is something the first player can rationally exploit. The observation has some intrinsic interest. While the literature on strategic communication teaches that agents can have incentives to act is if they had less information than they do,666Dye (1985), for instance. in standard game-theoretic models it cannot be that a player rationally pretends not to be rational: If she did so, her opponents would be able to find ways to rationalize her behavior. Subjective payoff structures circumvent this impossibility in a natural manner.777The strategy of playing dumb is a commonplace in popular culture; as Suetonius (121) writes about Emperor Claudius: “He did not even keep quiet about his own stupidity, but in certain brief speeches he declared that he had purposely feigned it under [Caligula], because otherwise he could not have escaped alive and attained his present station. But he convinced no one, and within a short time a book was published, the title of which was ‘The Elevation of Fools and its thesis, that no one feigned folly.” The italics are ours.
The second understanding that collapses is the robustness of rationalizability to small misspecifications of players’ perception of payoff structures. Dekel, Fudenberg and Morris (2007) show that in static settings the predictions of interim (correlated) rationalizability are robust to small misspecifications of players’ higher-order beliefs about the payoff-states: Predictions excluded in the benchmark model do not to arise in slightly misspecified models (i.e., the solution concept is upper hemicontinuous on players’ belief-hierarchies). We partially extend this insight by proving in Proposition 2 that the correspondence that describes extensive form rationalizability is upper hemicontinuous on initial belief-hierarchies.888While we take for granted that this result is ‘folk knowledge’, to the best of our knowledge this is the first available proof of the result. However, the conclusion that extensive form rationalizability is robust in a general sense would be erroneous. Our examples show that the solution concept is not robust to misspecifications of models. It is possible to perturb a dynamic game with a commonly known set of payoff-states and unique extensive form rationalizable outcome so that, for every arbitrarily close by model, extensive form rationalizable outcomes are multiple. These observations shed important light on two aspects of the role of the state space in rationalization:
-
•
First, Penta (2012) and Chen (2012) show that in dynamic games with sufficiently rich set of payoff-states (i.e., state spaces in which for every strategy of every player there exists some payoff-state for which the strategy is strictly dominant), no refinement of interim rationalizability is robust to perturbations of initial beliefs.999Penta’s (2012) result is slightly stronger. By allowing for payoff-states about which players may have persistent private information, his result pertains not plain vanilla interim rationalizability, but interim sequential rationalizability, a solution concept that refines interim rationalizability by employing sequential rationality as a notion of optimal behavior, instead of plain rationality. For expositional purposes, throughout the paper we refer solely to ‘interim rationalizability’ and bundle Penta’s (2012) and Chen (2012) work together: The moral of both papers is that no refinement of the predictions given by solution concepts characterizing the behavioral implications of rationality and (only) initial common belief in rationality is robust. Our Proposition 2 shows that extensive form rationalizability is robust in this sense. Since extensive form rationalizability refines interim rationalizability, these results together imply that in games where the state space is rich, interim rationalizability and extensive form rationalizability coincide (Corollary 1). The intuition is obvious. If for every strategy there is some state that makes that strategy dominant then every observed behavior can be rationalized; thus, there is nothing rationalization helps to falsify. In consequence, while richness assumptions are innocuous in static games (specifying an interim belief makes every state ignored by the belief strategically irrelevant), our results highlight their problematic nature in dynamic environments.
-
•
Second, the failure of upper hemicontinuity of extensive form rationalizability on subjective payoff structures reveals the high sensitivity that rationalization processes display w.r.t common knowledge assumptions. On the one hand, as soon as the knife-edge supposition that players commonly agree about the state space is abandoned, the rationalization a player makes might be erroneous, or a player may fail to rationalize some observed behavior which was indeed rational. In consequence, despite the conceptual appeal of forward induction, the conditions for it to be applied successfully seem to crucially rely on knife-edge assumptions that are tremendously demanding, and hard to meet in practice.
We end our analysis of robustness to misspecifications of models by noting that there is a standard solution concept which is robust. Proposition 3 shows that the individual strategies given by backward induction, formalized here as an interim version of Penta’s (2011; 2015) backward rationalizability, are upper hemicontinuous in the whole space of subjective models. The key difference with extensive form rationalizability lies in the fact that, in the latter, the restrictions on beliefs are placed only on the histories that are reachable given the subjective payoff structure. Since this set of histories is not, in general, lower hemicontinuous on the subjective payoff structure, upper hemicontinuity of extensive form rationalizability collapses (that restrictions are satisfied along a sequence cannot guarantee that they are also satisfied in the limit). On the contrary, backward induction reasoning places restrictions on beliefs at every history, and thus, in a way, independent of the subjective payoff structure. We end by noting that, since backward rationalizability also refines interim rationalizability, in dynamic games satisfying richness backward rationalizability, extensive form rationalizability and interim rationalizability all coincide (Corollary 2), what further reinforces our critique of richness assumptions in dynamic games.
Finally, we present a partial structure theorem for extensive form rationalizability in the line of the seminal result by Weinstein and Yildiz (2007). Theorem 1 shows that for every standard model, every outcome consistent with extensive form rationalizability is the unique one also consistent with extensive form rationalizability for a sequence of profiles of perturbed subjective models. Notably, the theorem does not require the state space of the benchmark standard model to satisfy richness: Even if it does not, it can be approximated by profiles of subjective payoff structures that do satisfy richness at increasingly higher-orders (a player’s space may not be rich, but she may think that her opponents’ are, or may think that her opponents think that their opponents’ are, etc.; indeed, we show in Proposition 1 that this notion of higher-order richness is generic in the space of all subjective payoff structures). Thus, our structure theorem applies to virtually all finite dynamic games with incomplete information, unlike those by Penta (2012) and Chen (2012), that are restricted to games whose state space satisfies richness (and in which, as seen above, rationalization has no bite). However, while its proof is technically challenging, the conceptual consequences of Theorem 1 remain unclear. On the one hand, unlike Weinstein and Yildiz (2007), Penta (2012) and Chen (2012) we cannot claim that the predictions of the solution concept we study are generically unique: Our counterexamples document that a standard model can be approximated by a sequence of profiles of perturbed subjective models in which multiplicity of extensive form rationalizable outcomes is robust. On the other hand, it is not clear which relevant solution concept, if any, is a refinement of extensive form rationalizability, so that it is not clear how the theorem could be applied to deny the robustness of such hypothetical solution concept.
Nonetheless, the result suggests two possible lines of research. First, it leaves the characterization of the sharpest predictions robust to misspecifications of models as an open question. While the predictions of backwards rationalizability are robust, we document via a counterexample that they do not always admit unique selections. We conjecture then that, if a robust solution concept admitting unique selections exists, it must consist in some hybrid reasoning procedure combining elements of both backward and extensive-form rationalizability. Second, the proof of the theorem relies on a contagion argument that elaborates the classical one in global games and involves an explicit interplay between initial assessments, persistent information and the interpretation of observed behavior. It remains then to be explored whether such an interplay, slightly more complex than the standard one in dynamic global games, provides a rationale for the arousal of endogenous coordination.
The rest of the paper is structured as follows. Section 2 formalizes the basic elements for the analysis of dynamic games with incomplete information. Section 3 presents a tool to formalize players’ disagreements of arbitrary high-order about the set of payoff-states of the game, and extends the definition of extensive form rationalizability and backward rationalizability to account for this feature. Section 4 presents the main findings of the paper. Section 5 ends with some literature review and new directions for research. All the proofs are relegated to the appendix.
Preliminaries
The formalization of dynamic games in Section 2.1 is standard (see Penta, 2012), except for a small addition in Section 2.1.3 concerning the representation of the utility functions and the information thereof players entertain. Section 2.2 reviews the formalization of initial types (or belief-hierarchies) and is completely standard (see Harsanyi, 1967–1968, or Mertens and Zamir, 1985).
Dynamic games
A dynamic game consists in a list where , an extensive form, specifies the set of players, admissible sequences of choices, turns and past choices observed by each player at each turn, and , a payoff structure, formalizes players’ possible preferences over the outcomes of the game and the information players may have about each others’ preferences.
Extensive forms
An extensive form consists in a list where is s finite set of players, and:
-
•
For each player , is a finite set of actions. A history represents the unfolding of the game and consists in a finite sequence of possibly simultaneous choices, i.e., on a finite sequence of elements from , where and for any . We say that history follows history , denoted by , if obtains from adding finitely many possibly simultaneous choices to .101010That is, when there exists some such that .
-
•
and are finite and disjoint sets of histories such that is a rooted and oriented tree with terminal nodes . Symbol denotes the ex ante stage of the game, i.e., the root of the tree, and histories in and are referred to as partial and terminal, respectively. For any player and partial history , let denote the set of actions available to at . Player is active at if is nonempty; let denote the set of these histories. We assume that: a player is never the only active one twice in a row, and whenever a player is active, at least two actions are available to her.
In this context, the set of player ’s strategies is and, as usual, the set of strategy profiles is denoted by and the set of player ’s opponents strategies, by . Obviously, for each partial history each strategy induces a unique terminal history, . Finally, let and denote, respectively, the set of player ’s strategies and the set of ’s opponents’ strategies that reach partial history , and , the set of player ’s histories that can be reached when she chooses strategy .111111To be precise, and on the one hand, and on the other.
Payoff structures
A standard payoff structure (for extensive form ) consists of a list , where:
-
•
is a compact and metrizable set of states of nature and, for each player , is a compact and metrizable set of payoff types. We denote the set of payoff states by and, for each player , the set of profiles of player ’s opponents’ payoff types by .
-
•
For each player , is player ’s continuous utility function.121212Throughout the paper finite sets are endowed with the discrete topology, is endowed with the Euclidean topology, product sets are endowed with the product topology and spaces consisting of closed sets, with the Hausdorff metric. For a given topological space , its corresponding set of measures over the Borel -algebra is endowed with the weak topology. Unless stated otherwise, measurability always refers to Borel measurability.
The set of all standard payoff structures (for extensive form ) is denoted by .131313A more general formalization of standard payoff structures allows for the set of payoff states not being a Cartesian product. We note that the whole theory developed in the paper extends to that case by introducing the necessary obvious changes. Thus, within a dynamic game with incomplete information , each payoff state fully characterizes the profile of players’ utility functions: . Besides, each player is assumed to know her payoff type and (possibly) face uncertainty about the rest of components of . Hence, besides the description of preferences, each payoff state also determines the information players have about the payoff states: In every moment of the game, a player with payoff type knows that the true payoff state is some element in ; in consequence, is, precisely, the set of payoff states that player can eventually deem as possible as the game unfolds.
Canonical representation and convergence of payoff structures
The questions we explore throughout the paper are related to the strategic impact of small relaxation of players’ common knowledge of an information structure . We develop this point in detail in Section 3.1, but before introducing it we need to first clarify what is understood as a small, or perturbation, in a standard payoff structure. The fact that the sets of payoff states can vary vastly across different standard payoff structures and be of radically different nature presents some inconvenience for an analyst interested in assessing how close two different standard payoff structures are form each other. To circumvent this issue we introduce a canonical representation that allows for envisioning every standard payoff structure as part of an object directly derived from the extensive form:
Definition 1 (Canonical representation of payoff structures).
Let be an extensive form and , a payoff structure. Then, the canonical representation of is the list , where:
-
is the set of possible profiles of utility functions:
-
For each player , is the set of profiles of utility functions that player can consider possible as the game unfolds:
The interpretation of each element is similar to that of each payoff state : A description of each player ’s preferences and information. First, consists in a profile specifying a utility function . Second, can be interpreted as the set of profiles that can be conceived by a player already endowed with a payoff type as the game unfolds—anything outside of will never be part of ’s beliefs (even if necessary for rationalizing observed behavior).
Now, the main reason for introducing Definition 1 is that it allows for envisioning payoff structures as informational constraints over a canonical object directly derived from the extensive form, namely, the space of profiles of all conceivable utility functions, : Simply notice that, given , is a compact subset of and that for each , each is a compact subset of —so that is a compact subset of compact subsets of . The notion of convergence for canonical representations of standard payoff is very natural: Say that converges to if:
-
1.
Sequence converges to in the Hausdorff metric for compact subsets of .
-
2.
For each player , sequence converges to in the Hausdorff metric for compact subsets of compact subsets of
The use of canonical representations greatly simplifies (at a conceptual level) our analysis of robustness in Section 4; perturbing payoff structures will consist on simply perturbing profiles of sets instead of perturbing both sets and utility functions whose domain includes these perturbed sets:
Definition 2 (Convergence of payoff structures).
Let be an extensive form. Then, we consider that a sequence of standard payoff structures converges to a standard payoff structure if converges to .
Initial beliefs
At the beginning of dynamic game each player is endowed with a payoff type and an initial belief hierarchy about the state of nature and their opponents’ payoff types, . The payoff type is assumed to remain invariant as the game progresses, but the initial belief hierarchy may be updated in response to observed behavior. More formally, for each player we have a type , where is a payoff type and is a belief hierarchy à la Brandenburger and Dekel (1993), where represents player ’s first-order belief about , represents player second-order belief about both and ’s opponents first-order beliefs, and so on. We assume that the belief hierarchy is coherent, that is, that higher order beliefs marginalize to lower order beliefs and that coherency is common belief.141414The details of the formalization are standard in the literature but, to be more precise, a belief hierarchy for player consists in a sequence where: is an element of and for each subsequence is an element of .
Let denote the set of all such belief hierarchies of player and , the set of all types. We know from Brandenburger and Dekel (1993) that there exists a homeomorphism , where . Throughout the paper, for any player and type we write to represent type ’s payoff type and to represent type ’s belief hierarchy.
Heterogeneously perceived incentives
In these section we introduce the main methodological novelty of the paper. We develop a formalism that allows for players to entertain arbitrary, different perceptions, also at the higher-orders, regarding the payoff structure of the game: Player might be deluded about which the payoff structure player considers it to be, or may think that player is deluded about which the payoff structure player considers it to be, and so on. We formalize the objects that allow for this heterogeneous subjective approach to the game in Section 3.1 and, in Section 3.2, we extend the definitions of extensive form rationalizability and backward rationalizability to the present set-up.
Subjective payoff structures and subjective models
Definitions
Next, we introduce the main new formal element of the paper. A subjective payoff structure is a hierarchical construction that represents how a player perceives the payoff structure of a game, how she perceives her opponents to perceive the payoff structure of the game, and so on. Thus, technically, a subjective payoff structure consists in a list in which the first component is a standard payoff structure (the one the player considers to be the right one), the second component is a map that assigns a subjective payoff structure to each opponent (the one the player considers each opponent to consider the right one), and so on. One consistency condition is required (see part in the definition below): If player considers that are the possible payoff types of player , then she must consider that considers that is (at least in terms of canonical representations) the set of her own possible payoff types—i.e., there is common knowledge in the event that players know their own set of payoff types.
Definition 3 (Subjective payoff structure).
Let be an extensive form. A subjective payoff structure for player is a sequence such that and, for every , we have , where:
Let denote the set of all subjective payoff structures for player .
Throughout the paper, with some abuse of notation, we will indistinctly use to refer to a standard payoff structure or to the subjective payoff structure that represents common knowledge of ; accordingly, we will treat as a subset of .151515That is, for player , such that , , , and so on. For a given subjective payoff structure we denote:
-
•
By and , the subjective payoff structure ascribed by to player , and the list of subjective payoff structures ascribed by to every player , respectively.
-
•
By , the set of states of nature that corresponds to the set of payoff states in standard payoff structure , and by , for each , the set of player ’s payoff types in that same set.
Since subjective payoff structures put restrictions on which payoff states can be considered by the player, obviously, not every type is necessarily consistent with every subjective payoff structure ; in order for the pair be consistent, the payoff type of must be in , the support of its first-order belief must be included in the projection on of , and so on.161616Formally, we require that contains ; that contains the support of ; that for every in the support of and every , and hold w.r.t. ; and so on. For each profile of subjective payoff structures , we let and denote, respectively, the set of player ’s types that are consistent with , and the set of opponents types that are consistent with . Finally, let . Consistent combinations of subjective payoff structures and types play a central role in our analysis, since they fully specify players’ perception of the incentives of the game:
Definition 4 (Subjective models, Standard models).
Let be an extensive form. Then:
-
A subjective model for player is a pair where is a subjective payoff structure for player and is a type consistent with . Let denote the set of subjective models for player .
-
We say that is standard if represents common knowledge of a standard payoff structure. Let denote the set of standard subjective models for player .
-
A standard model is a profile of standard subjective models where the standard payoff structure associated to each , , is the same for every player . Let denote the set of standard models.
Thus, a subjective model fully specifies the player’s perception of the payoff structure of the game; the subjective payoff structure describes the information about preferences the player will restrict her attention to, and the type, her initial probabilistic assessment about the payoff states. If the player perceives the set of states as commonly known, her perception is represented by a standard subjective model and, if every player agree on their perception of the set of payoff states and this fact is commonly known, players’ perceptions are represented by a standard model. We present now an example aimed at illustrating these notions.
Example
Consider the following extensive form:
The mechanics of the game are simple: There are two players ( and ) with two available actions each (to advance and to go down); if, at her turn, a player chooses to go down the game ends, if, instead, she chooses to advance it is then the other player’s turn. Actions are observed, player chooses first and has at most two turns, and player has at most only one turn. Now, by adding a standard payoff structure , we can have a standard dynamic game as the following one, in which for a given , utility functions are commonly known:
Now, if we want to introduce the possibility of discrepancies among players, we can assume that player ’s subjective payoff structure is , representing common knowledge of ,171717I.e., such that , , , etc. and that player ’s subjective structure is given by , with a first component representing the following payoff structure, :
and such that considers that player considers to be commonly known.181818I.e., such that and . Thus, represent a situation in which player considers that is commonly known and player knows this, but considers the payoff structure to be . For each subjective payoff structure, there is a unique type for the player consistent with it: For player , the one that represents common belief in the payoffs given by the unique state in , , and for player , the one that represents probability 1 belief in the payoffs given by the unique state in and in type , . Thus, and are the unique possible subjective models; the first one is not standard, the second one is. Thus, is not a standard model.
Richness and higher-order richness
Richness assumptions play a central role in the literature of structure theorems and unique selections (see Weinstein and Yildiz, 2007, Penta, 2012 or Chen, 2012). The reason is that states in which a strategy is dominant are employed as ‘seeds’ to initiate a contagion argument that propagates through the hierarchy of beliefs: A player might not believe in one of those states, but if she believes that her opponent believes in one of then, then the player should choose a best response to the strategy her opponent considers to be dominant. Formally, richness is defined as follows:
Definition 5 (Richness, c.f. Penta, 2012).
Let be an extensive form and , a standard payoff structure. We say that is rich if for every payoff type and every strategy there exists some payoff type such that is conditionally dominant for player at every payoff state with and such that and are payoff equivalent for every for every pair .
Notice that in the case of standard payoff structures, the notion is rather demanding. It requires that it is common knowledge among players that no strategy is known not to be conditionally dominant. Or, in other words, that it is commonly known among players that every strategy can be found to be conditionally dominant for some appropriate belief about the payoff states. On the contrary, subjective payoff structures allow for greatly relaxing this common knowledge assumption while keeping its bite. Players may know that richness does not hold, but they may consider that their opponents consider it to hold; or they may consider that it does not hold, that their opponents consider it not to hold, but consider that their opponents consider their opponents to consider that it holds. Thus, it is possible to approximate common knowledge of absence of richness while still having the chance to exploit an infection argument: I could perceive the game as not being rich, but perceive that my opponents perceive it to be rich, or only perceive that they perceive that their opponents perceive that it is rich, and so on. In order to formalize the idea that richness holds at high orders but not low ones let us introduce the following definition:
Definition 6 (Higher-order richness).
Let be an extensive form and , a subjective payoff structure for player . Then we say that satisfies:
-
-order richness if is rich.
-
-order richness, for , if satisfies -richness for every .
-
Higher-order richness if satisfies -order richness for some .
Notably, it turns out that higher-order richness is a generic property within the space of subjective payoff structures; thus, while richness is a very demanding notion for games with a commonly known state space, higher-order richness is extremely mild for games with subjective payoff structures:
Proposition 1 (Genericity of higher-order richness).
Let be an extensive form. Then, the set of subjective payoff structures that satisfy higher-order richness is open and dense in .
Discussion: Relaxation of all common knowledge assumptions
We would like to be emphatic about how natural this separation of rules and stakes is: While the possible outcomes of an economic interaction can be objectively described, there is nothing intrinsically objective in the notion of utility; the allocation of resources that results from each combination of reported preferences can be commonly known, the market shares that correspond to each different combination of pricing strategies can be commonly known, etc. But each individual agent’s preference over material outcomes is a personal trait of hers. In consequence, it seems reasonable to remove the set of states, or possible utilities, from the objective (in the sense commonly known) part of the game
Subjective payoff structures allow for relaxing all common knowledge assumptions about payoffs in a simple, tractable way; the only common knowledge assumptions they entail is the trivial one: The set of utility functions players reason about consists in functions that map outcomes to utilities. Furthermore, not only do subjective payoff structures allow for approximating models not satisfying any richness assumption with models that satisfy richness at high orders, but they also allow for approximating models satisfying certain knowledge assumptions (say, the no information case, as in Weinstein and Yildiz, 2007, or Chen, 2012) with models that satisfy different knowledge assumptions at high orders (say, models with persistent private information, as in Penta, 2012). In this sense, subjective payoff structures allow for enhancing the subjective nature of individual strategic decisions and, following the ‘Wilson doctrine’, develop game-theoretic analyses that dispense with unrealistic, excessively demanding common knowledge assumptions.
To the best of our knowledge, the methods for reducing common knowledge assumptions closest to our approach build on Battigalli and Siniscalchi’s (2003) -restrictions. Formally, a -restrictions is a specification of profiles of first-order beliefs, , so that for each player we have a collection where for every payoff type . The difference between -restrictions and profiles of subjective payoff structures is mainly twofold. On the one hand, -restrictions allow for placing restrictions on the evolution of strategic uncertainty, not only payoff uncertainty. On the other, -restrictions impose constraints over higher-order beliefs by assuming that is commonly known among players. Thus, it is possible to envision subjective payoff structures as -restrictions that do not place constrains on beliefs about opponents’ behavior, and include specifications of constraints on higher-order beliefs that do not necessarily entail nontrivial common knowledge assumptions.
The restrictions embodied by subjective payoff structures also provide a reduced from method of representing unawareness in games. The uncertainty about the payoff state could be thought of as arising from the resolution of some unmodeled contingencies regarding physical aspects of the world external to the choices made by the players—a state of nature being shorthand for the particular resolution of this uncertainty that induces such payoffs to each player. Thus, although a player may easily envision the entirety of the abstract space , in actuality he considers on those states relating to some contingency he is aware of.
Now, when a player is aware of a contingency but initially deems it probability zero, he may revise his beliefs by reasoning about his opponents behavior. This is not so when the player is unaware of the contingency, since observing his opponents actions will not (in general) enlighten him to new contingencies. Thus, the essential attribute of our formalism—persistent, heterogeneous belief about the set of payoff states—can arise as a natural consequence of asymmetric awareness.
Of course, an agent’s perception of other agents’ awareness must be expressible in terms of things he is himself aware of. This is the motivation of various extant restrictions in the awareness literature—closure under subformula in Fagin and Halpern (1988), the confinidness property in Heifetz, Meier and Schipper (2006), property F1 in Piermont (2019). The translation to our set up would be: if is aware that is aware of then must be himself aware of . This conceptual restriction can be made formal via the edict: . The set of payoff states that considers to consider is a subset of those he himself is aware of.
Solution concepts
Conjectures and sequential rationality
The evolution throughout the game of each player’s subjective beliefs is presented by conjectures. Formally, for each player endowed with a subjective model , a conjecture consists in a list of beliefs satisfying that:
-
For every history , either initial or in which player is active, is a probability measure on that assigns probability to .
-
Whenever possible, beliefs are updated following conditional probability.
-
The initial beliefs represented by are consisted with those represented by :
We denote the set of conjectures consistent with subjective model by .
Given a payoff type and a history , each conjecture naturally induces a conditional expected payoff for each strategy :
Finally, given a payoff type and a conjecture , the notion of sequential rationality is then captured via the set of (sequential) best responses:191919Notice that, if we denote the set of all conjecture of player as , then is upper hemicontinuous and nonempty-valued.
Extensive form rationalizability
Extensive form rationalizability formalizes the notion of forward induction; that is, the idea that while forming conjectures about opponents’ behavior, players take observed past choices into account. Specifically, as clarified by the epistemic analysis by Battigalli and Siniscalchi (2002), players rationalize observed behavior to the ‘highest possible degree’ of strategic sophistication the behavior is consistent with: Rationality; or rationality and belief whenever possible in opponents’ rationality; or rationality, belief whenever possible in opponents’ rationality and belief whenever possible in opponents’ belief whenever possible in their opponents’ rationality; etc. Thus, the main conceptual feature of the solution concept is that observed behavior is employed to infer other players’ private information and that conjectures about behavior are based on this inference. To be more specific, if a player entertains certain beliefs about the payoff type of an opponent, upon observing a choice that would have been irrational for that payoff type, if possible, she should update her beliefs about the payoff type and believe that the true type is one for which the observed behavior is rational.
Backward rationalizability
Backward rationalizability (see Penta, 2011, 2015) extends the standard notion of backward induction to games with incomplete information. The reasoning process behind differs from that behind extensive form rationalizability seen above: Here, upon observing unexpected behavior, players do not necessarily try to find a rationale for said behavior (even at the cost of dropping their belief in ‘common belief in rationality’); instead, they can interpret observed behavior as a mistake and maintain their belief that ‘common belief in rationality’ holds.202020It is not conceptually accurate to talk about common belief in rationality in a dynamic game, not at every history this belief can be held; however, we keep this terminology in the present discussion for expositional simplicity. Thus, the difference between the extensive form rationalizability and backward rationalizability is one of epistemic priority (see Catonini, 2019): While the former dictates to update higher-order beliefs about payoff states in order to perform a rationalization to the highest possible degree, the latter permits to maintain the higher-order beliefs about payoff states, believe that unexpected behavior is uninformative and keep the faith on continuation behavior being consistent with rationality and common belief thereof.
Under subjective payoff structures, Penta’s (2011; 2015) original notion is generalized as a correspondence that maps each player ’s subjective model to a certain subset of strategies . It is easy to check that the following Definition 8 below captures the idea, formalized in epistemic language by Perea (2014) and Battigalli and De Vito (2018),212121Unlike Penta (2011, 2015), and in order to make the comparison with extensive form rationalizability more immediate, we drop the requirement that a player has beliefs about her own strategy. Nonetheless, it is immediate that the version we employ captures the idea of ‘continuation behavior consistent with common belief in rationality’ as formalized by rationality and common belief in future rationality in Perea (2014) or common full belief in optimal planning and in belief in continuation consistency in Battigalli and De Vito (2018). that players believe that rationality will hold at every possible higher-order in the continuation game, even after moves inconsistent with it. First, define the set of profiles of player ’s opponents’ strategies that are equivalent to in the continuation game following history as:
Then, backward rationalizability is defined in our context as:
Definition 8 (Backward rationalizability, c.f. Penta, 2011).
Let be an extensive form. Player ’s set of (interim) backward rationalizable strategies for subjective model is where and , and for each ,
Results
The application of the theoretical framework developed above will shed light on the impact on strategic behavior of players’ discrepancies about the state space. Section 4.1 shows that extensive form rationalizable outcomes are not always a refinement of those by backward induction, and that they are not robust to misspecifications of subjective models. Section 4.2 shows that extensive form rationalizability delivers predictions that are robust to misspecifications of initial beliefs (Proposition 2) and that backward rationalizability delivers predictions that are robust to misspecification of subjective models (Proposition 3); a direct consequence of these results is that, in dynamic games where the set of payoff states is rich, extensive form rationalizability, backward rationalizability and interim rationalizability all coincide (Corollaries 1 and 2).222222By interim rationalizability we refer to Dekel, Fudenberg and Morris’s (2007) interim correlated rationalizability applied to the normal-form representation of the dynamic game. In Penta (2012) interim rationalizability is defined using sequential rationality instead of ex ante rationality, and the solution concept is dubbed interim sequential rationalizability. In the games considered by Chen (2012), where players have no private information, interim sequential rationalizability and interim rationalizability coincide (see Penta (2012), p. 648). Finally, in Section 4.3 we prove that, by perturbing standard models, it is possible to create unique selection arguments à la Weinstein and Yildiz (2007) so that, for any standard model and any extensive form rationalizable outcome, the outcome is the unique prediction along the misspecified subjective models (Theorem 1), and we end by presenting an example that illustrates why such a unique selection argument is impossible for backward rationalizability.
Rationalization as a refinement criterion
Main counterexample
Consider again the extensive form studied in Section 3.1.2. Let us endow it with a standard payoff structure such that the resulting dynamic game is:
Now, for each player let represent the standard subjective payoff structure associated to . Clearly, there is a unique type for player consistent with , namely the one representing common belief in , which we denote by . Thus, there is a unique profile of subjective models, where for both , which is, indeed, a standard model. The strategic analysis of this game yields the following conclusions:
-
•
The unique prediction consistent with extensive form rationalizability is (outcomewise). This is easy to see. First, player will never choose at her second turn. Second, if player finds herself at , while she can be surprised that took the risk of advancing instead of guaranteeing a payoff of , she need not abandon the belief that 1 is rational. Player may have advanced, rationally, by believing that does not expect her (i.e., 2 does not expect 1) to be rational in her second turn, and therefore believing that will advance as well. Thus, since maintains the belief that is rational, she will expect to choose in her first second turn, and will therefore choose . Being able to forecast this, player will choose in her first turn.
-
•
The unique prediction consistent with backward rationalizability is as well. This is immediate.
Thus, both solution concepts uniquely predict is played. Suppose now that we perturb the game using a sequence consisting of the subjective models defined in Section 3.1.2, . For each , the situation can be represented as follows:
Remember that here players commonly agree on the payoffs that correspond to terminal histories , and . However, in player ’s mind they also commonly agree about the payoffs corresponding to being , but this is wrong: In player ’s mind, the payoff that corresponds to is , but she knows that player believes in the commonly agreement on . Obviously, the larger , the closer we are to the game with commonly known payoff structure . This is the sense in which we say that converges to . For each given , the corresponding strategic analysis yields the following conclusions:
-
•
The predictions consistent with extensive form rationalizability are multiple now: , and . This is easy to see. Upon observing , which in her mind is strictly dominated, player may drop the belief that player is rational, and may have any arbitrary belief about 1’s future behavior. She might believe that player will be completely irrational and choose in her second turn, or she might believe player will be more sensible next time and choose . In consequence, both and are rationalizable choices for player . As a result, , and are all rationalizable strategies for player , depending on what she expects player to choose.
-
•
The unique prediction consistent with backward rationalizability is again. Also again, this is immediate.
The example shows that extensive form rationalizability refining backward rationalizability crucially depends on players commonly agreeing on the set of possible payoff states. As arbitrarily small disagreements arise, both and become consistent with extensive form rationalizability; however, none of them is consistent with backward rationalizability. Importantly, the example delivers an additional lesson: Extensive form rationalizable outcomes fail to be upper hemicontinuous, even in standard models; all , and are consistent with extensive form rationalizability along the sequence, but only is so in the limit.
Conclusions
Section 4.1.1 illustrates the critical dependence of rationalization processes to apparently negligible modeling details of the state space. As a result, the intuition that forward induction (i.e., extensive form rationalizability) always provides refinements of backward induction (i.e., backward rationalizability) is revealed as both extremely fragile and dependent on agents perfectly agreeing on the possible payoff-relevant nonstrategic contingencies of the environment. That is, arbitrarily small discrepancies among players can lead to the collapse of usual refinement criteria based on the use of observed behavior in order to form conjectures about future behavior.
The reasons for the conceptual discontinuity are evident. There is an option that is rational for player that, because of an arbitrarily small discrepancy on the perception of the payoff structure, cannot be rationalized by player . Hence, player can try to play dumb by choosing to advance and obtain, depending on the reaction of , a better payoff than the one backward rationalizability allows for. In contrast to the standard environment with a commonly known payoff structure—where it is always impossible that a choice rendered as irrational by her opponents is potentially beneficial for a player—arbitrarily small disagreements about the payoff structure ca permit an agent to profit from seemingly irrational choices. In technical terms, the observation is the result of the interplay between two factors:
-
1.
The correspondence that maps subjective payoff structures to histories that can be reached by extensive form rationalizable strategies of the opponents is not lower hemicontinuous. This is an obvious fact, if only because it is well-known that the best response correspondence is not lower hemicontinuous. It is also easily visible in the example. In the limit, initial action is rational for player 1 and thus, player ’s history is reachable by strategies in . On the contrary, for every , history is not reachable by any strategy in . To put it explicitly: A history that can be reached in the limit may not be reached along the perturbation.
-
2.
Extensive form rationalizability puts certain restrictions on beliefs only at some histories: requiring that player believes that player is rational at every history that could have been reached if this was the case, but no constraint is placed in the remaining histories. This, together with the lack of lower hemicontinuity mentioned above leads to the collapse of upper hemicontinuity. To see it, let be a sequence of player ’s conjectures such that, for each , assigns, at every history reachable by some strategy in , probability to player choosing a strategy in . Clearly, this constraint only applies in . Hence, even if the sequence is convergent, there is no way to guarantee that its limit, , will assign, at history , probability 1 to player choosing a strategy in : The elements in the sequence do not satisfy any particular requirement at history . In consequence, it is impossible to ensure that a sequence of conjectures that justifies the inclusion of strategy in each converges (or has a subsequence that does so) to a conjecture that justifies the inclusion of in .
Besides the above, the example delivers other two additional important takeaways. First, that extensive form rationalizability is not robust to misspecifications of models (in particular, of subjective payoff structures): Arbitrarily small discrepancies on the set of payoff states can lead to predictions that where ignored in the benchmark model ( and in the example). Second, that multiplicity of the extensive form rationalizable outcomes can be robust: It is easy to see that, in the example, for every , outcomes , and are all consistent with extensive form rationalizability for every model close enough to . These two observations, further discussed in Sections 4.2 and 4.3 respectively, contradict some established facts of static games. On the one hand, since in static games interim beliefs are not updated, all the uncertainty regarding utility functions can be encapsulated by the type, and we know from Dekel, Fudenberg and Morris (2007) that interim rationalizability (the static version of extensive form rationalizability) is robust to misspecifications of types. On the other hand, the seminal analysis of Weinstein and Yildiz (2007) reveals that multiplicity of interim rationalizability is nowhere robust, there exists no type with a neighborhood within which interim rationalizability is always multiple.
Robustness to misspecifications
Extensive form rationalizability
The example above shows that extensive form rationalizability is not robust: Arbitrarily small misspecifications of subjects’ perceptions of the payoff structure can lead to outcomes ignored by the benchmark model.232323Specifically, while the original standard model only admits , predictions , and all are consistent with the perturbed models. Notice however, that the models employed in the example, both the benchmark one and the perturbations, are such the corresponding payoff structures only admit one type. Thus, the lack of robustness of extensive form rationalizability follows from its lack of upper hemicontinuity on subjective payoffs structures. In this section we show that extensive form rationalizability does pass some partial robustness test: For a fixed subjective payoff structure, it is upper hemicontinuous on initial types.
Proposition 2 (Robustness of extensive form rationalizability to misspecifications of initial beliefs).
Let be an extensive form. Then for any player and any subjective payoff structure , the correspondence is upper hemicontinuous.
The conceptual relevance of the result is twofold. First, standard economic modeling assumes that the state space of the game is commonly known and thus, imperturbable. Proposition 2 guarantees that, in these settings, extensive form rationalizable predictions are robust (also at individual level, i.e., in terms of strategies, not only outcomes). Second, as we argue next, the preposition warns against the perils and unfitting consequences of richness assumption in dynamic games. We know from Penta (2012) and Chen’s (2012) work that under richness, no refinement of interim rationalizability is robust. Now, Proposition 2 establishes that extensive form rationalizability is upper hemicontinuous in the settings studied by Penta (2012) and Chen (2012), and we know that extensive form rationalizability is a refinement of interim rationalizability. Hence, the conclusion, formally stated in Corollary 1, is immediate: Under richness, interim rationalizability (both ex ante and sequential) and extensive form rationalizability coincide.242424The argument is so straightforward that we omit any proof Thus, richness assumptions seem unsuitable for dynamic games; by killing all the bite of rationalization, they render the use of dynamic modeling a mostly redundant tool. Let us end the discussion by formally presenting the equivalence:252525Battigalli and Siniscalchi (2007) provide a similar insight in a weaker version of the following result (Proposition 5 in p.177): For rich and state of nature extensive form rationalizability and interim rationalizability coincide for those types consistent with initial common belief of .
Corollary 1 (Triviality of rationalization under richness I).
Let be a dynamic game. Then, if is rich, extensive form rationalizability and interim rationalizability coincide.
Backward rationalizability
The example in Section 3 shows that the failure of upper hemicontinuity of extensive form rationalizability on subjective payoff structures is due to the interplay of: the set of histories reachable by extensive-from rationalizability not being lower hemicontinuous on subjective payoff structures, and the rationalization requirement intrinsic to extensive form rationalizability depending on the subjective payoff structure.262626See bullet points and in Section 4.1.2. Fact implies that beliefs satisfying along a sequence may not imply that the limit belief satisfies , because in the limit, the number of histories at which rationalization should me made can have exploded. Now, the reasoning process that explains backward rationalizability imposes the same constraint at every history,272727Namely, that continuation play will follow according to what backward rationalizability of the previous order would have dictated. and thus, if it holds for beliefs in a sequence, it is always satisfied as well by the limit belief. As the following proposition shows, this intuition backward rationalizability is immune to the mechanism that causes extensive form not to be robust, proves to be correct:
Proposition 3 (Robustness of backward rationalizability to misspecifications of models).
Let be an extensive form. Then for any player , the correspondence is upper hemicontinuous.
It is immediate then that the result below obtains as a corollary. Clearly, this reinforces the critique on the assumption of richness in dynamic modeling.
Corollary 2 (Triviality of rationalization under richness II).
Let be a dynamic game. Then, if is rich, backward rationalizability, extensive form rationalizability and interim rationalizability coincide.
Once backward rationalizable predictions are confirmed to be robust, a question arises naturally: Does backward rationalizability characterize the strongest predictions that are robust to misspecifications of models? That is, is it true that no (nontrivial) refinement of backward rationalizability is robust? The standard argument for results of this kind in static games is that by Weinstein and Yildiz (2007). In those games every interim rationalizable prediction is uniquely selected by some perturbation of the type. In consequence, no refinement of interim rationalizability is robust.282828Suppose it is and take the prediction ignored by the hypothetical refinement. In the perturbation, this prediction should be the unique prediction made by the hypothetical refinement. If the latter was upper hemicontinuous, it would also include it in the limit. The example in Section 4.3.2 below shows that not every prediction of backward rationalizability admits a perturbation that uniquely selects it. In consequence, our analysis leaves two open questions: does backward rationalizability characterize the strongest predictions that are robust to misspecifications of models (which, if true, would require a proof different from the usual approach)? And in case the answer to the previous question is negative, which is the solution concept that does characterize the strongest predictions robust to misspecifications of models?
Unique selections
Theorem for extensive form rationalizability
The following theorem shows that, for any subjective model and any outcome consistent with extensive form rationalizability, it is possible to create a perturbation of the model that allows for the chosen outcome to be uniquely selected along the perturbed sequence—uniquely selected outcomewise, not in terms of strategies. Notice that the theorem does not require that the state space associated to the standard model is rich: It is possible to perturb the state space using profiles of subjective payoff structures whose associated subjective models satisfy higher-order richness, at increasingly higher order the more the state space is approximated. This allows for the usual infection argument to be easily constructed. In consequence, the theorem establishes that dynamic games not satisfying richness also admit unique selections:
Theorem 1 (Structure theorem for extensive form rationalizability).
Let be a dynamic game. Then, for any profile of finite types and any strategy profile there exists a sequence of profiles of models such that:
-
converges to .
-
For every and every strategy profile , .
The interest of the theorem lies on illustrating that extensive form rationalizability admits unique selections in settings in which rationalization adds bite, not only on those in which extensive-from rationalizability is necessarily equivalent to interim rationalizability (because richness of the state space is assumed). On top of that, while the perturbations required do entail higher-order richness assumptions, we know from Proposition 1 that this requirement is generically satisfied in the set of all subjective payoff structures. Thus, not only the consequence, but also the very demanding nature of standard richness assumptions is alleviated.
However, the conceptual implications beyond the previous observations are not clear. On the one hand, (perfect Bayesian) equilibrium outcomes do not, in general, refine those of extensive form rationalizability, so Theorem 1 does not allow for concluding that equilibrium predictions are not robust—as Weinstein and Yildiz’s (2007) structure theorem allows for in the static case. On the other hand, only standard models are perturbed in the theorem (not arbitrary profiles of subjective models), and extensive form rationalizability is not robust to perturbations of subjective models. In consequence it cannot be concluded either that uniqueness of extensive form rationalizable predictions is a generic phenomenon in the space of profiles of subjective models. Indeed, the contrary is illustrated in the example in Section 4.1: It is possible that a standard model is perturbed so that, along the sequence, multiplicity of extensive form rationalizable outcomes is robust. Our last example below shows that unique selection arguments do not exist, at least as a rule, for backward rationalizability.
Counterexample for backward rationalizability
The following example illustrates that, while backward rationalizability is robust to misspecifications of models—see Proposition 3 above, the questions of whether it is the strongest robust solution concept remains open: It is not a conclusion that follows from a standard unique selection argument. As exemplified below, there exist situations in which it is not possible to perturb standard models so that backward rationalizable outcomes are uniquely selected. In consequence, even if true, there is no obvious way to claim that no refinement of backward rationalizability delivers robust predictions.
Consider the following dynamic game with standard payoff structure consisting of states and :
The description of the payoff structure is completed by assuming that player always knows the payoff state, that the second player never knows it, and that these two are common knowledge. The set of backward rationalizable strategies of player for arbitrary type is:
-
(A)
, if assigns positive probability to . At state strategy is strictly dominant for player 1 (who, remember, knows that the state is ). Thus, if player observes that player advances in her first round, she updates her beliefs by excluding the possibility of being the true state. In consequence, becomes her only rational choice.
-
(B)
, if assigns null probability to . In this case it is unexpected for player to observe player 1 advance, would have been strictly dominant if the state had been . Thus, player needs to perform an update of her beliefs from scratch, what allows for the following two possibilities: Was choosing a mistake and, effectively, the state is ? Or where player ’s beliefs wrong and the state is , what justifies player ’s decision to advance? Both updating alternatives (and the corresponding mixed beliefs) are admissible according to backward rationalizable reasoning, and the corresponding only rational choices are and , respectively (and one or either of them, if the updated is a mixed belief).
Pick now standard model , where and and represent common belief in states and , respectively. We claim now that no perturbation of can lead to being uniquely selected for player and thus, to outcome (which is backward rationalizable for ) being uniquely selected. To see it notice first that if a standard subjective model is close enough to then it is the union of two disjoint sets of states, and , such that it is strictly dominant for player to choose at every and to choose at every . Accordingly, for any perturbation of , the same argument as in (A) and (B) leads to the following conclusions: (A’) if in the perturbed model gets positive probability then is player 2’s unique backward rationalizable strategy, and (B’) if in the perturbed model gets zero probability then both and are backward rationalizable for player . As a result, there is no perturbation of in which is not backward rationalizable for player .
Related literature
Our approach bears some similarity with the notion of unawareness (see Fagin and Halpern, 1988; Modica and Rustichini, 1994), and in particular with the state-space or semantic approach to modeling awareness (see Dekel, Lipman and Rustichini, 1998; Heifetz, Meier and Schipper, 2006, 2008; Piermont, 2019), in which agent’s are endowed with a coarse understanding of the true space of uncertainty. In particular, these approaches often model uncertainty via a partial order of increasingly expressive state-spaces: agent’s residing in more expressive state-spaces can only reason about events in lower state-spaces.
Agents who are introspectively unaware, however, might reason that there exist contingencies they are unaware of, without, of course, knowing exactly what such contingencies entail (see Halpern and Rêgo, 2009; Halpern and Piermont, 2019). Subjective payoff structures can capture both naive and introspective unawareness by changing restrictions on the relation between and : when is required to contain then the agent is naively unaware, as he does not consider it possible that considers a contingency he does not.
Note, most game-theoretic analyses of unawareness—Heifetz, Meier and Schipper (2014), Perea (2018b) and Guarino (2020), for instance—have focused on unawareness w.r.t. actions or strategies, not payoff states.
Lack of common knowledge of the type structure has also been studied by Ziegler (2019), and alternative misspecifications of models, by Esponda and Pouzo (2016) and, in the context of macroeconomic models, Hansen and Sargent (2001) and Cho and Kassa (2017).
The study of disagreements about payoff structures follows the literature on how small changes in beliefs and information at high orders affects strategic behavior. Rubinstein’s (1989) email game documents that behavior under common knowledge or under almost common knowledge can vary drastically. Later, Weinstein and Yildiz (2007) show that these discontinuities of behavior are not an isolated phenomenon, but rather, a pervasive feature of games with incomplete information. Penta (2012) and Chen (2012) extend this observation to dynamic games. Ely and Pȩski (2011) and Ruiz G. (2018) characterize the types in which these discontinuities arise in static and dynamic settings, respectively. Penta and Zuazo-Garin (2021) study the strategic impact of the discontinuities corresponding to higher-order uncertainty about the observability of choices, not preferences. Within the literature of robust mechanism design, Oury and Tercieux (2012) and Chen, Mueller-Frank and Pai (2020) study which social choice functions are implementable is a way robust to small misspecifications of higher-order beliefs. To this respect, the new notion of continuity in this paper and our results on the continuity of different solution concepts suggest novel questions and techniques for the study of implementability in dynamic mechanism design.
The counterintuitive nature of behavior being so discontinuous on information has sprung a literature that argues that these discontinuities are an artifact of very specific formalization or unrealistic assumptions on behavior. An approach consists in varying the notions of ‘similarity’ of belief hierarchies or ‘approximation’, as studied by Dekel, Fudenberg and Morris (2006), Chen, Di Tillio, Faingold and Xiong (2010; 2017) or Morris, Shin and Yildiz (2016). Another approach consists in showing that these discontinuities vanish with the introduction of bounded rationality and, specifically, under arbitrarily small departures from the benchmark of rationality and common belief thereof, as in Strzalecki (2014), Heifetz and Kets (2018), Germano, Weinstein and Zuazo-Garin (2020), Murayama (2020) or Jimenez-Gomez (2019). On the contrary, the literature on global games, starting from Carlsson and van Damme (1993), has embraced the discontinuities of choices as an intrinsic feature of strategic behavior and leveraged on them to explain diverse economic phenomena such as currency crises (Morris and Shin, 1998), bank runs (Angeletos, Hellwig and Pavan, 2006; 2007), conflict (Baliga and Sjöström, 2012), overvaluation in financial markets (Han and Kyle, 2017) or disclosure policies for stress tests (Inostroza and Pavan, 2018). Perturbations of state spaces, together with the unique selection argument in Theorem 1, allow for exploring variations of dynamic global games in which: information about the fundamentals is exogenously obtained solely at the beginning of the interaction, and new one is inferred only through observed behavior, and agents can entertain different perceptions of which payoff-relevant contingencies can eventually take place.
Acknowledgments and disclaimers
A very preliminary version of this project was occasionally presented under the title “Rationalization and robustness in dynamic games with incomplete information.” Thanks are due to Pierpaolo Battigalli, Emiliano Catonini, Pierfrancesco Guarino, Jaromír Kovářík, Avi Lichtig, Antonio Penta and Gabriel Ziegler for insightful comments and valuable feedback, and to audiences at HSE University-International College of Economics and Finance and Università Bocconi. Zuazo-Garin acknowledges financial support from the Spanish Ministry of Economy and Competitiveness, from the Department of Education, Language Policy and Culture of the Basque Government (grants ECO2012-31346 and POS-2016-2-0003 and IT568-13, respectively), from the ERC Programme (ERC Grant 579424) and from the Russian Academic Excellence Project ‘5-100’. Zuazo-Garin also expresses his gratitude to Northwestern University and Kellogg School of Management (the Department of Economics, MEDS and the CMS-EMS, particularly) for immense hospitality. The authors are responsible for all errors.
Appendix A Proofs I: Propositions
Proposition 1
See 1
Proof.
For denseness, take any arbitrary standard payoff structure satisfying richness . Then, for any subjective payoff structure and any define by setting and, for any , and . This way, is a subjective payoff structure satisfying that for every . Thus, clearly, converges to . Openness follows from a standard inductive argument: It is immediate that the set of subjective payoff structure satisfying 1-order richness is open; given that, it is immediate as well that the set of subjective payoff structure satisfying 2-order richness is open as well, and so on. ∎
Proposition 2
See 2
Proof.
We proceed by induction and verify that the correspondence is upper hemicontinuous for every . The claim holds trivially for the initial case () so we can focus on the proof of the inductive step. Fix such that the claim holds; we verify next that it also does for . Fix player , subjective payoff structure and sequence of types converging to some type , and pick strategy such that for every . For each , pick conjecture that justifies the inclusion of in . Let be a convergent subsequence of with limit . Notice then that for any and any history reachable by some profile of opponents’ strategies in we have that:
because we know from the induction hypothesis that has closed graph. This, together with the continuity of marginalization and the upper hemicontinuity of the best response correspondence allows for concluding that is a conjecture that justifies the inclusion of in . ∎
Proposition 3
Auxiliary results
Lemma 1.
Let be a set of conditional probability systems with (compact and metrizable) space of uncertainty and family of conditioning events . Consider sequence of closed subsets of and and closed such that: converges to in the Hausdorff metric, and for any , has non-empty intersection with every conditioning event . Then, the following two hold:
-
For any sequence of conditional probability systems with limit and such that for every it holds that .
-
Sequence converges to in the Hausdorff metric.
Proof.
For claim we only need to check that for any .292929That updating according conditional probability is respected in the limit is a well-established fact. Fix arbitrary and pick . Since the support correspondence is lower hemicontinuous,303030See Theorem 17.14 in Aliprantis and Border (2007), p. 563. we know that there exists a subsequence and a sequence with limit such that for any . Then, pick metric that topologizes , and let denote its corresponding Hausdorff metric. We know that and therefore, that . Since we have that and is closed, we conclude that .
For claim , proceed by contradiction and suppose that there exists some such that for any there exists some such that . Then, for any there exists some such that for any . Now, since is compact, we know that there exists a convergent subsequence with limit . We know from that , and thus, since , we reached a contradiction. ∎
Corollary 3.
If sequence of subjective payoff structures converges to subjective payoff structure , then, sequence of sets of opponents’ types converges to in the Hausdorff metric.
Proof.
We proceed by induction. Let for any subjective payoff structure . Obviously, converges to in the Hausdorff metric, and thus, we know from Lemma 1 that , where for any , converges to in the Hausdorff metric. Now, set inductively and for every and every and , and suppose that is such that converges to and converges to for any (all of them in the Hausdorff metric). Then, obviously, , defined in the obvious way, converges to in the Hausdorff metric, and therefore, we know from Lemma 1 that converges to . Obviously, it follows that converges to and hence, that converges to . ∎
Corollary 4.
The set of player ’s subjective models is closed; i.e., if pair is the limit of a sequence of subjective models , then is a subjective model too.
Proof.
Just notice that we know from the previous corollary that converges to , and thus, we know from Lemma 1 that, if for every , and converges to , then . ∎
Proof of the proposition
See 3
Proof.
We will proceed by induction and verify that the correspondence is upper hemicontinuous for every . The claim holds trivially for the initial case so we can focus on the proof of the inductive step. Fix such that the claim holds; we verify next that it also does for . Fix player , convergent sequence of subjective models with limit and strategy such that for every . For each , pick conjecture that justifies the inclusion of in . Let be a convergent subsequence of with limit . For each let denote the subjective payoff structure associated to subjective model and , the one associated to . Notice then that for any and any history we have that:
and thus, we know by finiteness of and the induction hypothesis that is closed. Then, it holds that:
where:
Notice that the last inclusion holds by virtue of being upper hemicontinuous on because of the induction hypothesis. The above, together with the continuity of marginalization and the upper hemicontinuity of the best response correspondence allows for concluding that is a conjecture that justifies the inclusion of in . ∎
Appendix B Proofs II: Theorem
Additional notation
Distinguished histories
The histories of player that might be reached when every opponent plays according to some given correspondence is of special interest and formalized as follows:
Another kind of histories that play an important role throughout the proof are those in which the player updates beliefs from scratch. For each player and conjecture , these are the histories that are considered unlikely to be reached by every history preceding them:
Notice that every conjecture is fully described by the beliefs where : The belief that corresponds to any other history is obtained via the chain rule.
Strict rationalizability
First, for each player and strategy let denote the set of player ’s strategies that are outcome-equivalent to –those strategies such that for every history . Then, player ’s set of (extensive form) strictly rationalizable strategies for subjective model is where and for each ,
Related to this, we also add the following two notational shorthands, convenient for lemmas 3 and 5 in the proof of the theorem:
-
•
For each subjective payoff structure and we denote by the set of player ’s strategies that are equivalent to strategy at every history consistent with the opponents’ playing some profile of strategies in , that is:
-
•
The types consistent with subjective payoff structure that admit strategy as via some conjecture that assigns, initially, positive probability to every history reachable by some profile of opponents’ strategies in is formalized as:
Proof of the theorem
See 1
Proof.
Fix standard payoff structure , profile of finite types and strategy profile . Then, we know from lemmas 2 and 3 below that for each :
-
(A)
There exist some and a sequence of standard models converging to such that, for any and any player , is finite, and for every ,
-
(B)
For any there exists a sequence of profiles of finite types converging to such that for each player , (as defined in Section B.1.2) for every .
Now, for each player and each set , and notice that:
-
•
is finite and included in .
-
•
.
We know then from Lemma 5 below that:
-
(C)
For any there exists some subjective model such that: for , and as (for defined in Section B.1.2),
and hence,
It follows from (A) that:
Thus, is a sequence of profiles of subjective models converging to such that for any player and ,
Now, fix and . Let be the set of those players for which . Clearly, and thus, it must hold that . Then, obviously, for every player such that , it holds that as well. Hence, an easy inductive argument enables to conclude that . ∎
Proof of Lemma 2
First perturbation of payoff structures
Let be a standard payoff structure. We first construct a perturbation that will ensure that any strategy of any player that is extensive form rationalizable given is also strictly rationalizable along the tail of the perturbation. We proceed in three steps:
-
1.
Set .
-
2.
Fix and define the following set for each player :
-
3.
For each player , define utility function:
for each terminal history and each .
Set . Clearly, is a well-defined sequence of standard payoff structures converging to . Now, let us make few comments to better motivate the need of this construction:
-
(A)
For every player , every strategy , every payoff type , and every , there exists a payoff type that mimics the payoffs of for player except for a slight improvement in the payoffs corresponding to the terminal nodes consistent with (thus breaking all possible ties in expected utility in favor of ). This payoff types are the key elements we use in Lemma 2 below to break ties and move from extensive form rationalizability to strict rationalizability.
-
(B)
With some abuse of terminology it can be considered that is a subset of . That the states of nature of the former are contained in the latter is immediate from the construction. Player ’s payoff types in are not per se contained in those in ; however, there is a natural identification between payoff types and . For our proof of Lemma 7 below we will, with some abuse of notation, leverage on this fact.
From rationalizability to strict rationalizability
The proof of the following lemma (and also of Lemma 3) will depend on the concept of finite conjecture. Given standard payoff structure we say that conjecture is finite if for every history it holds that the belief hierarchy on induced by , , is that of a finite type. Then, we have that:
Lemma 2.
Let be an extensive form and , a standard payoff structure. Then, for any player and any the following two hold:
-
For any finite type , and any strategy there exists a and a sequence of finite types converging to such that and for any .
-
There exists some such that, for any , for every type and for every .
Proof.
We proceed by induction on . The initial case hold trivially for both claims, so we can focus on the proofs of the inductive step. Suppose that is such that the claims hold; we verify next that they also hold for . We prove each claim separately:
Claim . Fix player , finite type strategy and finite conjecture that justifies the inclusion of in (whose existence is guaranteed by Lemma 6 below). Next we will base on each to construct a conditional probability system for , , that will ensure that is in for some finite type . To this end set first and next, for each , . Now, fix and history . We know from the induction hypothesis that there exists some such that for every and that for any pair in the support of the marginal of on there exists a sequence of finite types converging to and satisfying the conditions in claim for . Then, set (remember that the support of is finite):
and define, for each ,
Finiteness of guarantees that is a well-defined probability measure on . Set then .
Notice now that the following three properties hold:313131Remember that was defined in SectionB.1.1 as the set of histories in which conjecture updates beliefs from scratch.
-
(1)
For every pair of histories and , if the marginal on of assigns positive probability to , then . To see this fix history and notice first that:
Thus, for every history such that the marginal of on assigns positive probability to we necessarily have that:
and thus the claim must hold.
-
(2)
For every history there exists some such that the marginal on of assigns positive probability to . To see this, fix history . If then there is nothing needed to be proved. If then we know that there exists some such that and the marginal of on puts positive probability on . Obviously, since and we conclude that, indeed, . The fact that the marginals on of and coincide ensures that the claim is correct.
-
(3)
For every pair of histories and the marginal of on assigns zero probability to . To see it remember that we know from the induction hypothesis that . Then, it follows from property (1) above that for every , the marginal of on assigns zero probability to for every history .
We already defined for histories in . Properties and above allows for defining exclusively for every history using conditional probability. Clearly, is a well-defined conjecture. Set now type (notice the use of comment (A) in Section B.3.1), where:
Then, we have that is a finite element of , and sequence converges to . Notice that, by construction, is an element of , the induction hypotheses guarantee that at each history belief assigns probability 1 to the graph of and that, for each , at each history belief assigns probability 1 to the graph of . In addition, it is evident that . Hence, we conclude that .
Claim . Fix player and set where each verifies part of the induction hypothesis for player . Then, pick type , strategy and conjecture that justifies the inclusion of in . We know from part of the induction hypothesis that and for every . Hence, it clearly follow that justifies the inclusion of in . Now, we divide the proof of the second claim in the following three-part cycle:
-
•
for any and any . This follows immediately from part , just proved above.
-
•
There exists some such that for any player and any . Pick ; according to part of Lemma 7 we have that for any .
-
•
There exists some such that for any and any . Let denote the set of finite types in . Because of upper hemicontinuity of we know that every has some open neighborhood such that:
Then, because of denseness of we have that:
Thus, if and only if there exists some finite such that . In consequence, by finiteness of and claim there exists some such that, for every , if , then .
Thus, by setting we conclude that for any and any ,
and hence, the proof is complete. ∎
Proof of Lemma 3
Lemma 3.
Let be an extensive form. Then, for any , any player , any subjective payoff structure , any finite type and any strategy there exists a sequence of finite types in converging to .
Proof.
Fix , player , subjective payoff structure , type and strategy , and pick finite conjecture that justifies the inclusion of in (the existence of such a conjecture is guaranteed by Lemma 6 below). Based on we define next:323232Remember that was defined in SectionB.1.1 as the set of histories in which conjecture updates beliefs from scratch.
The interest of is that it will eventually allow for recovering conjecture at those histories consistent . For each history we are going to find a sequence of finitely-supported beliefs that converges to and satisfies certain desirable properties described below. For each define probability measure on by setting:
Clearly, is well-defined and finitely-supported. Now, for each denote:
and for any define belief by extending via conditional probability. Finally, let conjecture and type be respectively defined as follows:
We claim now that the following four properties are satisfied:
-
is a sequence of finite types that converges to . For finiteness simply notice that is finitely-supported for every . Convergence follows from converging to and marginalization being continuous.
-
For any the marginal on of assigns positive probability to for every history . Fix such . If , then:
If , then there must exist some history such that and , and thus:
-
For any , is a well-defined conditional probability system consistent with and . Consistency with and holds by construction; thus, all we need to check is that does not violate the chain rule. This is easy to see by simply noticing that for any pair of different histories the marginal on of puts zero probability on . This implies that for any there are no inconsistency issues arising from belief update.333333Because if then there exist such that induces and induces . Since the marginal on of (resp. ) puts zero probability on (resp. ), it follows that the marginal on of (resp. ) puts zero probability on (resp. ). Clearly, there are no problems either for any pair (due to being a conditional probability system). Since for any the marginal on of puts zero probability on for any , it follows that pairs and are not problematic either.
In addition it holds by construction that:
Thus we have that if then:
Hence belief in appropriate behavior is maintained whenever possible to do so. If then:
what guarantees belief in appropriate continuation behavior.
-
There exists some such that for any . This follows trivially from finiteness of , continuity of conditional expected payoffs and the fact that converges to .
This way, we conclude that for every . Hence, if for each we relabel as , is the sequence we are looking for. ∎
Proof of Lemma 5
Second perturbation of payoff structures
Fix some standard payoff structure . For each , each player and each let denote the subjective payoff structure constructed recursively by setting:
-
, where is some arbitrary standard payoff structure satisfying richness.
-
is defined by setting and .
-
is defined by setting and
Obviously, each is a subjective payoff structure satisfying higher-order richness, and sequence converges to .
From strictness to uniqueness: Initial step
Lemma 4.
Let be an extensive form and , a standard payoff structure. Then, for any player , any finite type , any strategy such that , there exists a finite type such that:
-
.
-
.
-
.
Proof.
For notational convenience, set and . We are going to show first that for any player , any finite type and any strategy there exists a finite type such that:
-
.
-
and are payoff-equivalent for player ’s opponents.
To see it fix player , finite type and strategy and pick player ’s payoff type that makes conditionally dominant and is payoff-equivalent to for player ’s opponents.343434Richness of allows for such a selection. Then, set . Obviously, is finite. Pick now arbitrary conjecture consistent with subjective model . By definition, for any history and any strategy such that :
Hence, and thus, we conclude that . In addition, as said above, is by construction payoff-equivalent to for player ’s opponents.
Now, to prove the claim of the lemma fix player , finite type and strategy such that and . We know then that there exists some conjecture consistent with and and such that: and for every history . Remember now that we just proved above that for any pair in the support of the marginal of on there exists some finite type such that for any player
-
•
.
-
•
is payoff-equivalent to for player ’s opponents.
Define then as follows:
for any measurable . Since and are finite it is immediate that is a well-defined probability measure in . Define next belief hierarchy:
and set . Clearly, is finite and satisfies that . Note in addition that the supports of the marginals on of and coincide, and thus, that the support of the marginal on of is contained in . It follows then that . Let’s finish verifying that:
To see it, pick arbitrary and conjecture that justifies the inclusion of in . Now, for each pair denote:
and notice that for any we have that:353535We provide further clarification of the following development. The first two equalities follow from the construction of , from initially assigning probability to the graph of and the fact that . The third one follows from the construction of . The fourth is obvious. The fifth equality follows, again, from the construction of . The last one follows from the fact that if then , because .
Now, notice two things. First, we know that for any in the support of the marginal of on and are payoff-equivalent for player ; thus, it follows from the above that and induce the same conditional expected payoffs at . Second, the marginal of on assigns positive probability to for every ; in consequence, it follows from the above that and induce the same conditional expected payoffs at for every history . Thus, we conclude that and, thus, that . ∎
From strictness to uniqueness: Inductive step
Lemma 5.
Let be an extensive form. Then for any , any player , any standard payoff structure such that for every , any finite and any strategy such that there exists a finite type such that:
-
.
-
and .
-
.
Proof.
We proceed by induction on . The initial case () is covered by Lemma 4 (the second part of property holds by vacuity) so we can focus on the proof of the inductive step. Suppose that is such that the claims hold. We verify next that they also holds for . Fix first standard payoff structure such that for every and set, for notational convenience, ; let us proceed now in two steps:
1. Parts and : Construction of the required finite type
Fix player , finite and strategy such that and . Pick conjecture that justifies the inclusion of in and the inclusion of in . Now, we know from the induction hypothesis that for any pair in the support of the marginal of on there exists some finite type such that for every player ,
-
•
.
-
•
and for and .
-
•
.
Define then probability measure by setting:
for any measurable . Finiteness of guarantees that is well-defined. Obviously, does not necessarily induce a whole conditional probability system via conditional probability (its marginal on may not have full-support); however, if we set conditional probability enables to obtain a family of beliefs . Notice that for any measurable it holds that:
for every history . Thus, for any such history we can define the conditional expected payoff induced by belief and any strategy for payoff type , which, with some abuse of notation, we represent by , and, clearly, satisfies that:
(1) |
Define now belief hierarchy:
and set . Clearly, is finite and has the same payoff type and -order beliefs as . Thus, it follows that .363636Finiteness is immediate. To see that lower-order beliefs are maintained, simply notice that the probability assigned by to is assigned by to , being the -order beliefs of exactly those of . Finally, that follows from the fact that only assigns positive probability to pairs where is in the graph of .
2. Part : Verification of convenient behavior
It remains to be checked that:
To see it fix strategy and conjecture that justifies the inclusion of in . We want to verify that . The process is slightly arduous, so let us first present a road map describing the six intermediate steps required:
-
S1.
For any we have that:
In general, and may induce different marginals on because the types to which assigns positive probability do not necessarily have a unique strategy in . However, the marginals both beliefs induce expect ’s opponents to behave analogously while each opponent finds herself in a history reachable by some strategy profile in .
-
S2.
Furthermore, for any and any we have that:
That is, despite both beliefs inducing possibly different marginals on at different stages of the game, the observation in S1 generalizes from the initial history to every history that is consistent players having chosen some strategy profile in .
-
S3.
For any pair in the graph of and any strategy we have the following outcome equivalence:
-
S4.
For any pair in the graph of and any strategy we have the following outcome equivalence:
-
S5.
For any strategy and any history :
At every history consistent players having chosen a strategy profile in player ’s type ’s expected utilities under and coincide either when playing or . Notice that this is a direct consequence of the outcome equivalences in S3 and S4, and of S2, where we saw that and induce beliefs that expect analogous behavior while each player finds herself in a history she finds reachable when opponents play according to .
-
S6.
For any history , . This follows from S5 and the fact that is optimal given and is optimal given , which in turn, induces an expected payoff similar enough to that of .
Once these six steps are verified part the lemma will follow immediately. Let’s proceed step by step:
-
S1.
First, for every and denote:
Then, notice that for any triple we have that:
-
S2.
Fix history and , and pick some . It is easy to see that : for any player and any history that precedes or coincides with we have that and thus, it holds that . Hence, and, in consequence, there must exist some such that the family is a partition of . Then, it follows from S1 that:
Remember now that puts positive probability on and thus, it follows that for any :
-
S3.
Fix pair in the graph of and . Next, pick player such that there exists some history that precedes and is reached by both and .373737 I.e., such that and . Obviously, if there exists some satisfying the first requirement, there exists some history that satisfies both. We do not have to worry about players that lack such history : they do not make any choice along the path leading to . To see that simply notice that: for every player and . Thus, we have that:
Then, we know by definition of , that . This lets us conclude that for every .
-
S4.
Fix pair in the graph of and . Next, pick player such that there exists some history that precedes and is reached by both and .383838See Footnote 37; the same logic applies here, mutatis mutandi. We verify first that . To begin, remember that we have that, for player , . Thus:
Next, since , in particular, we know that . In consequence, since and only depends on the -order specification of the directory and the type, there must exist some finite type such that .393939The possibility of finding finite follows from itself being finite. Thus, we have that:
the last inclusion being one of the conditions on in the statement of the lemma. Hence we conclude that:
and thus, by definition of , that . Obviously, the latter implies that for every .
-
S5.
Fix history . Next pick strategies such that family is a partition of . Based on the latter, set and define the two measures on induced by setting, for any ,
Notice that we know from S2 that . It follows then that for any strategy :
being the second and fourth equalities consequences of the outcome equivalences in S4 and S5 (the abuse of notation is is innocuous due to S4).
-
S6.
Proceed by contradiction and suppose that there exists some history such that . Pick then strategy that maximizes at every history and define new strategy as follows:
Let’s check next that . To do it conceive as the disjoint union of the following four components:
We distinguish then four cases:
-
. The construction of ensures that ; thus must be a maximizer of .
-
. The above, together with , implies that:404040For the first inequality notice that and are identical at , and that is a maximizer of for every . The equality follows from (1) above.
Now, we also have that:414141The first two equalities follow from S5 and the last, from equation (1) above. The inequality is a consequence of being a best response to given .
and remember that is a maximizer of . Thus, it must necessarily hold that . Hence, we conclude that is a maximizer of .
-
. Notice that the construction of on the one hand, and the facts that and and are equivalent at any history preceding , on the other, imply that . Thus, since maximizes it follows that must maximize as well.
-
For any other , clearly, we have that:
We have then reached the following contradictory conclusions: and .
-
Hence, if is a best response to for it must hold that for every , that is,
and in consequence, the proof is finally complete. ∎
Proofs of the auxiliary results
Lemma 6: Availability of finite conjectures
Lemma 6.
Let be an extensive form and , a standard payoff structure. Then for any , any player , any finite type and every strategy there exists a finite conjecture that justifies the inclusion of in .
Proof.
Fix , player , finite type and strategy and pick conjecture that justifies the inclusion of in . For convenience, let us denote . Since is separable, we ca pick countable where is dense in .
Now, for each let denote the ball of radius around . We know because of the upper hemicontinuity of for each that for each there exists some open set such that for every is in the projection of on , being the projection on of . Furthermore, for each and , the family is an open cover of , which, due to being compact, we can assume as finite: .
Now, for each and set:
Notice that for each and family is a partition of consisting of measurable sets, contained in a ball of radius . Since the set of finite types is dense in for each and , there exists some list such that, for each , the projection on of is finite and .
We turn now back to . Let us denote and then, define recursive, for each , set:
Next, we will construct a conditional probability system for each . First, for each each and each set:
for every and every . Second, set . Finally, for each define using the chain rule. Notice that the marginals on of and each coincide for every history , and this guarantees that is (or, has been) well-defined.
Now, notice also that, for each , assigns probability one to the graph of . Obviously, every is consistent with type , and sequence converges to . Thus, the upper hemicontinuity of ensures the existence of some such that for every . Hence, every where is a finite conjecture that justifies the inclusion of in ∎
Lemma 7: A partial upper hemicontinuity result
Lemma 7.
Let be an extensive form and , a standard payoff structure. Then for any , any player there exists some such that the following three hold:
-
For every type , for every and every .
-
For every type , .
-
For every player , for every .
Proof.
We proceed by induction on . The claims hold trivially for the initial case () so we can focus on the proof of the inductive step. Suppose that the claims hold for ; we verify now that they also hold for :
Claim . Fix player and set . We know from part of the induction hypothesis that for every and every ,
for every type and,
We also know from part that,
Fix now , , type , strategy and conjecture that justifies the inclusion of in . It follows from the above that and that is a conjecture that justifies the inclusion of in . Fix now and conjecture that satisfies the inclusion of in . Again, it clearly follows from the above that and that is a conjecture that satisfies the inclusion of in (see the inclusion property in comment (B) in Section B.3.1).
Claim . Fix player and set again . Fix type and strategy , and take convergent sequence of conjectures such that, for each , justifies the inclusion of in (being ). Let denote the limit of this sequence. Notice that we know from part of the induction hypothesis that:
Now, set and pick history . Notice that:424242The first implication follows from parts and of the induction hypothesis. The second is a consequence of the graph of being closed. The third and the fourth inclusions are obvious and the last one follows from part the induction hypothesis.
Given the above, upper hemicontinuity of the best response operator ensures that is a conjecture that justifies the inclusion of in .
Claim . Fix players and . It follow from part that for every (the latter has been defined in the proofs of the previous two claims). To prove the reverse inclusion simply notice that, since is finite, it follows from part that there must exist some such that:434343To better see it suppose by contradiction that for any there exists some such that . Then, we know from part that there exists some such that for every and thus, it follows from part that
for every . Hence, it follows that for every .
Thus, to finish the proof simply set . ∎
References
- Aliprantis and Border (2007) Aliprantis, Charambolos D. and Kim C. Border (2007). Infinite dimensional analysis: A hitchhiker’s guide. Springer, Berlin, 3rd edn.
- Angeletos, Hellwig and Pavan (2006) Angeletos, George-Marios, Christian Hellwig and Alessandro Pavan (2006). “Signaling in a global game: Coordination and policy traps”. Journal of Political Economy, 114, 452–484.
- Angeletos, Hellwig and Pavan (2007) Angeletos, George-Marios, Christian Hellwig and Alessandro Pavan (2007). “Dynamic global games of regime change: Learning, multiplicity and timing of attacks”. Econometrica, 75, 711–756.
- Baliga and Sjöström (2012) Baliga, Sandeep and Tjomas Sjöström (2012). “The strategy of manipulating conflict”. American Economic Review, 102, 2897–2922.
- Battigalli (1997) Battigalli, Pierpaolo (1997). “On rationalizability on extensive-form games”. Journal of Economic Theory, 74, 40–61.
- Battigalli and De Vito (2018) Battigalli, Pierpaolo and Nicodemo De Vito (2018). “Beliefs, plans, and perceived intentions in dynamic games”. IGIER Working Paper Series #629.
- Battigalli and Siniscalchi (2002) Battigalli, Pierpaolo and Marciano Siniscalchi (2002). “Strong belief and forward induction reasoning”. Journal of Economic Theory, 106, 356–391.
- Battigalli and Siniscalchi (2003) Battigalli, Pierpaolo and Marciano Siniscalchi (2003). “Rationalization and incomplete information”. The B.E. Journal of Theoretical Economics, 3, 1–46.
- Battigalli and Siniscalchi (2007) Battigalli, Pierpaolo and Marciano Siniscalchi (2007). “Interactive epistemology in games with payoff uncertainty”. Research in Economics, 61, 165–184.
- Brandenburger and Dekel (1993) Brandenburger, Adam and Eddie Dekel (1993). “Hierarchies of beliefs and common knowledge”. Journal of Economic Theory, 59, 189–198.
- Carlsson and van Damme (1993) Carlsson, Hans and Eric van Damme (1993). “Global games and equilibrium selection”. Econometrica, 61, 989–1018.
- Catonini (2019) Catonini, Emiliano (2019). “Rationalizability and epistemic priority orderings”. Games and Economic Behavior, 114, 101–117.
- Catonini (2020) Catonini, Emiliano (2020). “On non-monotonic strategic reasoning”. Games and Economic Behavior, 120, 209–224.
- Chen and Micali (2013) Chen, Jing and Silvio Micali (2013). “The order independence of iterated dominance in extensive games”. Theoretical Economics, 8, 25–63.
- Chen (2012) Chen, Yi-Chun (2012). “A structure theorem for rationalizability in the normal form of dynamic games”. Games and Economic Behavior, 75, 587–597.
- Chen, Di Tillio, Faingold and Xiong (2010) Chen, Yi-Chun, Alfredo Di Tillio, Eduardo Faingold and Siyang Xiong (2010). “Uniform topologies on types”. Theoretical Economics, 5, 445–478.
- Chen, Mueller-Frank and Pai (2020) Chen, Yi-Chun, Manuel Mueller-Frank and Mallesh M. Pai (2020). “Continuous implementation with direct revelation mechanisms”. Mimeo.
- Chen, Tillio, Faingold and Xiong (2017) Chen, Yi-Chun, Alfredo Di Tillio, Eduardo Faingold and Siyang Xiong (2017). “Characterizing the stratrategic impact of misspecified beliefs”. The Review of Economic Studies, 84, 1424–1471.
- Cho and Kassa (2017) Cho, In-Koo and Kenneth Kassa (2017). “Model averaging and persistent disagreement”. Federal Reserve Bank of St. Louis Review,, 99, 279–294.
- Chung and Ely (2007) Chung, Kim-Sau and Jeffrey C. Ely (2007). “Foundations of dominant-strategy mechanisms”. Review of Economic Studies, 74, 447–476.
- Dekel, Fudenberg and Morris (2006) Dekel, Eddie, Drew Fudenberg and Stephen Morris (2006). “Topologies on types”. Theoretical Economics, 1, 275–309.
- Dekel, Fudenberg and Morris (2007) Dekel, Eddie, Drew Fudenberg and Stephen Morris (2007). “Interim correlated rationalizability”. Theoretical Economics, 2, 15–40.
- Dekel, Lipman and Rustichini (1998) Dekel, Eddie, Barton L. Lipman and Aldo Rustichini (1998). “Standard state-space models preclude unawareness”. Econometrica, 66, 159–173.
- Dye (1985) Dye, Ronal A. (1985). “Disclosure of nonproprietary information”. Journal of Accounting Research, 23, 123–145.
- Ely and Pȩski (2011) Ely, Jeffrey and Marcin Pȩski (2011). “Critical types”. Review of Economic Studies, 78, 907–937.
- Esponda and Pouzo (2016) Esponda, Ignacio and Demian Pouzo (2016). “Berk-Nash equilibrium: A framework for modeling agents with misspecified models”. Econometrica, 84, 1093–1130.
- Fagin and Halpern (1988) Fagin, Ronald and Joseph Y Halpern (1988). Belief, awareness, and limited reasoning. Artificial intelligence, 34(1), 39–76.
- Germano, Weinstein and Zuazo-Garin (2020) Germano, Fabrizio, Jonathan Weinstein and Peio Zuazo-Garin (2020). “Uncertain rationality, depth of reasoning and robustness in games with incomplete information”. Theoretical Economics, 15, 89–122.
- Guarino (2020) Guarino, Pierfrancesco (2020). “An epistemic analysis of dynamic games with unawareness”. Games and Economic Behavior, 120, 257–288.
- Halpern and Piermont (2019) Halpern, Joseph Y and Evan Piermont (2019). Partial awareness. In Proceedings of the AAAI Conference on Artificial Intelligence, vol. 33, pp. 2851–2858.
- Halpern and Rêgo (2009) Halpern, Joseph Y. and Leandro C. Rêgo (2009). “Reasoning about knowledge of unawareness”. Games and Economic Behavior, 67, 503–525.
- Han and Kyle (2017) Han, Jungsuk and Albert S. Kyle (2017). “Speculative equilibrium with differences in higher-order beliefs. Management Science, 69, 4317–4332.
- Hansen and Sargent (2001) Hansen, Lars Peter and Thomas J. Sargent (2001). “Robust control and model uncertainty”. American Economic Review, 91, 60–66.
- Harsanyi (1967–1968) Harsanyi, John C. (1967–1968). “Games with incomplete information played by ‘Bayesian’ players, I–III”. Management Science, 14, 159–182, 320–334, 486–502.
- Heifetz and Kets (2018) Heifetz, Aviad and Willemien Kets (2018). “Robust multiplicity with a grain of naiveté. Theoretical Economics, 13, 415–465.
- Heifetz, Meier and Schipper (2006) Heifetz, Aviad, Martin Meier and Burkhard Schipper (2006). “Interactive unawareness”. Journal of Economic Theory, 130, 78–94.
- Heifetz, Meier and Schipper (2008) Heifetz, Aviad, Martin Meier and Burkhard Schipper (2008). “A canonical model for interactive unawareness”. Games and Economic Behavior, 62, 304–324.
- Heifetz, Meier and Schipper (2014) Heifetz, Aviad, Martin Meier and Burkhard Schipper (2014). “Dynamic unawareness and rationalizable behavior”. Games and Economic Behavior, 81, 50–68.
- Heifetz and Perea (2015) Heifetz, Aviad and Andrés Perea (2015). “On the outcome equivalence of backward induction and extensive form rationalizability”. International Journal of Game Theory, 44, 37–59.
- Inostroza and Pavan (2018) Inostroza, Nicolas and Alessandro Pavan (2018). “”Persuasion in global games with application to stress testing”. Mimeo.
- Jimenez-Gomez (2019) Jimenez-Gomez, David (2019). “False consensus in games: Embedding level- models into games of incomplete information”. Mimeo.
- Kennan (2001) Kennan, John (2001). “Repeated bargaining with persistent private information”. Review of Economic Studies, 68, 719–755.
- Mertens and Zamir (1985) Mertens, Jean-François and Shmuel Zamir (1985). “Formulation of Bayesian analysis for games with incomplete information”. International Journal of Game Theory, 14, 1–29.
- Modica and Rustichini (1994) Modica, Salvatore and Aldo Rustichini (1994). Awareness and partitional information structures. Theory and decision, 37(1), 107–124.
- Morris, Shin and Yildiz (2016) Morris, Stepehen, Hyung Song Shin and Muhamet Yildiz (2016). “Common belief foundations of global games”. Journal of Economic Theory, 163, 826–848.
- Morris and Shin (1998) Morris, Stephen and Hyun Song Shin (1998). “Unique Equilibrium in a Model of Self-Fulfilling Currency Attacks”. American Economic Review, 88, 587–597.
- Murayama (2020) Murayama, Kota (2020). “Robust predictions under finite depth of reasoning”. The Japanese Economic Review, 71, 59–84.
- Oury and Tercieux (2012) Oury, Marion and Olivier Tercieux (2012). “Continuous implementation”. Econometrica, 80, 1605–1637.
- Pearce (1984) Pearce, David G. (1984). “Rationalizable strategic behavior and the problem of perfection”. Econometrica, 52, 1029–1050.
- Penta (2011) Penta, Antonio (2011). “Backward induction reasoning in games with incomplete information”. Mimeo. University of Winsonsin-Madison.
- Penta (2012) Penta, Antonio (2012). “Higher order uncertainty and information: Static and dynamic games”. Econometrica, 80, 631–660.
- Penta (2015) Penta, Antonio (2015). “Robust dynamic implementation”. Journal of Economic Theory, 160, 280–316.
- Penta and Zuazo-Garin (2021) Penta, Antonio and Peio Zuazo-Garin (2021). “Rationalizability, observability and common knolwedge. Review of Economic Studies. (forthcoming).
- Perea (2014) Perea, Andrés (2014). “Belief in the opponent’s future rationality”. Games and Economic Behavior, 81, 235–254.
- Perea (2018a) Perea, Andrés (2018a). “Why forward induction leads to the backward induction outcome: A new proof for Battigalli’s theorem”. Games and Economic Behavior, 110, 120–138.
- Perea (2018b) Perea, Andrés (2018b). “Common belief in rationality in games with unawareness”. Mimeo (previous version, Epicenter Working Paper No. 13).
- Perea (2018c) Perea, Andrés (2018c). “Order independence in dynamic games”. Mimeo (previous version, Epicenter Working Paper No. 8).
- Piermont (2019) Piermont, Evan (2019). “Algebraic semantics for propositional awareness logics”. arXiv preprint arXiv:1910.07275.
- Rubinstein (1989) Rubinstein, Ariel (1989). “The electronic mail game: strategic behaviour under ‘almost common knowledge”’. American Economic Review, 79, 385–391.
- Ruiz G. (2018) Ruiz G., David (2018). “Critical types in dynamic games”. Mimeo.
- Sannikov and Skrzypack (2016) Sannikov, Yuliy and Andrzej Skrzypack (2016). “Dynamic trading: Price inertia and front-running”. Mimeo.
- Strzalecki (2014) Strzalecki, Tomasz (2014). “Depth of reasoning and higher order beliefs”. Journal of Economic Behavior and Organization, 108, 108–122. Mimeo.
- Suetonius (121) Suetonius, Gaius Tranquillus (121). “The twelve Caesars”. Pengüin Classics.
- Weinstein and Yildiz (2007) Weinstein, Jonathan and Muhamet Yildiz (2007). “A structure theorem for rationalizability with application to robust predictions of refinements”. Econometrica, 75, 365–400.
- Wilson (1987) Wilson, Robert (1987). “Game-theoretic approaches to trading proceses”. In Truman F. Bewley (editor), Advances in economic theory: Fifth world congress. Cambridge University Press.
- Ziegler (2019) Ziegler, Gabriel (2019). “Adversarial bilateral information design”. Mimeo.