Games with Overlapping Generations of Players

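Game theory was developed extensively in the 1950s by many scholars. It was explicitly applied to evolution in the 1970s, although similar developments go back at least as far as the 1930s. Game theory has been widely recognized as an important tool in many fields. As of 2020, with the Nobel Memorial Prize in Economic Sciences awarded to the game theorists Paul Milgrom and Robert B. Wilson, fifteen game theorists have won the economics Nobel Prize. John Maynard Smith was awarded the Crafoord Prize for his application of evolutionary game theory.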

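Game theory is the study of mathematical models of strategic interaction among rational agents. It has applications in all fields of social science, as well as in logic, systems science, and computer science. Originally, it addressed two-person zero-sum games, in which each participant's gains or losses are exactly balanced by those of the other participants. In the 21st century, game theory applies to a wide range of behavioral relations; it is now an umbrella term for the science of logical decision making in humans, animals, and computers.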

Crémer (1986) considered a repeated game in which overlapping generations of players live for $T$ periods, so that at each date $t$ there is one player of age $T$ who is playing his last round, one player of age $T-1$ who has two rounds still to play, and so on down to the new player who will play $T$ times. Each period, the $T$ players simultaneously choose whether to work or to shirk, and their choices are revealed at the end of the period; players share equally in the resulting output, which is an increasing function of the number who chose to work. The cost of effort exceeds a $1/T$ share of the increase in output, so shirking is a dominant strategy in the stage game, which has the flavor of a $T$-player prisoner's dilemma. Payoffs in the repeated game are the average of the per-period utilities.

Suppose that the efficient outcome is for all players to work. This outcome cannot occur in any Nash equilibrium, since the age-$T$ player will always shirk. Nevertheless, there can be equilibria where most of the players work. This will be easiest to see if we further specialize the model. Let $T=10$. Suppose that if $k$ players work the aggregate output is $2k$, and that the disutility of effort is 1. Then if preferences are linear in output and effort, the payoff to working when $k$ opponents work is $2(k+1)/10-1$, and the payoff to shirking is $2k/10$. The efficient outcome is for all players to work, with resulting utility of 1 per player.
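The stage-game payoffs in the specialized model can be checked with a few lines of exact arithmetic. The following is a minimal sketch under the assumptions of the text ($T=10$, output $2k$, effort cost 1); the function names `payoff_work` and `payoff_shirk` are illustrative and do not come from the source.

```python
from fractions import Fraction

T = 10  # number of players alive in each period, as in the specialized model


def payoff_work(k: int) -> Fraction:
    """Stage payoff from working when k opponents work: 2(k+1)/T - 1."""
    return Fraction(2 * (k + 1), T) - 1


def payoff_shirk(k: int) -> Fraction:
    """Stage payoff from shirking when k opponents work: 2k/T."""
    return Fraction(2 * k, T)


# Gain from shirking rather than working, for every possible number of
# working opponents (0 through T-1).
gains_from_shirking = [payoff_shirk(k) - payoff_work(k) for k in range(T)]

all_work = payoff_work(T - 1)   # payoff per player when everyone works
all_shirk = payoff_shirk(0)     # payoff per player when nobody works
```

Under these assumptions the gain from shirking is $1-2/T=4/5$ whatever the opponents do, which is why shirking is dominant, while the all-work payoff of 1 exceeds the all-shirk payoff of 0.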

Now consider the following strategy profile: “Age-10 players always shirk. So long as no player has ever shirked when his age is less than 10, all players of age less than 10 work. If a player has ever shirked when his age is less than 10, then all players shirk.” If all players conform to this profile, each player receives $18/10-1=4/5$ in the periods he works and $9/5$ in the period he is of age 10. Clearly, no player can gain by deviating when he is of age 10. If a player of age 9 deviates, he receives $8/5$ in the period he deviates and 0 in the next period, which is less than $4/5+9/5$; younger players lose even more by deviating. Thus, these strategies are a subgame-perfect equilibrium.
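The on-path deviation check can be repeated for every age below 10. The sketch below sums per-period payoffs over a player's remaining lifetime, which ranks alternatives the same way as the average used in the text; the helper names `conform_value` and `deviate_value` are hypothetical. It covers only deviations from the cooperative path. In the punishment phase everyone shirks regardless of history, and shirking is dominant in the stage game, so no check is needed there.

```python
from fractions import Fraction

T = 10
WORK_PAYOFF = Fraction(18, 10) - 1   # 4/5: nine players work, worker pays cost 1
LAST_PERIOD = Fraction(18, 10)       # 9/5: the age-10 player shirks, nine others work
DEVIATE_NOW = Fraction(16, 10)       # 8/5: deviator shirks, only eight players work


def conform_value(age: int) -> Fraction:
    """Total remaining payoff from following the profile, for age < T."""
    working_periods = T - age        # ages age, age+1, ..., T-1
    return working_periods * WORK_PAYOFF + LAST_PERIOD


def deviate_value(age: int) -> Fraction:
    """Total remaining payoff from shirking now; everyone shirks afterwards."""
    punishment_periods = T - age     # each of these periods yields 0
    return DEVIATE_NOW + punishment_periods * 0


# Loss from deviating at each age 1, ..., T-1.
losses = {age: conform_value(age) - deviate_value(age) for age in range(1, T)}
```

With these numbers the loss is $(10-a)\cdot 4/5+1/5$ for a player of age $a$, so it is positive at every age and grows as the deviator gets younger, in line with the remark that younger players lose even more.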

Kandori (1989b) and Smith (1989) have generalized this type of construction and provided conditions for the folk theorem to obtain.

Randomly Matched Opponents

Another variant of the repeated-games model supposes that there are many players, each of whom plays infinitely often but against a different opponent each period. More precisely, fix a two-player stage game, and suppose that there are two populations of players of equal size, $N$. Each period, every player 1 is matched with a player 2. The probability of being matched to a particular player 2 is $1/N$, and matching in each stage is independent.
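One way to picture this matching process is to draw a fresh uniform random permutation each period. This is only a sketch: the text does not say how the matching is generated, and the names `draw_matching` and `match_probability` are illustrative. The second function confirms by brute-force enumeration that a uniform permutation pairs any given player 1 with any given player 2 with probability $1/N$.

```python
import random
from fractions import Fraction
from itertools import permutations


def draw_matching(n: int, rng: random.Random) -> list[int]:
    """Return partners, where player 1 number i meets player 2 number partners[i]."""
    partners = list(range(n))
    rng.shuffle(partners)            # uniform random permutation of the player 2s
    return partners


def match_probability(n: int, i: int, j: int) -> Fraction:
    """Exact chance that player 1 number i meets player 2 number j (small n only)."""
    all_matchings = list(permutations(range(n)))
    hits = sum(1 for p in all_matchings if p[i] == j)
    return Fraction(hits, len(all_matchings))


rng = random.Random(0)               # fixed seed only for reproducibility
# Independent draws, one per period.
matchings = [draw_matching(5, rng) for _period in range(3)]
```

Calling `draw_matching` once per period with the same generator gives draws that are independent across periods, as the model requires.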

In the first analyses of this sort of random-matching model, Rosenthal (1979) and Rosenthal and Landau (1979) assumed that when the players in each pair are matched, their information consists of the actions that the two of them played in the previous period. Thus, if the stage game is the prisoner's dilemma, where C is “cooperate” and D is “defect,” there are four possible “histories” a pair of players can have, namely $(C, C)$, $(D, C)$, $(C, D)$, and $(D, D)$, and consequently each player has $2^4=16$ pure strategies. (Note that players do not have perfect recall!)

With this information structure, the strategy “cooperate if and only if my opponent cooperated last period,” or “tit for tat,” is feasible. More generally, the action a player chooses in period $t$ can have a direct effect on his opponent’s play in period $t+1$.
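The count of pure strategies, and the feasibility of tit for tat, can be made concrete by enumeration. In this sketch a history is written as (own last action, opponent's last action); that ordering is an assumption made for illustration, since the text does not fix one.

```python
from itertools import product

ACTIONS = ("C", "D")

# The four possible one-period histories, written (own last, opponent's last).
HISTORIES = list(product(ACTIONS, repeat=2))

# A pure strategy assigns one action to each history.
strategies = [
    dict(zip(HISTORIES, choice))
    for choice in product(ACTIONS, repeat=len(HISTORIES))
]

# Tit for tat: copy whatever the opponent did last period.
tit_for_tat = {history: history[1] for history in HISTORIES}
```

Here `len(strategies)` equals $2^4=16$, and `tit_for_tat` appears in the list, since it depends only on the opponent's most recent action.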

If the player expects to face the same opponent in period $t+1$ and in period $t+2$, he may anticipate an additional indirect effect of his period-$t$ action on his opponent's play in periods after $t+1$. For example, if the opponent's strategy is to cooperate only if the history is $(C, C)$, defecting in period $t$ will not only make the opponent defect in period $t+1$; it will also make the opponent defect in every period thereafter.
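The indirect effect can be traced by simulating a pair that keeps meeting each other. The sketch below assumes the opponent's history is (own last action, the player's last action) and that the pair starts from $(C, C)$; both conventions, and the function names, are illustrative rather than taken from the source.

```python
def cooperate_only_after_cc(own_last: str, other_last: str) -> str:
    """Cooperate only if last period's history was (C, C)."""
    return "C" if (own_last, other_last) == ("C", "C") else "D"


def tit_for_tat(own_last: str, other_last: str) -> str:
    """Cooperate if and only if the other player cooperated last period."""
    return other_last


def opponent_path(strategy, player_actions, initial_history=("C", "C")):
    """Opponent's actions when facing the same player every period."""
    own_last, other_last = initial_history
    path = []
    for player_action in player_actions:
        action = strategy(own_last, other_last)
        path.append(action)
        # Next period's history: the opponent's own action and the player's action.
        own_last, other_last = action, player_action
    return path


# The player defects once (third period), then returns to cooperating.
player_actions = ["C", "C", "D", "C", "C", "C"]
harsh_path = opponent_path(cooperate_only_after_cc, player_actions)
tft_path = opponent_path(tit_for_tat, player_actions)
```

Under these assumptions `harsh_path` is C, C, C, D, D, D: once the opponent has defected, the history can never return to $(C, C)$, so the single defection is punished forever. By contrast, `tft_path` is C, C, C, D, C, C, so tit for tat punishes for one period only.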
