• No results found

Reputation, incomplete information, and differences in patience in repeated games with multiple equilibria

N/A
N/A
Protected

Academic year: 2022

Share "Reputation, incomplete information, and differences in patience in repeated games with multiple equilibria"

Copied!
7
0
0

Laster.... (Se fulltekst nå)

Fulltekst

(1)

Reputation, incomplete information, and differences in patience in repeated games with multiple equilibria

Kjell Hausken

Abstract

A game with multiple equilibria and incomplete information, which allows for reputation building, is repeated infinitelymanytimes.Increasingdifferencesinpatiencecontributetoagreaterlikelihoodofcooperation.Asone playerbecomessufficientlymorepatientthantheotherplayer,bothplayersbenefit,andbothplayers'risklimits, andtheconflictbetween theplayers,decrease.

Keywords:Conflict; Risk limits; Repeated game; Discounting; Reputation; Incomplete information

Theliteraturehasproducedmixedresultsregardingwhetherplayersshouldchooseconflicttodaytoreap benefitstomorrow.First,theFolktheorem(FudenbergandMaskin,1986)isoftentakentoimplycooperationin long-termrelationships,whichiscorrectwhentheprisoner'sdilemmaisplayedrepeatedly.Second,Hausken (2005)hasforthebattleofthesexeswhereplayer1valuesthefutureandplayer2ismyopic,shownthatplayer 1prefersconflictinthepresentwhenthefutureisimportant.Similarly,SkaperdasandSyropoulos(1996)equip each agentwitha resourcewhichcan beallocated into productionversus arms.Theyshow thatincreased importanceofthefuturemayharmcooperation.Thisarticleconsidersabroadclassofgameswithmultiple equilibria,andintroducesincompleteinformationwhichallowstheplayerstobuildreputations.Theobjective ofthearticleistounderstandhowtheplayers'payoffs,risklimits,andtheconflictbetweenthemareinfluenced by different emphases on the future. The players are concerned about how the conflict between them evolves. The notion of risk limits was essential in Zeuthen's (1930) work. He originated the principle of risk dominance as a dominance relation based on comparing the various players' risk limits.1

© 2007 Elsevier B.V. All rights reserved.

(2)

ConsiderthegameinTable1wherea1≥ b1≥ t1,b2≥ a2≥ t2,a1≥ d1,b2N d2ora1N d1,b2≥ d2.2The twopure strategyequilibriaare(a1,a2) and (b1,b2).Rowplayer1prefers(a1,a2), and col umn player 2 prefers(b1,b2).3

Inthe finitely andinfinitely repeatedversions of the gamein Table1the twoNash equilibria are subgameperfect.4 In the infinitelyrepeatedgame the followingtwo strategies constitutea subgame

Table 1

Two-person two-strategy game with two equilibria

I II

I a1,a2 t1,t2

II d1,d2 b1,b2

1 SubsequentlyEllsberg(1961)discussedtheprinciplerelatedtohisparadoxes.ThereafterHarsanyi(1977)analyzedZeuthen's principle,andHarsanyiandSelten(1988:90)appliedthenotionofriskdominanceasacriterionforequilibriumselection.

2 Table1encompassesgames64–69inRapoportandGuyer's(1966:213)ordinaltaxonomy.Themostwell-knownofthesearethe BattleoftheSexes(game68),Chicken(game66),andgame69withseveralnamessuchas “LetGeorgedoit”, “Apology”, “Hero”,

Sacrificedleader.Thegames64,65,67arehybridasymmetricgames.

3 InthefinitelyandinfinitelyrepeatedversionsofthegameinTable1thetwoNashequilibriaaresubgame perfect.Inthe infinitelyrepeatedgame the followingtwostrategies constitutea subgame perfectequilibriumwith payoff(a1,a2) in each period:Player1:ChoosestrategyIwhenchallenged,unlessstrategy2waschoseninthepast,thenalwayschoosestrategyII.

Player 2: Choose strategy I unless player 1failed to choose strategy I in the past, then always choose strategy II. The justificationforthesubgameperfectequilibriumwithpayoff(b1,b2)ineachperiodisanalogous.Forthesetwosubgameperfect equilibriaoneplayeracquiresareputationforrecalcitrance,theotherforacquiescence.Oneproblemwiththesetwoequilibriais thatthereputationisnevertested.Table1isequivalenttotheprobablymostwell-knownexampleofentrydeterrence,viz.the chainstoregameonnormalform,whena1 =d1 b1 t1,b2 a2 =d2 t2,whereplayer1istheincumbent(fight=strategyI, acquiesce=strategyII)andplayer2theentrant(stayout=strategyI,enter=strategyII).BothgameshavethesametwoNash equilibria,butthechainstoregameinitsfinitelyrepeatedversionhasonlyoneuniquesubgameperfectequilibrium(provedby backwardinduction);theentrantentersandtheincumbentdoesnotfight.KrepsandWilson(1982)andMilgromandRoberts (1982)werefirsttoformalizereputationeffects,whereasmallamountofincompleteinformationcanbesufficienttoovercome Selten's(1978)chainstoreparadox.AsKrepsandWilson(1982:255)pointout,thesecondequilibrium(theentrantstaysout and the incumbent chooses the strategy fight if entry) i s imperfectand not soplausible as the first. It depends on anexpectation by the entrant of the bincumbent'sN behavior that, faced with the fait accompli of entry, would be irrationalbehavior for the bincumbentN. For text book treatments see e.g. Fudenberg and Tirole (1991:369374), Osborne andRubinstein(1994:105–106,239–243),Rasmusen(1989:85–118,2001:110,129),Wilson(1985:31–33).

4 Table1isequivalenttotheprobablymostwell-knownexampleofentrydeterrence,viz.thechainstoregameonnormalform,when a1=d1b1t1,b2a2=d2t2,whereplayer1istheincumbent(fight=strategyI,acquiesce=strategyII)andplayer2theentrant(stay out=strategyI,enter=strategyII).BothgameshavethesametwoNashequilibria,butthechainstoregameinitsfinitelyrepeated versionhasonlyoneuniquesubgameperfectequilibrium(provedbybackwardinduction);theentrantentersandtheincumbentdoes notfight.KrepsandWilson(1982)andMilgromandRoberts(1982)werefirsttoformalizereputationeffects,whereasmallamountof incompleteinformationcanbesufficienttoovercomeSelten's(1978)chainstoreparadox.AsKrepsandWilson(1982:255)pointout, thesecondequilibrium(theentrantstaysoutandtheincumbentchoosesthestrategy“fightifentry”) i s “imperfect” and“not so plausibleasthefirst.Itdependsonanexpectationbytheentrantofthe bincumbent'sN behaviorthat,facedwiththefaitaccompliof entry,wouldbeirrationalbehaviorforthebincumbentN.” Fortextbooktreatmentsseee.g.FudenbergandTirole(1991:369–374), OsborneandRubinstein(1994:105106,239243),Rasmusen(1989:85118,2001:110,129),Wilson(1985:3133).

(3)

perfect equilibrium with payoff (a1,a2) in each period: Player 1: Choose strategy I when challenged, unless strategy 2 was chosen in the past, then always choose strategy II. Player 2: Choose strategy I unless player 1 failed to choose strategy I in the past, then always choose strategy II. The justification for the subgame perfect equilibrium with payoff (b1,b2) in each period is analogous.

For these two subgame perfect equilibria one player acquires a reputation for recalcitrance, the other for acquiescence. One problem with these two equilibria is that the reputation is never tested.

Onewayaroundthisproblemistointroduceincompleteinformationsothatreputationscanbebuilt.5 Aliteratureonreputationboundshasemerged,expressingtheaveragediscountedpayoffstheplayerscan guaranteetothemselves. Thefirstsystematic treatmentwas presentedby FudenbergandLevine(1989, 1992).6Player1preferstheequilibrium(a1,a2),andgetst1inthethreatpoint,sowedefinea1asplayer 1'slowerbound.Analogously,player2preferstheequilibrium(b1,b2),andgetst2inthethreatpoint,so we define b2∞ as player2's lower bound. For players involved inreputation building Schmidt (1993) determinesforinfinitelyrepeatedgameswithconflictinginterestsandsimultaneousmovesineachperiod thetwolowerbounds

a1l¼ 1−l02dk11ðl1;d2Þ

t1þl02dk11ðl1;d2Þa1; b2l¼ 1−l01dk22ðl2;d1Þ

t2þl01dk22ðl2;d1Þb2; ð1Þ expressed as average discounted payoffs, where7

k1ðl1;d2Þ ¼ð½N1 þ1Þlnl1

lnð1−e1Þ ; k2ðl2;d1Þ ¼ð½N2 þ1Þlnl2 lnð1−e2Þ ;

N1¼lnð1−d2Þ þlnða2−t2Þ−lnðb2−t2Þ lnd2

; N2¼lnð1−d1Þ þlnðb1−t1Þ−lnða1−t1Þ lnd1

;

e1¼ð1−d2Þ2ða2−t2Þ

ðb2−t2Þ þdð½2N1þ1Þð1−d2Þ; e2¼ð1−d1Þ2ðb1−t1Þ

ða1−t1Þ þdð½1N2þ1Þð1−d1Þ;

ð2Þ

where [Ni] is the integer value ofNi, andμi0N0,μ⁎i=1−μi0N0 are the probabilities of the“normal”and

“committed” (always challenges) types, respectively, of player i, i= 1,2. Eq. (1) states that if the probability μ20

of the normal type of player 2 is close to one, and if player 1 is very patient, then the lower bound01∞for player 1 is close to his commitment payoffa1. The bounda1∞for player 1 (w.l.o.g.) is valid and reputation building has impact only when player 1 is sufficiently more patient than player 2.

5 Toallowaroleforreputationatleastoneplayermusthaveprivateinformationthatpersistsovertime,thisplayermustbe likelytotakeseveralactionsinsequence,andtheplayermustbeunabletocommitinadvancetothesequenceofactionsshe willtake(Wilson,1985;KrepsandWilson,1982).

6 SeeCelentanietal.(1996),Crippsetal.(1996),Sorin(1999),andWatson(1996)forsubsequenttreatments.

7 FortheliteratureonreputationboundstreatedsystematicallyfirstbyFudenbergandLevine(1989,1992),seee.g.Celentani et al. (1996), Cripps et al. (1996), Sorin (1999), Watson (1996). See also Fudenberg and Levine (1989, 1992), Eq. (1) correspondsinSchmidt's(1993)articleto(30)inTheorem3,Eq.(2)correspondsto(22),(17),(18),andEq.(3)correspondsto (28).(Thereisaprintingerrorin(23)whichdoesnotfollowfrom(18)and(37)).

(4)

¯ ¯

Thatis(Schmidt,1993:337),forany δ2b 1, μ1⁎N 0, ε1N 0, ∃δ11⁎,δ21)b1 s . t . f o r a n yδ1≥δ1

1⁎,δ21) t h e averagepayoffofthenormaltypeofplayer1isatleasta1, w h e r e8

Pd1ðl⁎

1;d2;e1Þ ¼ a1−t1−e1

a1−t1

1

k1ðl

1;d2Þ; Pd2ðl⁎

2;d1;e2Þ ¼ b2−t2−e2

b2−t2

1

k2ðl

2;d1Þ; ð3Þ

⁎=μ⁎

illustrated in Fig. 1 when μ1 2= 0.05 and μ1⁎ = μ2⁎=0.3, assuming (a1,a2)=(4,3), (b1,b2)=(3,4), (t1,t2)=(2,2).9 Theareawherethe boundsdonot holdissubstantial.[N1]jumpsdiscretelyfrom0to 1 w h e n δ2= 0.33,from 1 to2when δ2= 0.5,from 2to3 when δ2= 0.59,from 3to 4when δ2= 0.65,etc.,whichexplainthediscretejumpsinFig.1.

When the bounda1∞(b2∞) is valid for player 1 (2), the associated payoff to player 2 (1) compatible with (1) is

a2l¼ 1−l02dk11ðl1;d2Þ

t2þl02dk11ðl1;d2Þa2; b1l¼ 1−l01dk22ðl2;d1Þ

t1þl01dk22ðl2;d1Þb1: ð4Þ

illustrated inFig. 2forδ2= 0.2. That is, whenμ1⁎=μ⁎2= 0.3 and δ2= 0.2, player 1's discount factor must be aboveδ_1(0.3, 0.2,ε1) = 0.861519 for the reputation bounda1to be valid. The reputation bounda1

increases from a1∞= 3.44 when δ1= 0.861519 to a1∞= 3.90 when δ1= 1, which means that player 1 benefits from getting more patient. Player 2's associated payoff a2 is determined from Eq. (4), and increases froma2∞= 2.72 toa2∞= 2.95 in the same interval. In other words, as δ1becomes sufficiently

Fig. 1.δ

¯1(μ1,δ2,ε1) andδ

¯2(μ2,δ1,ε2) forμ1=μ2= 0.05 andμ1=μ2= 0.3.

8 Eq.(1)correspondsinSchmidt's(1993)articleto(30)inTheorem3,Eq.(2)correspondsto(22),(17),(18),andEq.(3)

correspondsto(28).(Thereisaprintingerrorin(23)whichdoesnotfollowfrom(18)and(37)).

9 Schmidt (1993:334)defines “conflictinginterests” sothatthe commitmentstrategy ofone player holdstheother player downto hisminmaxpayoff.Thisissatisfiedwhend2 N a2 ord1 N b1.SeeCrippsetal.(1996)for ananalysisofgamesalso withoutconflictinginterests,whichgivesweakerbounds.IthankLarrySamuelsonforadiscussionaboutthispoint.

(5)

larger than δ2, player 1 can increase his lower bounda1∞ toward his most preferred payoffa1, which also increases player 2's payoff towarda2∞. Let us formulate this as a property.

Property 1. As one player becomes sufficiently more patient than the other player, both players benefit.

Let us defineplayer 1's risk limit as r1 = (a1 − b1) / (a1 − t1) in the static game in Table 1, which reachesits minimum at r1= 0 when player1 isindifferent between the two equilibria, andreachesits maximum at r1 = 1 when player 1 is indifferent between the threat point and the non-preferred equilibrium.Intheinfinitelyrepeatedgameplayer1canguaranteea1∞ ratherthanb1tohimself.Wethus define player 1's risk limit in the infinitely repeated game by replacing player 1's least preferred equilibriumpayoff b1 witha1∞.Player 1strives towardthepayoff a1,andasthedifference betweena1 anda1∞ decreases,player1'srisklimitdecreases.Weanalogouslydefineplayer2'srisklimitasr2= (b2− a2) / (b2 − t2)inthestaticgame,replacinga2 witha2∞ intherepeatedgame.Hencewhenplayer1'slower bounda1∞ isvalid,wedefinetherisklimitsandconflictmeasure

r11l¼a1−a1l

a1−t1

¼1−l02dk11ðl1;d2Þ; r21l¼b2−a2l

b2−t2

¼1−ða2−t2Þl02dk11ðl1;d2Þ b2−t2

; cr1l¼r11lr21l; d1zPd1ðl1;d2;e1Þ

ð5Þ

Definingconflictastheproductoftheplayers'risklimitshasnotbeendoneearlierintheliterature,but Axelrod(1970) hasdefinedconflictinastaticgamesuchasTable1as

c¼ða1−b1Þðb2−a2Þ

ða1−t1Þðb2−t2Þ ð6Þ

which happens to equal the product r1r2. In a two-dimensional utility diagram for the two players, Axelrod defines conflict as the relation of the small rectangle (a1−b1)(b2−a2) of conflictful behavior to

Fig. 2. The bounda1∞and payoffa2∞as functions ofδ1_δ1(0.3,0.2,ε1) = 0.861519.

(6)

thelargerectangle(a1 − t1)(b2 − t2) ofjointdemand.Axelrod(1970:57) referstothesmallrectangle as

“theproportionofthejointdemandareawhichisinfeasible,”10 andtothelargerectangleastheareaof jointdemandspannedoutbythethreatpoint(t1,t2)andtheoutmostpointdeterminedbythebestpayoff (a1,b2) eachplayercanpossibly obtainunder hismostfavorable circumstances.

When player 2's lower boundb2∞is valid, we analogously define

r12l¼1−ðb1−t1Þl01dk22ðl2;d1Þ

a1−t1 ; r22l¼b2−b2l

b2−t2 ¼1−l01dk22ðl2;d1Þ;

cr2l¼r12lr22l; d2zPd2ðl2;d1;e2Þ: ð7Þ Eq. (5) is illustrated inFig. 3forδ2= 0.2. As player 1's patience increases beyondδ1≥δ

¯1(0.3,0.2,ε1), both risk limits and the conflict measure decrease. Increased difference between the players' emphasis on the future causes them to be more inclined to “cooperate” on the equilibrium preferred by the patient player. Conversely, as the players' emphasis on the future gets more similar, the conflict between them increases. Only the player who most successfully engages in costly reputation building in the present, which involves insisting on playing his preferred equilibrium to deter the other player from getting his preferred equilibrium, and which requires a high emphasis on the future, increases his chances to get his preferred equilibrium in the long run.

Thebenefitsof “playinghard” areprimarilyinthefutureandcanbegainedonlybychoosingconflict today.Withoutexplicitconflict inthepresentthathigherpayoffcannotbe obtained.Playerswhoplace greatervalueonthefuturearemorelikelytochooseconflictinordertoreapthosefuturebenefits.Note thesimilarity between this reasoning and that ofSkaperdas and Syropoulos (1996) andGarfinkel and Skaperdas (2000), where, despite a short-run incentive to settle a conflict, there can be long-term

10 It is the jointly infeasible expectation of an additional gain in case of a conflict.

Fig. 3. The risk limits and conflict measure whenδ1δ

¯1(0.3,0.2,ε1) = 0.861519.

“compound rewards to cheating”, or “long-term compounding rewards to going to war.” If these compounded gains are large enough then conflictful and not“cooperative”behavior is the equilibrium.

(7)

Property 2. As oneplayer becomes sufficientlymore patient than theother player,both players' risk limits, and the conflict between the players, decrease.

The Properties 1 and 2 jointly mean that both players benefit from an increasing discrepancy in the players' discount factors, which causes lower risk limits and reduced conflict. That is, increasing differences in patience contribute to a greater likelihood of cooperation.

References

Axelrod,R.,1970. Conflict of Interest. MarkhamPublishers,Chicago.

Celentani,M.,Fudenberg, D.,Levine,D.K.,Pesendorfer,W.,1996.Maintainingareputationagainstalong-livedopponent.

Econometrica 64,691704.

Cripps,M.W.,Schmidt,K.M.,Thomas,J.P.,1996.Reputationinperturbedrepeatedgames. Journal of Economic Theory 69,387– 410.

Ellsberg,D.,1961.Risk,ambiguityandthesavageaxioms. Quarterly Journal of Economics 75,643649.

Fudenberg,D.M.,Levine,E.,1989.Reputationandequilibriumselectioningameswithapatientplayer.Econometrica57,759–778.

Fudenberg,D.M.,Levine,E.,1992.Maintainingareputationwhenstrategiesareimperfectlyobserved.Review of Economic Studies 59,561579.

Fudenberg, D.M.,Maskin, E.,1986.Thefolktheoremin repeatedgameswithdiscounting orwithincompleteinformation.

Econometrica 54,533–554.

Fudenberg,D.M.,Tirole,J.,1991. Game Theory. MITPress,Cambridge.

Garfinkel, M.R., Skaperdas,S., 2000. Conflictwithout misperceptions or incompleteinformation: how the futurematters.

Journal of Conflict Resolution 44(6),793807.

Harsanyi,J.C.,1977.Rational Behavior and Bargaining Equilibrium in Games and Social Situations.Cambridge.

Harsanyi,J.C.,Selten,R.,1988.A General Theory of Equilibrium Selection in Games. MITPress,Cambridge.

Hausken,K.,2005.Thebattleofthesexeswhenthefutureisimportant. Economics Letters 87(1),8993.

Kreps,D.M.,Wilson,R.,1982.Reputationandimperfectinformation.JournalofEconomicTheory27,253–279.

Milgrom,P.,Roberts,J.,1982.Predation,reputationandentrydeterrence. Journal of Economic Theory 27,280312.

Osborne,M.J.,Rubinstein,A.,1994.ACourseinGameTheory.MITPress,Cambridge.

Rapoport,A.,Guyer,M.,1966. A Taxonomy of 2 × 2 Games, GeneralSystems11,203214.

Rasmusen,E.,1989.Games and Information.BasilBlackwell,Inc.,Cambridge.

Rasmusen,E.,2001. Games and Information. BasilBlackwell,Inc.,Cambridge.

Schmidt,K.M.,1993.Reputationandequilibriumcharacterizationinrepeatedgameswithconflictinginterests.Econometrica 61, 325–351.

Selten,R.,1978.Thechainstoreparadox. Theory and Decision 9,127159.

Skaperdas, S.,Syropoulos, C.,1996. Canthe shadowofthe futureharm cooperation?Journal of Economic Behavior and Organization 29,355372.

Sorin,S.,1999.Merging,reputation,andrepeatedgameswithincompleteinformation.GamesandEconomicBehavior29,274–

308.Watson,J.,1996.Reputationinrepeatedgameswithnodiscounting. Games and Economic Behavior 15,82109.

Wilson,R.,1985.Reputationsingamesandmarkets.In:Roth,A.E.(Ed.),Game-theoretic Models of Bargaining.Cambridge UniversityPress,Cambridge,pp.2762.

Zeuthen,F.,1930. Problems of Monopoly and Economics. Routledge,London.

Referanser

RELATERTE DOKUMENTER

Organized criminal networks operating in the fi sheries sector engage in illicit activities ranging from criminal fi shing to tax crimes, money laundering, cor- ruption,

While excellent work has been done on Jordanian jihadism - particularly on the history of the phenomenon and ideologues such as al-Maqdisi, more work needs to be done

The aims of this study were twofold: Firstly, to investigate sex differences in the acute effects of an extremely demand- ing military field exercise on explosive strength and

3.1 Evolution of costs of defence 3.1.1 Measurement unit 3.1.2 Base price index 3.2 Operating cost growth and investment cost escalation 3.3 Intra- and intergenerational operating

Scalable and flexible trust models and solutions for trust management must be found, as well as the political and policy foundation for trusting cooperating parties and

The novel figure-of-8 cable cerclage enhanced fixation stability and reduced re- displacement of the posteromedial-buttress in cephalomedullary nailing of subtrochanteric

Battle is modelled as a hierarchy of two-player zero-sum games that reflects the hierarchic decision-making in a hierarchic organisation. In each game, operational concepts and

From May to July 2004, photographer Camilla Damgård was engaged in pictorially documenting smoking behaviour both before and after the new law came into effect on June 1, 2004..