Posturing in Bargaining to In uence Outsiders...proposal game that ends after a (possibly in nite) deadline is reached. The set of feasible allocations depends on the state of the

Posturing in Bargaining to Influence Outsiders

Andres Perlroth∗

Job Market Paper

January 1, 2020Please click here for the latest version.

Abstract

This paper studies the effect of transparency in negotiations in the presence of outsiders.

I consider a game in which two players (negotiators) engage in bargaining in the presence

of a third player (outsider) who also cares about the outcome of the negotiation. As the

bargaining progresses, the outsider takes a single irreversible action that brings him a payoff

that depends on the allocation that the negotiators will agree to implement. The set of feasible

allocations depends on the state of the world which is known to the negotiators but not to the

outsider. The negotiators care about the bargaining outcome and have misaligned incentives

regarding the outsider’s decisions. When proposals are public (transparent negotiation), there

is an equilibrium where negotiators engage in posturing behavior as a tactic to distort the

outsider’s action. This behavior reduces efficiency because the outsider makes a suboptimal

decision, and sometimes, there is delay in the negotiation. Moreover, I show that these

inefficiencies arise for a general class of equilibria. On the other hand, when proposals are

private (non-transparent negotiation), the negotiators agree immediately, and the terms of

the agreement are highly informative for the outsider. In this situation, I show that every

equilibrium is approximately efficient.

∗Stanford GSB. Email: [email protected]. I am grateful to Andy Skrzypacz, Mike Ostrovsky and RobertWilson, for guidance and continuous help. I thank Kyle Bagwell, Anirudha Balasubramanian, Alex Bloedel,Jeremy Bulow, Gabe Carroll, Daniel Chen, Matt Jackson, Fuhito Kojima, Yucheng Liang, Bar Light, Suraj Mal-ladi, Deborah Melnick, Paul Milgrom, Ilya Morozov, Ashutosh Thakur and participants at the Theory LunchSeminar at Stanford for helpful comments. All errors are mine.

https://www.gsb.stanford.edu/sites/gsb/files/phd-job-market-paper/draft-nov24_0.pdf

mailto:[email protected]

1 Introduction

Many important decisions related to social policy and trade agreements are made through nego-

tiations. One dimension along which these negotiations vary is how information is revealed to

parties that are not directly involved in the negotiation. For instance, historically, while most trade

agreement negotiations were conducted behind closed doors, nowadays there is large variation in

the transparency protocols countries adopt (Marceddu, 2018). On the one hand, the recent negoti-

ations between US and Japan over their trade agreement were conducted behind closed doors.1 By

contrast, the European Union (EU) and Australia (UK) conducted their trade negotiations with a

high level of transparency (Drake-Brockman and Messerlin, 2018).

It has become a common trend among scholars and activists to recommend that such negoti-

ations be conducted transparently.2 They argue that transparency gives legitimacy to the nego-

tiations and facilitates accountability of the officials, mitigating agency problems between them

and their constituents (Stasavage, 2004). My paper challenges this presumption by showing that

transparency can be undesirable. When their actions are transparent, negotiators may posture to

influence outsiders, i.e., the parties not directly involved in the transparent negotiation.

In practice, trade agreements often feature an outsider who must make a payoff-relevant decision

on the basis of the negotiation. For example, consider the recent US-Japan trade agreement. Over

the course of the negotiation, car manufacturers in both countries wanted to know whether car-

tariffs would even be an issue covered in this agreement.3 Japanese manufacturers feared the

imposition of high tariffs and would plan to dial back car production in such a scenario. On

the other hand, American manufacturers would benefit from high tariffs to promote domestic

business.4Similarly, in the EU-Australia trade negotiations, the discussion over agricultural tariffs

is an important concern for Australian and European farmers.5

1 Source: https://www.politico.com/story/2019/08/25/us-japan-trade-deal-g7-1474461.2For instance, during the Trans-Pacific Partnership Agreement (TPP) talks, over 30 legal academics wrote a

letter to the United States Trade Representative, Ron Kirk, about their profound concern and disappointment atthe lack of public participation, transparency and open government processes in the negotiation of the intellectualproperty chapter of the Trans-Pacific Partnership Agreement (TPP).

Source: http://infojustice.org/archives/21137.)3“Washington may unveil tariffs on vehicles and automotive parts in the next few months that could have

broad repercussions for Japan’s massive automotive sector.”Source: https://worldview.stratfor.com/article/japans-auto-sector-poised-weather-us-tariff-storm.4“Japan, under Prime Minister Shinzo Abe, wants to head off Trumps threatened penalties on auto exports,

which could tip it into recession, as well as any currency clause directed at the Japanese yen.” (April 2019, For-tune).

5“The EU is notoriously sensitive about maintaining its already significant market protections for farmers.It has already warned Australia that it’s not keen on opening market access to beef, sheepmeat, rice, sugar or

1

https://www.politico.com/story/2019/08/25/us-japan-trade-deal-g7-1474461

http://infojustice.org/archives/21137

https://worldview.stratfor.com/article/japans-auto-sector-poised-weather-us-tariff-storm

To illustrate how the presence of outsiders affects the negotiation, I consider a simplified model.

Specifically, I consider a bargaining game with three players: two negotiators, A and B, and one

outsider O. The negotiators bargain over which allocation to implement and over the division of

the surplus generated by its implementation. The bargaining procedure is a one-sided repeated

proposal game that ends after a (possibly infinite) deadline is reached. The set of feasible allocations

depends on the state of the world. In the constrained state, the only implementable allocation is the

constrained policy in which the scope of the agreement must include the issues that are relevant

for the outsider (e.g. car tariffs). In the unconstrained state, it also feasible to implement the

unconstrained policy in which the agreement terms do not include such issues. The negotiators

know the state but the outsider does not. The outsider decides when and what irreversible action

to take. His payoff depends on the time and kind of action taken, and on the allocation that

negotiators agree to implement. At every period, the outsider trades-off taking the irreversible

action or waiting in the hope of obtaining more information afterwards. The negotiators have

misaligned incentives regarding the outsider’s action. The proposing negotiator A prefers a high

action: the action the outsider O takes when he believes that the unconstrained allocation will

be implemented. The other negotiator B prefers a low action: the action the outsider O takes

when he believes that the constrained allocation will be implemented. Throughout the paper I

assume that the unconstrained-state negotiation gains to A and B are sufficiently larger than their

payoffs arising from O’s action (so that they would not sacrifice the negotiations just to influence

the outsider, at least in the unconstrained state).

The main contribution of the paper is to compare welfare between transparent and non-

transparent negotiations. I define a transparent negotiation as one where the outsider observes all

the proposals that are made, and I define a non-transparent negotiation as one where the outsider

only observes the agreement terms of the negotiation after it has been completed. In negotiations

where A could make proposals frequently, it is typically more efficient to make the negotiation

non-transparent. More precisely, I show that (under standard off-path belief refinements) in every

equilibrium of the non-transparent negotiation game, there is an immediate agreement, and the

outsider takes the informed decision after waiting one period. By contrast, in the transparent

negotiation game, an inefficient equilibrium exists: negotiators engage in posturing behavior as a

tactic to distort the outsider’s action. As a consequence, the outsider makes a suboptimal action.

Furthermore, posturing may lead to a delay in the bargaining, which persists even as the frequency

dairy.”Source: https://www.farmonline.com.au/story/6380868/

eu-deal-is-dead-if-ag-loses-out-birmingham/.

2

https://www.farmonline.com.au/story/6380868/eu-deal-is-dead-if-ag-loses-out-birmingham/

https://www.farmonline.com.au/story/6380868/eu-deal-is-dead-if-ag-loses-out-birmingham/

of offers increases without bound. In contrast to most of the bargaining literature where inefficient

delay arises due to asymmetric information amongst bargainers, in this paper, delay arises because

bargainers want to influence third party outsiders.

To get an intuition for my results, suppose that the negotiation has an immediate deadline

(T = 0). The order of moves is that A makes a proposal, O decides whether to act immediately or

wait, B either accepts A’s proposal or rejects (and the gains from the negotiation are destroyed),

finally if O has not acted yet, he can take the action incurring a cost for the delay. In the scenario

where the negotiation fails, O believes that the unconstrained allocation will be later implemented

when such policy is feasible; otherwise, the constrained allocation will be implemented.6

When the negotiation is non-transparent, I assert that an equilibrium exists where the ne-

gotiators reach an immediate agreement. Indeed, suppose that the outsider O believes that in

the event of a rejection the constrained policy will be later implemented.7 In the unconstrained

state, the assumption that the gains from negotiation are large implies that A has a proposal such

that (i) B prefers to accept rather than reject and influence O to take B’s preferred action, and

(ii) if the proposal is accepted, the unconstrained policy is implemented. This shows that in the

unconstrained-state, A and B reach an agreement which is informative for O. In the constrained

state, A does not want to make an infeasible proposal since O would take his non-preferred action.

Hence, A makes a constrained-allocation proposal, B accepts, O takes the low action. Therefore,

O prefers to wait until the negotiation reaches an agreement rather than taking the action while

A and B are still negotiating. I conclude that under non-transparency, bargaining is efficient, and

O takes an optimal action with a delay that vanishes as the one-round delay costs are small.

Now suppose that there is an information leak and O somehow is able to observe the proposals

when A and B are privately negotiating (and hence behaving as specified above). In this situation,

O immediately learns the underlying state by observing A’s proposal; therefore, he takes the

optimal action without waiting to observe the final agreement. In this hypothetical situation,

bargaining is efficient and the outsider takes an optimal action without delay. However, if A knows

that O will observe the proposals, he can influence O to make A’s preferable action by making the

unconstrained-allocation proposal even when such allocation is not feasible. Accordingly, when A’s

payoff related to O’s action is larger than the negotiation gains by implementing the constrained

6The outsider believes that in a future negotiation, A and B will avoid to include in the scope of the negoti-ation hard issues from the previous negotiation (e.g. car tariffs), so long as it is feasible to reach an agreementwithout them.

7I show in the paper that this is the only belief consistent with the forward induction refinement, an extensionof the intuitive criterion for multi-sender games.

3

policy, A prefers to influence O at the cost of not reaching an agreement in the constrained-state.

From the previous paragraph, we conclude that when the negotiation is transparent and the

outsider’s action is sufficiently relevant for A, the equilibrium proposals cannot be fully revealing. I

construct a posturing equilibrium where A postures when the state is constrained by proposing, with

positive probability, the unconstrained-allocation proposal even though it is not feasible. Figure 1

describes the posturing probability as a function of the prior probability the state is unconstrained

(p). Notice that when p is greater than a threshold µA, there is strong posturing: A proposes the

unconstrained-allocation proposal with probability one. When p ≥ µA, there is weak posturing: A

mixes between proposing the unconstrained-allocation proposal and proposing a feasible proposal

that reveals the state. In the equilibrium, O does not benefit by waiting a period: B would

reject the unconstrained-allocation proposal as a way to influence O to believe that the only the

constrained policy is feasible. Consequently, O prefers to take a suboptimal action after observing

the proposal. After observing O to take the irreversible action, B accepts every feasible proposal.

Therefore, under transparency, there are two sources of inefficiencies: breakdown of negotiations

because A makes a non-feasible proposal (see Figure 1), and a suboptimal action from the outsider.

Furthermore, I show that these inefficiencies do not vanish as the frequency of the outsider’s actions

increases.

Prior belief pµA

Prob. A makes

unfeasible proposal

Prob. reach

agreement

10

1

Figure 1: Inefficiencies in the posturing equilibrium.

In the paper, I analyze the robustness of the previous intuition by looking at a larger class

of equilibria and considering a general deadline T . When the negotiation is transparent, I show

that under some technical conditions a posturing equilibrium exists. Depending on the deadline

4

T and the prior p, posturing can be strong or weak. When there is an extended deadline T or

the prior p is large, posturing is strong: negotiators A and B do not reveal any information to

the outsider O until he takes the action. In this situation, the outsider takes the uninformed

action immediately and, in the next period, negotiators reach an efficient agreement. Hence, with

frequent offers, bargaining is approximately efficient but the outsider takes a suboptimal action.

When T and p are small, there is weak posturing: Players A and O engage in a war of attrition. In

the constrained-state, A mixes between making an unfeasible unconstrained-allocation proposal or

revealing the underlying state by making a feasible constrained-allocation proposal. The outsider

O, on the other hand, mixes between taking the action or waiting to learn by observing from the

future proposals. Thus, with positive probability the outsider takes a suboptimal action. In this

case, the bargaining-inefficiencies do not vanish as the frequency of offers increases. Therefore,

when the frequency of offers increases, the weak posturing equilibrium shares three sources of

inefficiencies: suboptimal decision from the outsider, delay of the bargaining and breakdown of the

negotiation. I complement the analysis by showing that, under some technical conditions, every

equilibrium is inefficient as the frequency of offers increases.

When the negotiation is non-transparent, I show that a separating equilibrium always exists.

In the equilibrium, negotiators immediately agree, and hence, the outsider takes the full informed

action by waiting for one period. Thus, bargaining is efficient; O takes an optimal decision with

a one period delay. In particular, when the frequency of action increases, the outsider’s decision

is approximately optimal. Furthermore, I show that, under certain conditions, the equilibrium

outcome is unique.

My results share a concern over transparency that applies to a broader class of negotiations

beyond trade agreement negotiations. For instance, such concern may arise in a negotiation between

a labor union and managers, where the scope of the negotiation may include the implementation of

new technology for the company. While managers want suppliers to believe that the new technology

will be implemented, the labor union prefers the other scenario since the new technology would

signify a reduction of the labor force. One example of such conflict is the recent negotiation between

General Motors and the Union of Automobile Workers, which was centered around the company

transitioning to electric vehicle production. Managers were taking a hard position over reopening

shuttered car factories, and hence, influencing Workhorse Group Inc. (the outsider) to invest in

electric vehicle production for future supply to GM.8

8For a description of the timeline of the negotiation see https://www.reuters.com/article/

usa-autos-labor-events/timeline-uaw-members-ratify-new-labor-deal-with-gm-to-end-strike-idUSL2N27A0PF.

5

https://www.reuters.com/article/usa-autos-labor-events/timeline-uaw-members-ratify-new-labor-deal-with-gm-to-end-strike-idUSL2N27A0PF

https://www.reuters.com/article/usa-autos-labor-events/timeline-uaw-members-ratify-new-labor-deal-with-gm-to-end-strike-idUSL2N27A0PF

The paper is organized as follows. Section 2 discusses related literature. Section 3 describes the

model. Section 4 analysis the transparent negotiation game. Section 5 analyzes the game when

the negotiation is non-transparent. Section 6 concludes. All omitted proofs are in the Appendix.

2 Literature Review

The effects of transparency in bargaining have been studied in situations where representatives

(bargainers) and constituents (outsiders) have conflicting goals. Perry and Samuelson (1994) con-

sider a two-period bargaining game where one of the two bargainers represents a constituency who

may intervene and terminate negotiations. Because bargainers may prevent such intervention by

reaching immediate agreement, the level of transparency only affects allocation but not efficiency.

Stasavage (2004) consider a static bargaining game in which bargainers care about the outcome and

also about their constituents beliefs over their conflicting interests. He shows that transparency is

preferable when bargainers’ conflicting interests are strong. His result contrasts with mine because,

in my model, influencing the outsider affects the gains from the negotiation. In a similar model,

Fingleton and Raith (2005) consider a static bargaining model in which bargainers are motivated

by a (pure) career concern motive. Their result shares a similar concern regarding transparency,

although for a different reason. In my model, bargainers care about the outcome of the negotiations

and what the outsiders expect from the negotiations. In Fingleton and Raith (2005), bargainers

only care about the third parties’ perception regarding their bargaining skills.9

Beyond the political economy applications, Chaves (2019) analysis the role of transparency in

a bargaining game between a seller and a privately informed buyer that may be interrupted by

the entry of new buyers. He shows that under transparency, bargaining ends faster because the

seller’s lack of commitment makes him lower prices quickly in order to increase competition among

buyers. This contrasts with my model where bargaining ends faster under non-transparency. The

difference arises because in my model the Coasean force does not appear, and because bargainers

have stronger incentives to influence outsiders.10

My work also connects with the recent literature on price transparency and market performance

(Horner and Vieille, 2009; Kaya and Liu, 2015; Fuchs, Ory, and Skrzypacz, 2016). Those models

9A critical assumption in their model is that the outsiders must have private information about the bargaining.Otherwise, their perception about the bargaining skills does not depend on the transparency protocol.

10In Chaves’ model, when the seller has complete information, the bargainers behave without considering theentry of future buyers (outsiders). In my model, where bargainers have complete information, they still prefer todistort the bargaining outcome to influence the outsider decision.

6

consist of a long-run informed player facing a sequence of short-run uninformed players. In these

papers, the authors conclude that transparency is undesirable from a welfare point of view. The

driving force among these works is that the long-run player may want to delay trade to increase

his continuation payoff upon rejection inefficiently. My model shares the concern for transparency

but differs in that it studies a dynamic game where all players are long-lived. With this, I uncover

novel dynamics of posturing that arise when the negotiation is transparent.

Beyond the transparency agenda, my paper contributes to the bargaining literature in two

areas. First, to understand inefficient delay in bargaining without asymmetric information among

bargainers. With complete information, delay may occur when one of the parties can strike during

the course of the negotiation (Fernandez and Glazer, 1991), when bargainers make simultaneous

offers (Perry and Reny, 1993; Sakovics, 1993), and when bargainers are optimistic about their

bargaining power (Yildiz, 2004). My model adds to this literature by showing that delay may occur

because bargainers may want to influence a third party not involved in the negotiation. Second, the

paper contributes to the literature that rationalizes posturing behavior in negotiations. Following

the seminal work of Kreps and Wilson (1982) in reputation models, Abreu and Gul (2000); Kambe

(1999) show that posturing occurs when bargainers may be stubborn and demand a particular

allocation in the bargaining. Chatterjee and Samuelson (1987) shows that this behavior can happen

when both parties have incomplete information about their opponents’ valuation.

3 Model

The model has three players, two negotiators A and B, and an outsider O. Players A and B bargain

over which policy to implement and how to divide the gains generated from its implementation.

Player O has to make an irreversible decision z ∈ [0, 1].

The set of implementable policies depends on the state of the world Θ ∈ {Θ0,Θ1}. When the

state is Θ0, the only allocation that is feasible to implement is the constrained policy θ0. When the

state is Θ1, it is feasible to implement the unconstrained policy θ1 and also the constrained policy θ0.

Let π(θ) the gains from implementing allocation θ. I assume that π(θ0) ≤ π(θ1). Further, I consider

that Θ is known to Players A and B. Player O assesses a prior probability P[Θ = Θ1] = p.11

Time is discrete t ∈ {0,Λ, 2Λ, . . . , T, T + Λ, . . . , }, where T ≤ ∞ is the deadline for the negoti-

ation. 12 For a period t ≤ T the stage game is described in Figure 2. This is formally represented

11I ignore informational asymmetries among the negotiators to keep a tractable model that cleanly describesthe effect outsiders can have on negotiations.

12For notational simplicity I consider values for Λ and T such that TΛ ∈ N.

7

in three stages:

(t0) Player A proposes (θt, xAt ) ∈ {θ0, θ1} × [0, π(θt)].

(t1) Player O decides whether to take the irreversible action zt ∈ [0, 1] or to wait (w).

(t2) Player B decides whether to accept (a) xAt or to reject (r). If Θ = Θ0 and θt = θ1, Player B

has to reject the proposal.

A proposes ( "#$, &$') ∈ #*, #+ ×[0, /( "#$)] O waits (1) B rejects 2

O decides 3$ ∈ 0,1 ,and leaves

B accepts (5),negotiation ends

Figure 2: Stage game at period t ≤ T .

For t > T , Player O is the only one moving (with the same action space). In this case, the status

quo allocation is θt = ψ(Θ), where

ψ(Θ) =

θ0 if Θ = Θ0

θ1 if Θ = Θ1

.

Thus, if there is a breakdown of the negotiation, the outsider believes that in a future negotiation

the allocation θ1 will be implemented so long as it is feasible.13

I consider that proposals are observable for the outsider making the negotiation transparent.

I latter analyze the case when proposals are non-observable, that is, when the negotiation is non-

transparent.

Let τa = inf0≤t≤T

{Player B plays a at time t}, the time when the negotiation ends, and τz =

inft≥0{Player O plays z ∈ [0, 1] a time t}, the time where Player O takes the irreversible decision.14

The payoffs for each player are

uA = 1{τa<∞}e−rτaxAτa + 1{τz<∞}e

−rτzVA(zτz) where VA : [0, 1]→ R+

uB = 1{τa<∞}e−rτa(θt − xAτa) + 1{τz<∞}e

−rτzVB(zτz) where VB : [0, 1]→ R+,

uO = + 1{τz<∞}e−rτzZ(zτz , θt) where Z : [0, 1]× {θ0, θ1} → R++,

13The results remain unaltered if the outsiders believes that the allocation θ0 will be implemented in case ofbreakdown of negotiations.

14I use the convention inf ∅ =∞.

8

where r > 0 is the common discount rate.

I refer to the functions VA and VB as the externalities functions that Player O induces on each

respective negotiator.

Notice that the payoffs related to Player O’s decision are independent of the outcome of the

negotiation. Hence, if Player O were not strategic and played a fixed strategy, Players A and B

would behave in the negotiation as if no outsider was participating in the game. This non-linkage

in the payoff structure implies that any effect that the outsider could have on the negotiation is

due to informational considerations. Besides, these payoffs are nonnegative. Thus, players prefer

to receive their payoffs earlier in time. Consequently, if there is delay in the game, it comes from

the strategic considerations that the players are inducing on each other.

I assume that the outsider’s optimal action, z∗(µ) = arg maxz∈[0,1] Eµ[Z(z, θ)], is strictly mono-

tone in the belief µ = P[θ = θ1]. Given this monotonicity, without loss of generality, I consider

that z∗(µ) = µ.

Let Z(µ) := Eµ[Z(µ, θ)] the expected payoff for Player O when he takes the optimal action

given the belief µ that θ1 is implemented. I assume that Z is strictly convex, implying that Player

O is information-seeking. I restrict my attention to the case where Z(p) < e−rΛEµ[Z(1{θ=θ1}, θ)].

This assumption means that there is a value for learning: the outsider prefers to wait a period

and learn which allocation is implemented θ rather than taking an immediate action using only his

prior information. For technical reasons, I assume that Z is a continuously differentiable function.

In this paper, I focus on the case where negotiators have misaligned incentives regarding the

outsider’s decision. More precisely, I assume that VA is an increasing function, and VB is a de-

creasing function. Thus, Player A is better off when the outsider believes that the final allocation

is θ1. By contrast, Player B is better off when the outsider believes that the final allocation is θ0.

I define the size of the externalities by

∆VA = VA(1)− VA(0) and ∆VB = VB(0)− VB(1).

I assume that π(θ1) > max{∆VA,∆VB}. That is, the bargaining component of the game, in the

high state, is large enough in comparison with the externalities. This allows me to reduce the set

of equilibria and obtain sharper results. I further assume that VA, VB are differentiable functions.

9

Information structure, strategies, beliefs and solution concept

All actions are observable. Thus, a history htA ∈ H tA for Player A consists of the realization of Θ

and the actions that every player took before t0. A history htB ∈ H tB for Player B consists of the

realization of Θ and the actions that every player took before t2. A history htO ∈ H tO for Player O

consists of the actions that every Player took before t1.

A behavioral strategy for Player A is a sequence of functions σA = (σΘ,tA )t=0,...,T,Θ∈{Θ0,Θ1}, such

that σΘ,tA (htA) ∈ P

({(θ, x) ∈ {θ0, θ1} × [0, π(θt)]| x ≤ π(θ)

})for every htA ∈ H t

A that contains Θ.15

A behavioral strategy for Player B is a sequence of functions σB = (σΘ,tB )t=0,...,T,Θ∈{Θ0,Θ1}, such that

σΘ,tB (htB) ∈ P({a, r}) for every history htB ∈ H t

B that contains Θ. A behavioral strategy for Player

O is a sequence of functions σO = (σtO)t≥0, such that σtO(htO) ∈ P([0, 1] ∪ {w}).Given the information structure, a belief system for Player O is a probability distribution

µ = (µt)t≥0 over {Θ0,Θ1} such that P[Θ = Θ1|htO] is µt(htO) ∈ [0, 1].

For Player i ∈ {A,B,O}, I denote by uti(σ|µ, hti) the expected continuation payoff, at time

t, given hti and strategies σ = (σA, σB, σO) and the belief µ of Player O.16 An assessment (σ, µ)

is sequentially rational if for every Player i, any history hti, and any strategy σ′i, ui(σ|µ, hti) ≥ui(σ

′i, σ−i|µ, hti). An assessment (σ, µ) is a perfect Bayesian equilibrium (PBE) if it is sequentially

rational and beliefs follow Bayes’ rule wherever is possible (on and off the path of play).

Preliminaries

The assumption that Z > 0 means that Player O strictly prefers to take a particular action

earlier (to avoid discounting costs). Because after the negotiation deadline the information that

the outsider has about the state Θ does not change, he prefers to take the action immediately.

Therefore, in equilibrium, τı ≤ τa + Λ.17

The following result shows the role that private information plays in the model.

Lemma 1 Suppose that Player O knows the state Θ. Then, there is an equilibrium where Players

A and B strategies are the same as the equilibrium strategies in a game without an outsider. There

is immediate agreement in the negotiation (τa = 0). Player A extracts all the gains from the

negotiation (θτa = ψ(Θ), xAτa = π(θτa)). Furthermore, if T <∞ or Z(1) ≥ Z(0), the equilibrium is

unique.

15I denote by P(X), the space of probability measures given the Borel σ-algebra on X.16Because Players A and B know the state Θ, their belief system places probability one on Θ at every history

of the game.17In particular, τı ≤ T + Λ. For this reason, I describe the strategies and beliefs only until period T + Λ.

10

Lemma 1 follows from the non-linkage in payoffs between the outsider’s action and the pay-

offs from the negotiation. When Player O knows the state Θ, there is an equilibrium where he

immediately plays z = 1{Θ=Θ1}. The payoffs related to the Player O’s decision, VA, VB, Z, accrue

at that moment. Hence, Player A and B approach the negotiation without taking into account

the presence of the outsider. Therefore, in my model, where Player O does not know Θ, any dif-

ference in the outcome of the negotiation (e.g. allocation and efficiency) is due to informational

considerations.

Given the signaling nature of the game, the set of PBE is potentially large. To address this

issue, I develop equilibrium refinements that impose natural constraints on off-path beliefs. A

reader not interested in the technical details of these refinements may skip the next subsection.

3.1 Equilibrium Refinements

I impose two kinds of refinements. The first refinement is a version of forward induction. It

restricts the belief at an information set htO to assess positive probability only to the states of

the world that are relevant for that information set. That is, the states of the world that achieve

the information set for some sequentially rational strategies given an arbitrary belief at htO. This

refinement generalizes the intuitive criterion to a multi-period game with multiple senders. The

second refinement imposes a monotonicity condition on the beliefs after observing proposals.

Forward induction

To formally define the refinement, I first define a perturbed assessment at htO given an initial belief

system µ.

Definition 1 ((µ, µ, htO)-assessment) Given µ ∈ [0, 1], a belief system µ, and an information

set htO, a (µ, µ, htO)-assessment (σ(µ,htO), µ(µ,htO)) is an assessment where the strategies σ(µ,htO) are

sequentially rational to the belief system

µ(µ,htO)t (htO) =

µ if htO = htO

µt(htO) if htO 6= htO.

The (µ, µ, htO)-assessment contains sequentially rational strategies to beliefs that are a modifi-

cation to the original belief system µ at htO (where the belief is µ instead of µt(htO)).

11

Given an information set ht+ΛO = htO∪{y, (θ, x)}, where (y, (θ, x)) ∈ {a, r}×

({θ0, θ1}×[0, π(θ)]

),

and an assessment (σ, µ), I define the set

D(Θ, (y, (θ, x))|µ, htO) = {µ ∈ [0, 1]| there exists a (µ, µ, htO)-assessment such that

{y, (θ, x)} happens with positive probability given htO and that the state is Θ}

This set contains all the plausible beliefs that Player O could have at htO ∪ {y, (θ, x)} so that the

event (y, (θ, x)) happens with positive probability for strategies that are sequentially rational to

these plausible belief, given htO and that the state of the world is Θ.

Definition 2 (Relevant information set) Given a beliefs system µ, a state Θ is relevant to

ht+ΛO = htO ∪ {y, (θ, x)} if the set D(Θ, (y, (θ, x))|htO, µ) is not empty.

In words, a state Θ is relevant for htO if observing htO when the state is Θ can be explained by

(sequential) rational behavior of the players.

Definition 3 (Forward induction) A PBE satisfies forward induction if for every information

set htO, every element of the support of the belief µt(htO) is relevant for htO.

Govindan and Wilson (2009)[GW] present another version of forward induction, which differs

in the definition of the relevant information set. In GW, a state Θ is relevant for htO if a weak

sequential equilibrium exists such htO happens with positive probability when the state is Θ. 18 My

refinement only requires the existence of some belief at htO that can induce a sequential rational

behavior so that htO happens with positive probability when the state is Θ. This difference parallels

the difference between the solution concepts PBE and sequential equilibrium. The first one does

not impose a rational foundation to off-path beliefs. A sequential equilibrium requires that the

off-path beliefs have to be consistent (i.e., being a limit of a sequence of beliefs that are following

Bayes’ rule).

To see the usefulness of the refinement, consider a one-period version of the game. By construc-

tion, the proposal (θ1, π(θ1) −∆VB − ε) is accepted by Player B independently of the belief that

Player O could have after observing a rejection. In turn, Player A prefers to make such proposal

over proposal that will be later rejected. Thus, the state Θ1 is not relevant for history in which

there is rejection. Hence, if Player O observes that Player B rejects the proposal, the forward

induction refinement imposes that belief to be zero.

18Weak sequential equilibrium differs from a sequential equilibrium by requiring sequential rationality over thepath of the game instead of every information set (See GW for a detailed definition of the concepts).

12

Monotone Belief

Definition 4 (Monotone Belief) A PBE of the game satisfies the monotone belief refinement

if for every history htO, we have that

1. µt+Λ(htO ∪ {r, (θ, x)}) is non-decreasing in x ∈ [0, π(θ)].

2. µt+Λ(htO ∪ {r, (θ1, x)}) ≥ µt+Λ(htO ∪ {r, (θ0, x)}) for every x ∈ [0, π(θ0)].

Intuitively, if Player O observes a higher proposal, he should not think that gains from the negoti-

ations are smaller than when he observes a lower proposal. The refinement imposes a condition on

beliefs on-path and off-path. However, my results do not change if the monotonicity is restricted

only to off-path beliefs.

4 Posturing Equilibrium

In this section, I construct an equilibrium of the game that exhibits posturing. In the equilibrium,

Player A postures by proposing (θ1, π(θ1)) when the state is Θ0 even though the proposal is not

feasible. Similarly, Player B postures by rejecting (θ1, π(θ1)) when the state is Θ1 even though he

will later accept (θ1, π(θ1)) (after Player O takes the action). I then provide a simple characteriza-

tion of the equilibrium shape by looking at a sequence of games with period length Λ converging

to zero.

The equilibrium strategies have the following structure. Consider the subgame where at period

t, Player O has not taken the action yet. Then, when the state is Θ1, Player A proposes (θ1, π(θ1))

with probability one. When the state is Θ0, Player A mixes between proposing (θ1, π(θ1)), with

probability α∗t , and proposing (θ0, π(θ0)), with probability 1 − α∗t . When Player O observes the

proposal (θ1, π(θ1)), he mixes between taking the irreversible decision equals to the current belief

µ∗t , with probability β∗t , or waiting, with probability 1−β∗t . When Player O observes (θ0, π(θ0)), he

learns that the state is Θ0 and takes immediately the optimal decision z∗ = 0. If Player O waits at

t, Player B decides to reject if the proposal is (θ1, π(θ1)) and to accept if the proposal is (θ0, π(θ0)).

If Payer O decides to take the action at period t, the following subgame has a unique solution:

Player B accepts every feasible proposal, and Player A proposes (θ1, π(θ1)) when the state is Θ1

and (θ0, π(θ0)) when the state is Θ0.

A key aspect of these strategies is the mixing probabilities, α∗t and β∗t , which play a crucial role

in a potential delay in the bargaining. When there is an extended deadline T or when the prior p

13

is high, these mixing probabilities are degenerate, i.e., α∗0 = β∗0 = 1. Player A proposes (θ1, π(θ1))

with probability one, independently of the state of the world. Therefore, Player O does not learn by

delaying his decision. Thus, he immediately plays the uninformed action p. I call this equilibrium

a strong posturing equilibrium. On the other hand, when there is a short-term deadline, and the

prior is low, Player A and Player O mixed strategies are not degenerate, i.e., α∗t , β∗t ∈ (0, 1).

Both players engage in a war of attrition. With positive probability Player A compromises by

proposing (θ0, π(θ0)) when the state Θ0, and with positive probability Player O waits to obtain

more information from the subsequent rounds of the negotiation. I call this equilibrium a weak

posturing equilibrium.

To understand the connection between the posturing behavior, the deadline, and the prior,

consider the subgame when the negotiation reaches the deadline T and the outsider has not taken

the action yet.19 In this subgame, when the state is Θ0, Player A faces the following trade-off. By

posturing and proposing (θ1, π(θ1)), he makes an unfeasible proposal, which due to the deadline

limit, implies the destruction of the gains from the negotiations. By proposing (θ0, π(θ0)), he reveals

the non-preferable state Θ0 and gets a payoff of π(θ0) +VA(0). Let µA be the smallest belief µ such

that V (µ) ≥ π(θ0) + V (0).20 Then, when µT−Λ(hT−ΛO ) ≥ µA, Player A strictly prefers to propose

(θ1, π(θ1)) over (θ0, π(θ0)) when the state Θ0. Player O does not learn from the proposal and,

therefore, plays µT−Λ(hT−ΛO ) at T . In this situation, posturing is strong. When µT−Λ(hT−Λ

O ) < µA,

the value for posturing does not overweight the destruction of surplus. Hence, Player A mixes

so that these two forces are equalized. Thus, µT (hT−ΛO ∪ {(θ1, π(θ1))}) = µA. Therefore, when

µT−Λ(hT−ΛO ) < µA, posturing is weak. Given the equilibrium of the subgame, I deduce that when

there is a short-term deadline and the prior is small, Player O can obtain information about the

state Θ by waiting to period T . Thus, Player A must reveal information about the state of the

world. This fact explains why the weak posturing equilibrium depends on the deadline and prior.

To pin-down the equilibrium assessment, suppose that Player O is mixing along the equilibrium

path. Because Player O is sequentially rational, the optimal action at period t is to play the current

belief µt(htO), which I denote by µt. In order that Player O is indifferent between playing play µt,

19In the appendix I show that for this subgame, proposals are uniquely determined in every PBE satisfyingforward induction and monotonicity of beliefs.

20The monotonicity and continuity assumptions on VA ensures the existence and uniqueness of µA. Notice thatwhen π(θ0) < ∆VA, µA > 0.

14

or wait a period to learn from Player A’s proposals, the beliefs have to satisfy:Z(µt) = e−rΛ

( µtµt+Λ

Z(µt+Λ)︸︷︷︸A proposes (θ1,π(θ1)) at t+Λ

+ (1− µtµt+Λ

) Z(0)︸︷︷︸A proposes (θ0,π(θ0)) at t+Λ

)if t < T

µT = µA

(1)

Equation (1) could have multiple solutions; however, there is only one that satisfies continuity

as Λ→ 0.

Lemma 2 Consider t ≤ T < ∞. Suppose that (µ1s)Ts=t and (µ2

s)Ts=t solve Equation (1) from

s = t, . . . , T and satisfy the following condition

Z ′(µs)µs+Λ ≥ e−rΛ(Z(µs+Λ)− Z(0)) for s < T. (2)

Then, µ1s = µ2

s for every s = t, . . . , T . The solution (µs)Ts=t is increasing over time.

To induce the above sequence of beliefs, Player A proposes (θ1, π(θ1)), when the state is Θ0, with

probability αt = µt(1−µt+Λ)

µt+Λ(1−µt) . In particular, for t = 0, Player A mixes with probability α0 = p(1−µ0)µ0(1−p) .

From this, I observe that α0 < 1 if and only if p < µ0. Thus, if p < µ0 there is weak posturing.

If p > µ0, there is strong posturing along the equilibrium path. When there is strong posturing

and Player O deviates and wait, using Bayes’ rule, I get that the off-path belief equals to the prior

p. Therefore, the strong posturing strategy continues in the off-path game until the games reaches

period t, where µt−Λ < p < µt. For t ≥ t and in the subsequent periods, the off-path behavior

corresponds to the weak posturing behavior where beliefs follow Equation (1).

Given the beliefs (µt), I now formally define the strategies at every history of the game. For

this extent, I require the following notation.

• Let tE := inft≥0{a solution (µs)

Ts=t exists for Equation (1) satisfying Condition (2)}. Period tE

corresponds to the first time where a solution to Equation (1) exists. Notice that tE ≤ T and

tE =∞ when T =∞.

• Let t∗ = inft≥tE{µt ≤ p, where (µt)

Tt=tE

solve Equation (1) satisfying Condition (2)}. Period

t∗ corresponds to the first time where the sequence of beliefs is greater than p (because of

monotonicity µs > p for s > t∗).

15

• Consider the sequence of beliefs

µ∗t =

p if t < t∗

µt if t ≥ t∗.

This sequence generates the belief system of the equilibrium assessment.

• Let α∗t =µ∗t (1−µ∗t+Λ)

µ∗t+Λ(1−µ∗t ), the mixing probability for Player A that induces the sequence (µ∗t )

Tt=0.

• Let β∗t the mixing probability that keeps indifferent Player A between proposing θ1, π(θ1) or

(θ0, π(θ0)) when the state is Θ0. Thus,

π(θ0) + VA(0) = β∗t (e−rΛπ(θ0) + VA(µt))︸︷︷︸O plays µt after observing (θ1,π(θ1))

+ (1− β∗t ) e−rΛ(π(θ0) + VA(0))︸︷︷︸O waits and A plays (θ0,π(θ0)) at t+Λ

.

Solving the above equation, I obtain that

β∗t = (1− e−rΛ)π(θ0) + VA(0)

VA(µ∗t )− e−rΛVA(0)if t < T. (3)

When t = T , Player O does not learn by waiting. Hence, β∗T = 1.

• Let

uB(σ∗|Θ1, htB) =

T∑s=t+Λ

β∗s

s−1∏j=t+Λ

(1− β∗j )e−rΛ(s−t)VB(µ∗s),

Player B’s continuation payoff given that the state is Θ1, and that at history htB, Player A

proposes (θ1, π(θ1)) and Player O decides to wait.

• Let x∗t defined as

π(θ1)− x∗t + e−rΛVB(1) = uB(σ∗|Θ1, htB).

Thus, x∗t is the proposal that makes indifferent to Player B at period t between accepting,

an agreeing in policy θ1 or rejecting and continuing along the equilibrium path.

Equilibrium Assessment

The strategies for each player are as follows.

16

Strategy for Player A: consider htA a history such that there is not an agreement before t, and

Player O has not taken the action yet. If state is Θ1, P[σ∗t,Θ1

A (htA) = (θ1, π(θ1))] = 1. If the state is

Θ0, Player A mixes so that P[σ∗t,Θ0

A (htA) = (θ1, π(θ1))] = α∗t and P[σ∗t,Θ0

A (htA) = (θ0, π(θ0))] = 1−α∗t .Notice that α∗t = 1 if and only if t < t∗. Thus, if t < t∗ posturing is strong, and if t ≥ t∗ posturing is

weak. For a history htA where Player O has already taken the action, P[σ∗t,ΘA = (ψ(Θ), π(ψ(Θ))] = 1.

Strategy for Player B: consider a history htB such that there is not an agreement before t. If

Player O takes the action after t2, P [σ∗t,Θ1

B (htB) = a] = 1{xAt ≤x∗t } and P [σ∗t,Θ0

B (htB) = a] = 1{θt=θ0}.

If Player O takes the action before t2, P [σ∗t,ΘB (htB) = a] = 1{θt is feasible}.

Strategy for Player O: consider a history htO such that there is not an agreement before t,

θt = θ1 and xAt ≥ x∗t . If t < t∗ or t = T , P[σ∗tO(htO) = µ∗t ] = 1. If t∗ ≤ t and t < T , Player O

mixes so that P[σ∗tO(htO) = µ∗t ] = β∗t and P[σ∗tO(htO) = w] = 1 − β∗t . If Player O observes θt = θ0

or xAt < x∗t , then P[σ∗tO(htO) = 0] = 1. For a history htO where the negotiation ends before t,

P[σ∗tO(htO) = 1{θτa=θ1}] = 1.

The belief system µ∗ is as follows. For a history htO such there is not an agreement before

t,θt = θ1 and xAt ≥ x∗t , µ∗tO(htO) = µ∗t . For a history htO such there is not an agreement before

t and θt = θ0 or xAt < x∗t , µ∗tO(htO) = 0. For a history htO such there is an agreement before t,

µ∗tO(htO) = 1{θτa=θ1}.

Proposition 1 A δ ∈ (0, 1) exists such that if e−rT > δ or T = ∞, the assessment (σ∗, µ∗) is a

PBE of the game satisfying forward induction and monotonicity of beliefs. If t∗ = 0 and p < µ0

there is weak posturing along the equilibrium path. If not, there is strong posturing and there is an

immediate agreement in the negotiation.

Corollary 1 Suppose that e−rT > δ. Then, P[limΛ→0 e−rτΛ

0 Z(zτΛ0

) < pZ(1) + (1 − p)Z(0)] > 0.

Furthermore, if p < limΛ→0 µΛ0 , then P[limΛ→0 τ

Λa > 0] > 0.

Corollary 1 shows that when the negotiation is transparent an equilibrium exists where the

negotiation is inefficient. The outsider takes, with positive probability, a suboptimal decision.

Furthermore, if in the equilibrium there is weak posturing, there is delay in the bargaining which

is persistent as Λ → 0. Regarding the robustness of Corollary 1, Section 4.2 shows that, under

some conditions on the parameters, that these inefficiencies hold over a larger set of equilibria.

As a side remark, notice that in my model, there is no asymmetric information among the

negotiators and still, there is delay in the bargaining. Just the presence of a relevant outsider, may

affect the negotiators’ behavior with the consequence of destroying surplus of the bargaining.

17

To present a clearer expositions of these results, I analyze the limit of the sequence of the

posturing equilibria as Λ→ 0. The formal proofs can be found in the appendix.

4.1 Continuous time approach: Λ→ 0

I start the analysis by computing the beliefs along the equilibrium path. Consider the sequence

(µΛt ) that solves Equation (1) for Λ > 0. Because Z is continuously differentiable, the limit

µt = limΛ→0 µΛt , is differentiable.21 Thus, Equation (1) turns torZ(µt) = µt

µt(Z ′(µt)µt + Z(0)− Z(µt)) if t < T

µT = µA.(4)

The expression in the left-hand-side correspond the cost of waiting a period. The expression in

the right-hand-side measures the two benefits of waiting a period. The first term Z ′(µt)µt ≈(Z(µt + µtΛ) − Z(µt))/Λ is the marginal benefit of making a more accurate decision conditional

that Player O observes (θ1, π(θ1)). The second term is the benefit by playing 0 instead of µt when

the state is Θ0.

Because Z is strictly convex, Z ′(µt)µt + Z(0) − Z(µt) > 0 for µt > 0.22 Thus, the solution

trivially satisfy Condition (2). The convexity inequality and the smoothness of Z guarantee a

unique solution to Equation (4). Furthermore, because the solution is continuous we have that

tE := limΛ→0 tΛE is strictly less than T . From the Differential Equation (4) I obtain that if r or T

are sufficiently small, then tE = 0 and µ0 converges to µA. Therefore, when e−rT is small enough

the equilibrium exhibits strong posturing when p ≥ µA and weak posturing when p < µA.

Figures 3 and 4 depict the beliefs µt and µ∗t as a function of time for different discount rates.

When the discount rate is large, the slope of the beliefs are steeper. Hence, 0 < tE < t∗ =

limΛ→0 t∗,Λ and µ∗ differs from µ (see Figure 3). Because t∗ > 0, the limit equilibrium presents

strong posturing. Figure 4 analyzes the model when r is small. In this case, tE = t∗ = 0 and

µ∗ = µ. Since p < µ0, the limit equilibrium has weak posturing.

The limit-strategies of the players in the strong posturing region, when t < t∗, are the same

as in the discrete model. Player A proposes (θ1, π(θ1)) with probability one, Player O plays p

immediately, and Player B accepts every proposal after observing Player O taking the action. For

the weak posturing region, the limit-strategies take a simple form that I describe in the following

21The formal proof of the statement is in the appendix22A function f : [0, 1]→ R differentiable is strictly convex if for every x 6= y, f ′(x)(y − x) + f(x) < f(y).

18

Time

µA

p

1

t∗tE T0

µt

µ∗

t

Figure 3: Beliefs for the strong posturingequilibrium for r = 1.

Time

µA

p

1

0 T

}

← atom

µ0

µt = µ∗

t

Figure 4: Beliefs for the weak posturingequilibrium for r = 0.3.

Example for Z(µ) = 2− µ(1− µ) and T = 1.

section.

Weak posturing in continuous time: war of attrition

In the weak posturing equilibrium, Players A and O engage in continuous time war of attrition.

When t∗ = 0 and there is weak posturing along the equilibrium path, Player A places an atom at

t = 0 by playing (θ0, π(θ0)) with probability γ∗0− = 1 − limΛ→0 α∗,Λ0 = 1 − p(1−µ0)

µ0(1−p) (see Figure 4).

When t > 0, the war of attrition strategies of Players A and O converges to a continuous time

distribution. Let FΛA (t) the probability that Player A decides to concede by proposing (θ1, π(θ0))

before t when the state is Θ0. Then,

limΛ→0

1

Λ· F

ΛA (t+ Λ)− FΛ

A (t)

1− FΛA (t)

= limΛ→0

1− α∗,ΛtΛ

=µt

µt(1− µt)=

rZ(µt)

(1− µt) (Z ′(µt)µt + Z(0)− Z(µt)).

Thus, in the continuous time game, Player A proposes (θ0, π(θ0)) according to a continuous distri-

bution with hazard rate

γ∗t =rZ(µt)

(1− µt) (Z ′(µt)µt + Z(0)− Z(µt)).

Similarly, let FΛO (t) the probability that Player O decides to take the action before time t < T .

19

Then,

limΛ→0

1

Λ· F

ΛO (t+ Λ)− FΛ

O (t)

1− FΛO (t)

= limΛ→0

β∗,ΛtΛ

= rπ(θ0) + VA(0)

VA(µt)− V (0).

Thus, in the continuous time game, Player O takes the action at t < T , and playing µt, according

to a continuous distribution with hazard rate

η∗t = rπ(θ0) + VA(0)

VA(µt)− V (0). (5)

If the game reaches T , Player O places an atom at T and takes the action µA.

A common feature that appears in both equilibrium, strong and weak, is that the outsider

takes a suboptimal decision. Furthermore, the outsider’s payoff is bounded away from the payoff

he obtains with full information as Λ approaches zero. Indeed, by a simple computation, I deduce

that

limΛ→0

uO(σ∗,Λ|µ∗,Λ, h0O) = (p+ (1− p)(1− γ+

0−))Z(µ∗0) + (1− p)γ+0−Z(0) < pZ(1) + (1− p)Z(0),

where γ+0− = max{γ0− , 0}. In Section 4.2 I address this inefficiency result by looking at a larger set

of equilibria.

Tackling Proposition 1

Using the continuous-time approach, I address Proposition 1 by analyzing sequential rationality at

the critical histories. Thus, I restrict the attention to histories hti where there is not an agreement

of the negotiation yet, and Player O has not taken the action yet.

The conditions for Players B and O are simple to check. If Player B observes θt = θ0 and rejects,

then in the subgame, Player A never proposes less than (θ0, π(θ0)) afterwards. Thus, Player B’s

payoff by rejecting is less or equal than e−rΛVB(0). By accepting the proposal, he gets a payoff of

at least e−rΛVB(0). Hence, it is optimal for B to accept the proposal. If θt = θ1 and the state is Θ1

(if the state is Θ0, Player B is forced to reject the proposal), the division x∗t is set so that Player

B’s decision is sequentially rational.

Regarding Player O’s decision, notice that after observing xAt ≤ x∗t , his belief is that µt(htO) = 0.

Thus, playing 0 is sequentially rational. Suppose that he observes that θt = θ1, that xAt > x∗t and

that t < t∗. If Player O decides to deviate by waiting, Player B rejects, and in the next period,

Player A proposes (θ1, π(θ1)) with probability one and Player O plays µ∗t+Λ = µ∗t . Thus, under the

20

deviation, there is one period delay, and Player O takes the same action µ∗t . I conclude that Player

O prefers to take the action immediately. The other scenario is that θt = θ1, that xAt > x∗t and that

t ≥ t∗. In this situation, the belief (µ∗s)Ts=t satisfy Equation (4). Therefore, Player O is indifferent

between taking the action µ∗t or waiting. Thus, Player O is sequentially rational.

Regarding Player A’s decision, notice that if t < t∗, following the equilibrium strategies leads

to a payoff of π(ψ(Θ)) + VA(µ∗t ). He obtains the same payoff if he proposes θt = θ1 and xAt > x∗t .

However, if he proposes θt = θ0 or xAt ≤ x∗t his payoff is less or equal than π(ψ(Θ)) +VA(0). Hence,

Player A does not want to deviate. If the state is Θ0 and t ≥ t∗, Player A’s payoff by proposing

either θt = θ1 or xAt ≥ x∗t is π(θ0) + VA(0). If he proposes θt = θ0 or xAt < x∗t , his payoff less or

equal than min{xAt , π(θ0)}+VA(0). Therefore, Player A does not want deviate. Finally, it remains

the case when the state is Θ1 and t ≥ t∗. Suppose that Player A deviates by proposing θt = θ1

and xAt > x∗t , then the outcome is the same as the one by proposing (θ1, π(θ1)), with the caveat

that if Player O takes the action at t, he obtains xAt + VA(µ∗t ) instead of π(θ1) + VA(µ∗t ). Hence,

this deviation is not profitable. Suppose that Player A proposes either θt = θ0 or xAt ≤ x∗t . Then,

Player O immediately takes the action z∗ = 0. Thus, the optimal deviation is to propose θt = θ1

and x∗t . Therefore, a sufficient condition for the equilibrium to hold is that

limΛ→0

uA(σ∗,Λ|Θ1, htA) > x∗t + VA(0) .

where x∗t = limΛ→0 x∗,Λt .

Using the continuous time limit characterization of the strategies, I obtain

limΛ→0

uA(σ∗,Λ|Θ, htA) =

∫ T

t

e−(r(s−t)+∫ st ηwdw)ηs(π(θ1) + VA(µs))ds+ e−(r(T−t)+

∫ Tt ηwdw)(π(θ1) + VA(µA)),

limΛ→0

uB(σ∗,Λ|Θ1, htB) =

∫ T

t

e−(r(s−t)+∫ st ηwdw)ηsVB(µs)ds+ e−(r(T−t)+

∫ Tt ηwdw)VB(µA),

x∗t = H − limΛ→0

uB(σ∗,Λ|Θ1, htB) + VB(1),

where htB = hTA ∪ {(θ1, π(θ1)), w}.Thus, the equilibrium condition is equivalent to

∫ T

t

e−(r(s−t)+∫ st ηwdw)ηs(π(θ1) + VA(µs) + VB(µs))ds+ e−(r(T−t)+

∫ Tt ηwdw)(π(θ1) + VA(µA) + VB(µA)) > π(θ1) + VA(0) + VB(1).

The next lemma bounds the left-hand-side of the above inequality.

21

Lemma 3 For every t ≤ T ,the following inequality holds∫ T

t

e−(r(s−t)+∫ st ηwdw)ηs(π(θ1) + VA(µs) + VB(µs))ds+ e−(r(T−t)+

∫ Tt ηwdw)(π(θ1) + VA(µA) + VB(µA))

> (π(θ1) + VA(µA) + VB(µA))

(ηT

r + ηT+ e−(r+ηT )T r

r + ηT

).

Using Lemma 3, I deduce that a sufficient condition so that Player A does not want to deviate

is that

(π(θ1)+VA(µA)+VB(µA))

(π(θ0) + VA(0)

2L+ VA(0)+ e

−rT(

2π(θ0)+VA(0)

π(θ0)

)π(θ0)

2π(θ0) + VA(0)

)≥ π(θ1)+VA(0)+VB(1).

In particular, when e−rT → 1, the right-hand-side converges to π(θ1) +VA(µA) +VB(µA), which

is strictly greater than π(θ1) + VA(0) + VB(1). Therefore, a δ ∈ (0, 1) exists such that if e−rT > δ,

Player A is sequentially rational.

As a final remark, I can dispense the assumption that e−rT > δ if the following condition holds

(π(θ1) + VA(µA) + VB(µA))π(θ0) + VA(0)

2π(θ0) + VA(0)≥ π(θ1) + VA(0) + VB(1). (6)

Condition (6) is satisfied when ∆VA and ∆VB are high enough.23 For instance, consider VA(0) =

0, VB(1) = 0 and VB(µA) + VA(µA) > π(θ1)2

.

4.2 Inefficiency of transparent negotiation

In the previous section, I show that in the posturing equilibrium, there is inefficiency due to a

suboptimal decision from the outsider. I now generalize this result for a broader set of equilibria.

To be precise, under the assumption that e−rTVB(µA) > VB(1), I show that if p ≥ µA, then the

probability that the outsider takes an optimal action without delay is bounded away from one as

Λ approaches zero. Furthermore, strong posturing strategies are the unique proposal strategies

consistent with a PBE satisfying forward induction and monotonicty of beliefs.

Theorem 1 Suppose that e−rTVB(µA) > VB(1), p ≥ µA. A Λ > 0 exists such that if Λ < Λ and

(σ, µ) is a PBE satisfying forward induction and monotonicty of beliefs, then the equilibrium has

strong posturing behavior along the equilibrium path.

23Importantly, Condition (6) also ensures that x∗t < Hπ(θ1).

22

Theorem 1 states that when the length of the period is small, the equilibrium payoff that Player

O obtains is bounded away from the full information payoff. Thus, as Λ approaches zero, the

inefficiency related to Player O’s decision does not vanish. This inefficiency also affects the overall

welfare (including Players A and B) when the outcome of the full information game maximizes

total welfare.24 In such a case, Theorem 1 implies that the welfare-loss in equilibrium is persistent

as the length period Λ converges to zero.

To address the proof, consider the following lemma.

Lemma 4 Suppose that e−rTVB(µA) > VB(1), and consider (σ, µ) a monotone equilibrium satis-

fying forward induction.

Let htO any history such that: there is not an agreement of the negotiation before t < T ; Player

O has not taken the action before t; θt = θ1 and xAt = π(θ1). Then if µt(htO) ≥ µA, Λ > 0 exists

such that if Λ < Λ, the equilibrium is a strong posturing equilibrium. Along the equilibrium path,

Player A proposes (θ1, π(θ1)) independently of the state of the world, and Player O takes the action

immediately after observing (θ1, π(θ1))

To see this intuition behind this lemma, consider t = T . Because µT (hTO) ≥ µA , I show that

for this subgame there must be strong posturing along the equilibrium path.25 That is, Player A

proposes (θ1, π(θ1)) for both states and Player O takes the optimal decision at T . Moving a step

back in time, consider the history at T − Λ where Player A proposes (θ1, π(θ1)) and then Player

O waits. If Player B accepts, he obtains e−rΛVB(1), if he rejects, he obtains e−rΛVB(µT (hT−ΛO ∪

{r, (θ1, π(θ1))})). By Bayes’ rule, µT (hT−ΛO ∪ {r, (θ1, π(θ1))}) < 1. Hence, Player B rejects. Using

Bayes’ rule again, I conclude that µT (hT−ΛO ∪{r, (θ1, π(θ1))}) = µT−Λ(hT−2Λ

O ∪{r, (θ1, π(θ1))}) ≥ µA.

Therefore, after observing (θ1, π(θ1)) at T − Λ, Player O does not obtain information by delaying

his decision. Hence, he immediately takes the action at T − Λ. Using this logic until the initial

period, I conclude that if µT (hT−ΛO ∪ {r, (θ1, π(θ1))}) > µA then in any PBE satisfying forward

induction and monotonicity of beliefs, Player A proposes (θ1, π(θ1)) for both states and Player O

takes the action immediately.

For a general t < T , I proceed by backward induction. Suppose the proposition holds for t+ Λ.

Using the monotonicity of beliefs, the belief after observing a proposal (θ1, π(θ1)) is greater than

the belief after observing (θt, xAt ). Thus, the belief after observing (θ1, π(θ1)) is greater than µt(h

tO).

Because e−rTVB(µA) > VB(1), Player B prefers to reject (θ1, π(θ1)) and disagree until period T ,

24For example, when the externalities payoffs VA and VB are convex functions.25Lemma 7 in the appendix shows that every PBE satisfying forward induction and monotonicity of beliefs is a

posturing equilibrium. In particular, if µT (hTO) ≥ µA posturing is strong.

23

where gets at least VB(µA) rather accepting (θ1, π(θ1)) and obtaining VB(1). Hence, Player B

rejects when he observes (θ1, π(θ1)), independently of the state of the world. The Bayes restriction

over beliefs implies that the belief does not change if, in the next period, Player A proposes

(θ1, π(θ1)). Mathematically, µA = µt(htO) = µt+Λ(htO ∪ {r, (θ1, π(θ1))}). Because µt+Λ(ht+Λ

O ) ≥ µA

for hOt+Λ = htO ∪ {r, (θ1, π(θ1))}, the inductive hypothesis concludes the proof.

A consequence of Lemma 4 is that if the prior p ≥ µA, the equilibrium is unique (by Bayes’ rule

and the monotonicity of beliefs, µ0({(θ1, π(θ1))}) ≥ p. Therefore, I conclude from Lemma 4 that

when the length period Λ is small the outsider takes the uninformed decision with probability one.

5 Efficiency of non-transparent negotiation

The previous section shows that when the negotiation is transparent, that is, when proposals are

observable for the outsider, there is an inefficiency of the game which is persistent as the period

length goes to zero. This section shows that this result heavily relies on the information structure

chosen for the negotiation. In what follows, I consider a setting where the negotiation is non-

transparent: the outsider only observes whether there is an agreement or not, and the terms of

the agreement. Under a non-transparency policy, there is a separating equilibrium of the game,

which is approximately efficient as Λ converges to zero. I provide conditions that guarantee that

every equilibrium satisfies this property.

The non-transparent negotiation game is basically the same model as the transparent negoti-

ation game described in Section 3. The only difference is the information structure for Player O.

Here, a history htO for Player O consists of all the actions made by Player B prior to time t and

the outcome (θτa , xτa) in case τa < t. Regarding the action space for each player, their strategies,

the beliefs, and the solution concept, they remain invariant once is taking into account the new

information structure for Player O. Likewise, I naturally redefine the forward induction refinement

for this particular context.26 I keep the same assumptions made in Section 3.

Before going to the main results of the section, I develop a new refinement that ensures that

every equilibria of the game is approximately efficient.

26Given a belief system µ, the set D(θ, k|µ, htO) consists of all possible beliefs µ at k ∈ ({a, (θ, x)}, {r}) suchthat there are sequentially rational strategies to the perturbed belief system (µ, µ, htO) where k happens with posi-tive probability.

24

Refinement: Belief stable equilibrium

Definition 5 (Belief stable equilibrium) An assessment (σ, µ) is a belief stable equilibrium, if

(σ, µ) is a PBE such that for every history htO a sequence (µn)n≥0 exists, satisfying

1. That µn ∈ [0, 1] \ {µt(htO)} and µn → µt(htO).

2. For every player i ∈ {A,B,O}, we have that uni (hti) → ui(hti). Where uni (hti) is the expected

continuation payoff for player i at hti given the (µn, µ, htO)-assessment.

A belief stable equilibrium is an equilibrium where the payoffs that players obtain can be

approximated by the payoffs they would receive under some sequentially rational strategies to

some perturbed beliefs. Mathematically, it imposes that the correspondence from beliefs to all the

payoffs that are sequentially rational to those beliefs has to be upper-hemicontinuous.27

Separating Equilibrium

The first result provides the existence of a separating equilibrium. In a separating equilibrium,

Player A makes a different proposal for each state of the world. Player B immediately accepts the

respective proposal and, hence, he reveals the underlying state. Thus, Player O waits to observe

the agreement terms at the initial period, and takes the efficient action after time zero.28

Proposition 2 There exists a separating equilibrium satisfying forward induction and belief sta-

bility with the following properties:

I. When π(θ1)(1− e−rΛ) > e−rΛ(∆VB −∆VA), Player A proposes (θ1, H − e−rΛ∆VB) when the

state is Θ1, and proposes (θ0, π(θ0)) when the state is Θ0.

II. When π(θ1)(1−e−rΛ) ≤ e−rΛ(∆VB−∆VA), Player A proposes (θ1, π(θ1)−e−rΛ[e−r(T−Λ)VB(0)−VB(1)]+) when the state is Θ1 ,and proposes (θ0, π(θ0)) when the state is Θ0.29

The full characterization of the equilibrium assessment, and the proof of Proposition 2 are

delegated to the appendix. In what follows, I provide an intuition for the result.

27Kohlberg and Mertens (1986) develop a similar refinement, called, stable equilibrium. A stable equilibriumrequires that correspondence from games to equilibrium strategies must be upper-hemicontinuous.

28The assumption that there is a value for learning implies that Player O prefers to wait a period rather thantaking the action only using his prior information p.

29I use the notation [x]+ = max{x, 0}.

25

First, consider a history hTO where the negotiation reaches T , and Player O has not taken the

action yet. Suppose that the belief µT (hTO) is such that Player O has value for learning. Let

xB = π(θ1)−, H − e−rΛ∆VB. Then for any belief Player O could have after rejection we have that

Player B prefers to accept (θ1, xB − ε). Indeed,

uB(a|htB,Θ1) = π(θ1)− (xB− ε) + e−ΛVB(1) > π(θB)−xB + e−ΛVB(1) > e−ΛVB(0) ≥ uB(r|htB,Θ1),

Taking ε→ 0, I conclude that Player A can secure at least π(θ1)−xB+e−rΛVA(1) when the state is

Θ1. Thus, for any potential belief that Player O could have at hTO∪{r}, the negotiation never ends

in disagreement when the state is Θ1. In other words, Θ1 is not relevant to hTO ∪ {r}. Therefore,

the forward induction refinement imposes that, in case of rejection at T , Player O believes that the

state is Θ0. Consequently, Player A proposes (θ1, xB) when the state is Θ1, and proposes (θ0, π(θ0))

when the state is Θ0. Player B accepts every proposal. Thus, Player O prefers to wait a period

and then learn the state through the agreement terms. This reasoning shows that for the subgame

hTO, separating strategies are sequentially rational. As a final remark, notice this proof shows a

stronger result. In any equilibrium where Player O waits at T with probability one, the negotiation

ends with an agreement. Moreover, the agreement terms fully reveal the state of the world.

When t < T , a similar logic applies, however, it depends on his belief after observing a rejection.

For Scenario I, forward induction imposes that after observing a rejection at t, Player O believes

that the state is Θ0. Hence, this rejection-threat forces Player A to make an initial proposal of

(θ1, xB) when the state is Θ1. For Scenario II, however, this refinement does not longer apply. In

particular, I construct an equilibrium where the off-path belief after a rejection equals to the initial

prior p. Thus, in case of observing a rejection, Player O waits assuming that, in the next period,

there is an agreement in the negotiation which reveals the underlying state. As a consequence,

for Scenario II, the rejection-threat consists of reaching period T without an agreement. For

that reason, when the state is Θ1, Player A proposal is such that Player B is indifferent between

accepting or letting the negotiation to reach T .

Observe that, in this equilibrium, there is no delay in the negotiation, and the outsider takes

the optimal decision with one period delay. Thus, as the length period shrinks, the equilibrium

converges to the full information outcome. This result contrasts with the posturing equilibrium of

Section 4. There, the outsiders takes a suboptimal decision, moreover, if there is weak posturing

there is delay in the negotiation that is persistent when Λ → 0. Accordingly, when the full

information benchmark achieves the first best, the separating equilibrium converges to the efficient

outcome as Λ→ 0.

26

Nonetheless, the continuous-time limit of this separating equilibrium differs from the full in-

formation benchmark in one aspect: the pie distribution. When ∆VA ≥ ∆VB (Scenario I), or

∆VB > ∆VA and T is small (Scenario II), Player B obtains part of the gains from the negotiation

when the state is Θ1. This is because the gains from the negotiation in the high state are signif-

icant enough compare to the externalities. Thus, when the state is Θ1, negotiators always prefer

to agree rather than breaking the negotiation. Hence, a breakdown of negotiations should only

happen when the state is Θ0. This further generates a credible threat that Player B can pull when

the game is close to the deadline T . Thus, to avoid the surplus-destruction, Player A compensates

Player B by allocating to him a fraction of the gains from the negotiation. Moreover, the smaller

is the size of the externalities for Player B (∆VB), the lower is what Player A obtains from the

negotiation. The reason is that, in this case, the cost of delaying the negotiation is greater the

incentive cost that Player A needs to pay for an earlier agreement. Thus, observing a rejection

means that the state is Θ0. By the previous logic, Player B’s bargaining power increases since just

a delay of the negotiation triggers Player to believe that the state Θ0.

The following conditions guarantee that every equilibrium of the non-transparent negotiation

game is a separating equilibrium.

Assumption 1 Suppose that VA and VB satisfy the following properties:

1a. If VA(µ)− VA(0) ≥ π(θ0) then VB(µ)− VB(1) ≤ π(θ0).

1b. V ′A(0) < π(θ0)(Z(1)−Z(0)−Z′(0)

Z(0)

)and V ′B(1) < π(θ0)

(Z(0)−Z(1)−Z′(1)

Z(1)

).

Assumption 1a. means that the gains from the negotiations in the low state are significant in

comparison to the externalities VA and VB. That is if the belief µ is such that VA(µ)−VA(0) ≥ π(θ0)

the belief has to be sufficiently close to one so that VB(µ)− VB(1) ≤ π(θ0).

Assumption 1b. implies that for small Λ, each negotiator prefers to reveal the non-preferable

state and to obtain at least π(θ0) of the pie, rather than waiting a period to also obtain π(θ0),

and to induce a belief with a mild noise over the non-preferable state, so that Player O does not

have a value of learning. Thus, under Assumption 1b., the cost of delaying π(θ0) a period, which

is approximately rπ(θ0), is greater than the benefit generated by slightly changing the outsider’s

belief, which I show that is bounded by V ′i (ξθ)rZ(1−ξθ)−Z(ξθ)−Z′(ξθ)

Z′(ξθ), where ξθ = 1{θ=Θ1}.

Theorem 2 Suppose that Assumption 1 holds and that T < ∞. A Λ > 0 exists such that if

Λ < Λ, then every belief stable equilibrium satisfying forward induction is a separating equilibrium.

Furthermore, if π(θ0) < ∆VA, the on-path behavior is uniquely determined.

27

The following corollary is an immediate consequence of Theorem 2.

Corollary 2 Suppose that T <∞ and that Assumption 1 holds. Then, the outcome of every belief

stable equilibrium satisfying forward induction is approximately efficient as Λ→ 0.

Theorem 2 shows that along the equilibrium path, negotiators do not signal information about

the state through rejections. To see this, assume that if the negotiation reaches the deadline, the

unique equilibrium is separating – I address this point in the next paragraph. Consider now a

period previous to the deadline. If there is rejection along the equilibrium path, in the next period

T , Player O waits since he knows that negotiators are going to agree at T . Thus, a rejection at

T − Λ does not persuade Player O’s decision. Hence, it is optimal for negotiators to avoid an

unnecessarily delay by making an informative agreement at T − Λ. Therefore, the negotiators

play separating strategies at T − Λ as well. This logic generates an unraveling process, leading to

separating strategies at the initial period. Critical for this argument is that in case of a rejection,

Player O has a value of learning. Assumption 1b. ensures that, when the off-path beliefs is such

that Player O does not have a value of learning, there is no value for the negotiators to disagree

at that period.

The reason for the uniqueness of equilibrium for the subgame reaching period T is as follows.

Without loss of generality, suppose that Player O’s belief is such that he has value of learning. If

Player O waits with probability one, the same logic as for Proposition 2 implies that the equilibrium

has separating strategies. Hence, suppose for the sake of a contradiction that Player O plays at

T with positive probability. Because Player O is information-seeking, the posterior belief after

observing rejection at T must be noisy. Hence, Player B rejects with positive probability for

both states Θ0 and Θ1. Thus, Player A’s proposal when the state is Θ1 is rejected by Player

B. This implies that in equilibrium, Player A prefers to propose (θ1, π(θ1)) over the acceptable

proposal (θ1, π(θ1) − e−rΛ(VB(µT+Λ(hTO ∪ {r})) − VB(1)). This decision is optimal for Player A

if: (i) the probability that Player O waits at T is low; (ii) if the incentive cost for an agreement,

e−rΛ(VB(µT+Λ(hTO ∪{r}))− VB(1)), is high. However, conditions (i) and (ii) are impossible to hold

if Player B rejects when the state is Θ0. Using forward induction, I show that Player A has to

make an unfeasible proposal (θ1, x) when the state is Θ0. This behavior is optimal if: (iii) the

probability that Player O waits is high and if e−rΛVA(µT+Λ(hTO ∪{r})) > π(θ0) + e−rΛVA(0). Using

Assumption 1a. the last inequality turns implies that (iv) e−rΛ(VB(hTO ∪ {r}) − VB(1)) ≤ π(θ0).

Thus, Player A has a low cost to pay for an agreement when the state is Θ1. In the appendix, I

formally show that conditions violates (i) and (ii) are impossible to hold with conditions (iii) and

(iv). This shows that the subgame at T has a unique separating equilibrium.

28

6 Concluding remarks

There are several extensions of my framework that have not been addressed. First, how the

results would change if the asymmetric information among the negotiators is important. When

the negotiation is non-transparent, it could be that it is too costly for the outsider to wait until

negotiators reach an agreement for taking the action. Hence, transparency could be preferable

since it reveals some information to the outsiders earlier on time.

Another issue to be concerned about is the robustness of my results to the bargaining protocol.

In the model, proposals and offers are not a synonym. Similarly, to propose a proportion of

the surplus is different than proposing a particular allocation. In those cases, the incentives for

(strong) posturing are even higher when the negotiation is transparent. An equilibrium exists

where Player A obtains all the surplus of the negotiation, and the outsider takes the uninformed

action immediately. In turn, when the negotiation is non-transparent, an (approximately) efficient

equilibrium can be sustained by threating Player A that if there is no immediate agreement, Player

O will play z = 0.

Third, I assume throughout the paper that negotiators can only communicate their private

information to the outsider through the negotiation. If other communication channels are costless

(cheap talk), the same logic of our results will imply that the outsider will not obtain more infor-

mation. However, if negotiators somehow can “burn” money to send some signal, it may be the

case that a separating equilibrium could be sustained in equilibrium.

Beyond these extensions, it is important to notice that the conclusions of the paper do not

contradict why many negotiations are transparent. My results only claim what would be more

efficient, but does not provide a theory of how bargainers decide what transparency policy to

implement in the negotiation. Furthermore, my theory does not consider other aspects of the

negotiations that may be relevant as well. For instance, when negotiators represent multiple

constituents, transparency could be important since it provides legitimacy to the outcome of the

negotiation. Furthermore, it may also help to avoid moral hazard conflicts between the negotiators

and their constituents.

References

Abreu, D. and F. Gul (2000): “Bargaining and reputation,” Econometrica, 68, 85–117.

Chatterjee, K. and L. Samuelson (1987): “Bargaining with two-sided incomplete informa-

29

tion: An infinite horizon model with alternating offers,” The Review of Economic Studies, 54,

175–192.

Chaves, I. (2019): “Privacy in Bargaining: The Case of Endogenous Entry,” working paper,

Northwestern.

Drake-Brockman, J. and P. A. Messerlin (2018): Potential benefits of an Australia-EU free

trade agreement: Key issues and options, Adelaide: University of Adelaide Press.

Fernandez, R. and J. Glazer (1991): “Striking for a Bargain Between Two Completely

Informed Agents,” The American Economic Review, 240–252.

Fingleton, J. and M. Raith (2005): “Career concerns of bargainers,” Journal of Law, Eco-

nomics, and organization, 21, 179–204.

Fuchs, W., A. Ory, and A. Skrzypacz (2016): “Transparency and distressed sales under

asymmetric information,” Theoretical Economics, 11, 1103–1144.

Fudenberg, D. and J. Tirole (1991): Game Theory, Cambridge, MA: MIT Press.

Govindan, S. and R. Wilson (2009): “On forward induction,” Econometrica, 77, 1–28.

Horner, J. and N. Vieille (2009): “Public vs. private offers in the market for lemons,” Econo-

metrica, 77, 29–69.

Kambe, S. (1999): “Bargaining with imperfect commitment,” Games and Economic Behavior,

28, 217–237.

Kaya, A. and Q. Liu (2015): “Transparency and price formation,” Theoretical Economics, 10,

341–383.

Kohlberg, E. and J.-F. Mertens (1986): “On the strategic stability of equilibria,” Economet-

rica: Journal of the Econometric Society, 1003–1037.

Kreps, D. M., P. Milgrom, J. Roberts, and R. Wilson (1982): “Rational cooperation in

the finitely repeated prisoners’ dilemma,” Journal of Economic theory, 27, 245–252.

Kreps, D. M. and R. Wilson (1982): “Reputation and imperfect information,” Journal of

economic theory, 27, 253–279.

30

Marceddu, M. L. (2018): “Implementing Transparency and Public Participation in FTA Nego-

tiations: Are the Times a-Changin?” Journal of International Economic Law, 21, 681–702.

Milgrom, P. and J. Roberts (1982): “Predation, reputation, and entry deterrence,” Journal

of economic theory, 27, 280–312.

Perry, M. and P. J. Reny (1993): “A non-cooperative bargaining model with strategically

timed offers,” Journal of Economic Theory, 59, 50–77.

Perry, M. and L. Samuelson (1994): “Open-versus closed-door negotiations,” The RAND

Journal of Economics, 348–359.

Sakovics, J. (1993): “Delay in bargaining games with complete information,” Journal of Eco-

nomic Theory, 59, 78–95.

Stasavage, D. (2004): “Open-door or closed-door? Transparency in domestic and international

bargaining,” International organization, 58, 667–703.

Yildiz, M. (2004): “Waiting to persuade,” The Quarterly Journal of Economics, 119, 223–248.

A Appendix

Proof of Lemma 1.

The proof that an equilibrium exists such that: A proposes (θ, π(θ)), O playes z = 1{Θ=Θ1}

and B accepts, is straightforward from the standard one-sided proposal bargaining game and is

therefore omitted (See Section 4.4. in Fudenberg and Tirole (1991)). In what follows, I show the

uniqueness of the equilibrium.

Notice that when the state is Θ = Θ0, O immediately proposes z = 0. Because the external-

ities payments VA(0) and VB(0) accrue immediately, A and B face a standard one-sided proposal

bargaining game. Hence, when Θ = Θ0, negotiators behave is if there is not outsider present.

When Θ = Θ1 and T < ∞. The proof is by backward induction. For the subgame consider

the last period T , where the negotiators have not agreed prior to T . If the outsider has taken

the action. Clearly, A proposes (θ1, π(θ1)) and B accepts. If O has not taken the action, A can

guarantee at least π(θ1)− ε+VA(1). Indeed, if he proposes (θ1, π(θ1)− ε), it is optimal for O plays

z = 1 (he knows that for t > T , θt = θ1). Therefore, B prefers to accept and obtain ε + VB(1)

31

rather than rejecting and obtaining VB(1). Because this result hold for every ε, I conclude that A

proposes (θt, π(θt)), O plays z = 1, and B accepts. Using the same reasoning for T − Λ, . . . , 0, I

conclude that A always proposes (θt, π(θt)), O does not wait and plays z = 1, and B accepts.

For the case when Θ = Θ1, T = ∞, and Z(1) ≥ Z(0), I claim that can A guarantee at least

π(θ1) − ε + VA(1). Suppose A proposes (θ1, π(θ1) − ε). Then if O plays z = 1, VA(1) and VB(1)

are sunk, and therefore, B prefers to obtain ε of the bargaining pie rather than rejecting, and

given subgame perfection, obtain 0 from the pie. Because, Z is convex an Z(1) ≥ Z(0), O strictly

prefers to play z = 1 rather wait (or playing z < 1) and obtaining a payoff which is bounded by

e−rΛ maxµ ∈ [0, 1]Z(µ) = e−rΛZ(1). Therefore, A can get at least a payoff of π(θ1)− ε+ VA(1) for

every ε > 0. I conclude that A proposes (θt, π(θt)), O plays z = 1, and B accepts.

A.1 Proofs of Section 4

Proof of Lemma 2. At period T , µ1T = µ2

T = µA. Let

f(x|µs+Λ) = Z(x)− e−rΛ(

x

µs+Λ

(Z(µs+Λ)− Z(0)) + Z(0)

).

Notice that f ′′(x|µs+Λ) = Z ′′(x). Thus, f is strictly convex. The function f satisfies f(µs+Λ|µs+Λ) =

Z(µs+Λ)(1− e−rΛ) > 0 and

f ′(µs+Λ|µs+Λ) = Z ′(µs+Λ)µs+Λ − e−rΛ (Z(µs+Λ)− Z(0)) > (1− e−rΛ) (Z(µs+Λ)− Z(0)) > 0,

where the first inequality comes from the strict convexity of Z. Therefore, f(x|µs+Λ) > 0 for

x ≥ µs+Λ. This implies that any solution to Equation (1) is monotone.

Regarding uniqueness of the solution, since f(0|µs+Λ) = Z(0)(1 − e−rΛ) > 0, f(x|µs+Λ) can

cross zero at most twice. In particular, there is unique solution such that f ′(x|µs+Λ) ≥ 0. That is,

given µs+Λ, there is a unique solution µt solving Equation (1) and Condition (2). Because µT = µA,

by a simple backward induction argument I deduce the uniqueness result.

Proof of Proposition 1.

Simple inspection shows that beliefs are monotone. Also, by construction of the beliefs, they

follow Bayes’rule wherever is possible. To show sequentially rationality, notice that for every history

such that either there is already an agreement in the negotiation, or Player O has already taken

the action, the strategies are (trivially) the unique subgame PBE. Thus, in what follows I consider

a period t such that the there is not an agreement before period and Player O has not taken the

32

action yet. I use the one-shot deviation principle to prove sequential rationality.

For these histories notice that the equilibrium has the following memoryless property: the

decision at period t does not depend on the history before period t. I now proceed to show

sequential rationality for each player.

Regarding Player A, consider the case where t < t∗. If he proposes xtA < x∗t , he gets a payoff

of min{π(θ), xAt } + VA(0). If he proposes xAt ≥ x∗t he gets a payoff of min{π(θ), xAt } + VA(µ∗t ).

Thus, the optimal decision is to propose (θ1, π(θ1). For the case t ≥ t∗, the only relevant deviations

are θt = θ0 or xAt < x∗t (for θt = θ1 and xAt ≥ x∗t , the behavior of every player is as (θ1, π(θ1))).

When the state is Θ0, Player A is indifferent between proposing (θ1, π(θ1)) or proposing (θ0, π(θ0)),

and getting π(θ0) + VA(0). By proposing θt = θ0 or xAt ∈ [0, x∗t ) his payoff is less or equal than

max{e−rΛπ(θ0),min{π(θ0), xAt }}+ VA(0). Therefore, Player A prefers to propose (θ0, π(θ0)) which

is the same payoff as proposing (θ1, π(θ1)). Finally, consider the case when the state is Θ1. If Player

proposes θt = θ0 or xAt < x∗t , he obtains a payoff bounded by xAt +e−rΛVA(0). The equilibrium payoff

uA(σ|Θ1, htO) can be bounded by e−rT (π(θ1) + V (p)). This is because µ∗t ≥ p and τz ≤ T (a.s.).

Let δ such that δ(π(θ1) + V (p)) > xAt + VA(0). Then if e−rT > δ, then uA(σ|Θ1, htO) > xAt + VA(0).

Therefore, Player A is sequentially rational.

Regarding Player B if he observes (θ0, xAt ) and rejects, given (σ∗, µ∗), his payoff by rejecting is

less or equal than e−rΛVB(0). By accepting the proposal, he gets a payoff of at least e−rΛVB(0).

Thus, is optimal for B to accept the proposal. If the state is Θ1, by construction of x∗t Player

B decision is sequentially rational. Finally, to ensure that x∗t is well-defined, I take δ so that

uB(htB|Θ1) > e−rΛVB(1).

Regarding Player O, if he observes θt = θ0 or xAt < x∗t , µt(htO) = 0. Thus, playing 0 is

sequentially rational. Suppose that θt = θ1 and xAt ≥ x∗t , t < t∗,and that Player O deviates and

decides to wait (clearly, a deviation to make a different action is not profitable). In such situation,

Player B always rejects. If t + Λ < t∗, Player A plays (θ1, π(θ1)) and Player O takes the same

action at t + Λ than the optimal action at t. Because he does not learn at all by waiting, this

deviation is not profitable. If t + Λ ≥ t∗, in that case Player A mixes when the state is Θ0. If

Player O would have a profitable deviation then

Z(µ∗t ) < e−rΛ(

µ∗tµ∗t+Λ

Z(µ∗t+Λ) +

(1− µ∗t

µ∗t+Λ

)Z(0)

). (7)

If t∗ = tE, then f(x|µ∗t+Λ), defined as in the proof of Lemma 2, is strictly greater than zero (in th

lemma I show that f is convex, with f(0|µ∗t ) > 0 and f(µ∗t+Λ|µ∗t+Λ) > 0). Therefore, f(µ∗t |µ∗t+Λ) > 0,

33

this contradicts Inequality (7). If tE < t∗, then µt < µ∗t . Because (µt) satisfies Condition (2), and

µ∗t+Λ = µt+Λ, from the proof of Lemma 2 f ′(µt|µ∗t+Λ) > 0. Because f is convex, f ′(x|µ∗t+Λ) > 0 for

every x > µt. Because µ∗t > µt, I conclude that f(µ∗t |µ∗t+Λ) > 0, this contradicts Inequality (7).

Therefore, Player O does not have a profitable deviation when xAt > x∗t and t ≤ t∗. If xAt ≥ x∗t

and t ≥ t∗, then the belief (µ∗s)Ts=t satisfy Equation (1). Therefore, Player O is indifferent between

taking the action µ∗t or waiting. Thus, Player O is sequentially rational.

Proof of Lemma 3.

Because beliefs are increasing over time, I obtain that ηw = r π(θ0)+VA(0)VA(µs)−V (0)

> ηT for every w ≤ T .

Consider two distributions X and Y over [t, T ], where P[X > s] = e−(r(s−t)+∫ st ηwdw) and P[Y >

s] = e−(r+ηT )(s−t) for s ∈ [t, T ). Because ηw > ηT , I obtain that P[Y ≥ s] > P[X ≥ s] for every

s ≥ T . Hence, Y first order stochastic dominates X. Consider the function

g(s) =ηs

r + ηs(π(θ1) + VA(µs) + VB(µs)).

I assert that g is non-increasing. Indeed, since g(s) = π(θ0)+VA(0)π(θ0)+VA(µs)

(π(θ1) + VA(µs) + VB(µs)), I can

rewrite g as composition g(s) = h(µs), where h(µ) = π(θ0)+VA(0)π(θ0)+VA(µ)

(π(θ1) + VA(µ) + VB(µ)). Notice

that h is decreasing since

h′(µ) = (π(θ0) + VA(0))V ′B(µ)(π(θ0) + VA(µ))− V ′A(µ)(π(θ1) + VB(µ))

(π(θ0) + VA(µ))2< 0.

Using that µs is increasing over time, I conclude that g is a non-increasing function.

Because Y first order stochastic dominates X and g is non-increasing, I conclude that EX [g] ≥EY [g]. The above expectations are equivalent to∫ T

t

e−(r(s−t)+∫ st ηwdw) ηs(π(θ1) + VA(µs) + VB(µs))︸︷︷︸

(r+ηs)g(µs)

ds+ e−(r(T−t)+∫ Tt ηwdw) ηT

r + ηT(π(θ1) + VA(µA) + VB(µA))

≥∫ T

t

e−(r+ηT )(s−t)(r + ηT )g(µs)ds+ e−(r+ηT )(T−t) ηTr + ηT

(π(θ1) + VA(µA) + VB(µA)).

34

Therefore,∫ T

t

e−(r(s−t)+∫ st ηwdw)ηs(π(θ1) + VA(µs) + VB(µs)) + e−(r(T−t)+


=∫ T

t

e−(r(s−t)+∫ st ηwdw)ηs(π(θ1) + VA(µs) + VB(µs)) +

ηTr + ηT

e−(r(T−t)+∫ Tt ηwdw)(π(θ1) + VA(µA) + VB(µA)) +

r

r + ηTe−(r(T−t)+


≥∫ T

t

e−(r+ηT )(s−t)(r + ηT )g(µs)ds+ηT

r + ηTe−(r+ηT )(T−t)(π(θ1) + VA(µA) + VB(µA)) +

r



≥∫ T

t

e−(r+ηT )(s−t)(r + ηT )g(µT )ds︸︷︷︸g(µT )(1−e−(r+ηT )(T−t))

+ηT

r + ηTe−(r+ηT )(T−t)(π(θ1) + VA(µA) + VB(µA)) +

r



= (π(θ1) + VA(µA) + VB(µA))

(ηT

r + ηT+ e−(r+ηT )T r

r + ηT

).

The first inequality comes from EX [g] ≥ EY [g]. The second inequality comes from the fact that

g is decreasing for s ≤ T .

A.1.1 The continuous time limit

The following lemma shows that continuous time limit is well-defined.

Lemma 5 Let (µΛt )Tt=tΛE

the solution to Equation (1). Then, tE = limΛ→0 tΛE, t∗E exists. Also,

µt = limΛ→0 µΛtΛ

, where tΛ is any period in the game indexed by Λ that satisfies tΛ → t, exists.

Furthermore, µt is differentiable for t ∈ (tE, T ).

Proof. Consider the function f(x|µs+Λ,Λ) defined as in the proof of Lemma 2. Notice that

d

dΛf(x|µs+Λ,Λ) = re−rΛ

(x

µs+Λ

(Z(µs+Λ)− Z(0)) + Z(0)

)> 0,

d

dµs+Λ

f(x|µs+Λ,Λ) =−e−rΛ

µ2s+Λ

(Z ′(µs+Λ)µs+Λ − (Z(µs+Λ)− Z(0))) > 0.

where the second inequality holds due to the strict convexity of Z. A simple induction argument

shows that f(x|µΛ′

s+Λ′ ,Λ′) < f(x|µΛ

s+Λ,Λ) for every x and Λ′ < Λ. Therefore, µΛ′s > µΛ and tΛ

′E < tΛE

for Λ′ < Λ and s ≥ tΛE. Therefore, µs = limΛ→0 µΛs and tE = limΛ→0 t

ΛE are well-defined. I assert

that for µs > 0, µs is differentiable.

By definition µΛt , I obtain that

Z(µΛt+Λ)− Z(µΛ

t ) = (1− e−rΛ)Z(µΛt+Λ) + e−rΛ

(1− µΛ

t

µΛt+Λ

)(Z(µΛ

t+Λ)− Z(0)). (8)

35

From the convexity of Z, I get that Z(µΛt+Λ) − Z(µΛ

t ) ≥ Z ′(µΛt )(µΛ

t+Λ − µΛt ). Plugging this into

Equation (8), I derive that

(µΛt+Λ − µΛ

t )(Z ′(µΛ

t )µΛt+Λ − e−rΛ(Z(µΛ

t+Λ)− Z(0)))≤ (1− e−rΛ)Z(µΛ

t+Λ).

From Condition 2, I obtain that

|µΛt+Λ − µΛ

t | = µΛt+Λ − µΛ

t ≤ (1− e−rΛ)µΛt+ ΛZ(µΛ

t+Λ)

Z ′(µΛt )µΛ

t+Λ − e−rΛ(Z(µΛt+Λ)− Z(0))

. (9)

Using (again) the convexity of Z, I get that Z(µΛt+Λ)−Z(µΛ

t ) ≤ Z ′(µΛt+Λ)(µΛ

t+Λ−µΛt ). Plugging

this inequality into Equation (8), I derive that

(µΛt+Λ − µΛ

t )(Z ′(µΛ

t+Λ)µΛt+Λ − e−rΛ(Z(µΛ

t+Λ)− Z(0)))≥ (1− e−rΛ)Z(µΛ

t+Λ).

Because Z is convex Z ′(µΛt+Λ) > Z ′(µΛ

t ). Hence, from Condition 2, I obtain that

Z ′(µΛt+Λ)µΛ

t+Λ − e−rΛ(Z(µΛt+Λ)− Z(0)) > 0.

Thus,

|µΛt+Λ − µΛ

t | = µΛt+Λ − µΛ

t ≥ (1− e−rΛ)µΛt+ ΛZ(µΛ

t+Λ)

Z ′(µΛt+Λ)µΛ

t+Λ − e−rΛ(Z(µΛt+Λ)− Z(0))

. (10)

Let > Tt > tE such that µt > 0, then Z ′(µt)µt + Z(0) − Z(µt) > 0 and Z ′(µΛt+Λ)µΛ

t+Λ −e−rΛ(Z(µΛ

t+Λ) − Z(0)) > ξ for Λ small enough and ξ > 0. Using Inequality 9 and Inequality 10, I

obtain that µt is differentiable with

µt =µtZ(µt)

Z ′(µt)µt + Z(0)− Z(µt).

Furthermore because Z is strictly convex, Z ′(µt)µt + Z(0) − Z(µt) is bounded from zero for

every t > tE. Thus, using the Picard-Lindelof Theorem, I conclude that (µt) is the unique solution

of the o.d.e (4) for t ∈ [tE + ε, T ]. Because the result holds for every ε > 0, I conclude the proof

by taking ε→ 0.

36

A.1.2 Uniqueness of the subgame at period T

Consider the subgame at period T , where the negotiation has not reached an agreement yet and

Player O has not taken the action yet. I show that for this subgame, there is a unique monotone

equilibrium that satisfy forward induction.

The existence is shown in the proof of Proof of Proposition 1. Therefore, it remains to show

uniqueness.

Lemma 6 In any monotone equilibrium (σ, µ) we have that P[σTO(hT−ΛO ∪ {(θ1, π(θ1))}) = w] = 0.

Proof of Lemma 6. I assert that from Player O’s perspective, waiting does not provide any extra

information. If µT (hT−ΛO ∪ {(θ1, π(θ1))}) is degenerate, Player O takes the action immediately. If

µT (hT−ΛO ∪ {(θ1, π(θ1))}) ∈ (0, 1) from by Bayes’ rule I obtain that µT+Λ(hT−Λ

O ∪ {(θ1, π(θ1)), r}) <1.30 From the sequential rationality of Player B, Player B rejects for both states Θ0 and Θ1. Hence,

µT+Λ(hT−ΛO ∪ {(θ1, π(θ1)), r}) = µT (hT−Λ

O ∪ {(θ1, π(θ1))}). Thus,

uO(w|hT−ΛO ∪ {(θ1, π(θ1))}) = e−rΛZ(µT+Λ(hT−Λ

O ∪ {(θ1, π(θ1)), r}))

< Z(µT (hT−ΛO ∪ {(θ1, π(θ1))}))

= uO(µT (hT−ΛO ∪ {(θ1, π(θ1))})|hT−Λ

O ∪ {(θ1, π(θ1))}).

Therefore, Player O never waits after observing proposal (θ1, π(θ1)).

Lemma 7 Consider a monotone equilibrium (σ, µ) satisfying forward induction. Then, P[σT,Θ1

A (hTA) =

(θ1, π(θ1))] = 1.

Proof.

Consider a monotone equilibrium (σ, µ) and suppose, for the sake of a contradiction, that

(θ, x) ∈ supp(σT,Θ1

A (hTA)) exists such that θ = θ0 or x < π(θ1). Consider the case µT (hT−ΛO ∪

{x}) = 1. Because beliefs are monotone, for ε > 0 such that π(θ1) − ε > π(θ0), the beliefs are

µT (hT−ΛO ∪{π(θ1)−ε}) = 1. If A proposes π(θ1)−ε when the state is Θ1, we have that O plays z = 1

at period T , and B accepts the proposals. Thus, uA((θ1, π(θ1)− ε)|Θ1, hTA) = π(θ1)− ε+ VA(1) >

x+ VA(1) ≥ uA((θ, x)|Θ1, hTA). This is a contradiction.

Consider the case µT (hT−ΛO ∪ {(θ, x)}) < 1, from Bayes’ rule I conclude that with positive

probability Player A plays (θ, x) when the state is Θ0. This further implies that is sequentially

rational for Player A to play (θ, x) when the state is Θ0.

30With positive probability the state is Θ0, and therefore, Player B rejects.

37

If θ = θ1, notice that uA((θ, x)|Θ0, hTA) ≥ VA(µT (hT−Λ

O ∪{(θ, x)})). Furthermore, from Lemma 6,

uA((θ1, π(θ1))|Θ0, hTA) = VA(µT (hT−Λ

O ∪ {(θ1, π(θ1))})) ≥ VA(µT (hT−ΛO ∪ {(θ, x)})). Where the

inequality comes from the monotonicity of beliefs. The optimality condition requires that µT (hT−ΛO ∪

{(θ1, π(θ1))}) = µT (hT−ΛO ∪ {θ, x}) and that uA((θ, x)|Θ0, h

TA) = VA(µT (hT−Λ

O ∪ {(θ, x)})). Hence,

Player O plays at T with probability one after observing (θ, x). Thus, uA((θ, x)|Θ1, hTA) = x +

VA(µT (hT−ΛO ∪ {(θ, x)})). Consider the proposal (θ1, π(θ1) − ε), I claim that O does not wait. If

he waits, B always rejects.31 Therefore, uA((θ1, π(θ1) − ε)|Θ1, hTA) = π(θ1) − ε + VA(µT (hT−Λ

O ∪{(θ, x)})) > uA((θ, x)|Θ1, h

TA). This is a contradiction.

If θ = θ0, B always accepts when the state is Θ1. He obtains at least VB(0) while he obtains

VB(µT (hT−ΛO ∪ {(θ, x)} ∪ {r}) < VB(0). Hence, uA((θ, x)|Θ1, h

TAleπ(θ0) + VA(0). Suppose that

uA((θ, x)|Θ0, hTA) ≥ VA(1). Then, by Forward Induction, (or simply the intuitive criterion) I

obtain that µT (hT−ΛO ∪ {(θ1, x)}) = 1 for every x ∈ [0, π(θ1)]. Henceforth, Player A prefers to

propose (θ1, π(θ1)− ε), getting π(θ1)− ε+ VA(1) rather than proposing (θ, x), and getting at most

π(θ0) + VA(0). If uA((θ, x)|Θ0, hTA) < VA(1), from the sequential rationality of Player A, when

the state is Θ0, I obtain that µT (hT−ΛO ∪ {(θ1, π(θ1))}) < 1. Hence, for every ε > 0, µT (hT−Λ

O ∪{(θ1, π(θ1)− ε)}) ≤ µT (hT−Λ

O ∪{(θ1, π(θ1))}) < 1. Thus, for ε small enough Player O does not wait

when he observes the proposal (θ1, π(θ1)− ε). Therefore,

π(θ1)−ε+VA(µT (hT−ΛO ∪{(θ1, π(θ1)−ε)})) = uA(θ1, π(θ1)|Θ1, h

TA) ≤ uA((θ, x)|Θ1, h

TA) ≤ π(θ0)+VA(0) .

Taking ε→ 0, this implies that π(θ1) < π(θ0). This is a contradiction.

Proposition 3 Consider a monotone equilibrium (σ, µ) satisfying forward induction. For the sub-

game at period T , where the negotiation has not reached an agreement yet and Player O has

not taken the action yet, the equilibrium is a posturing equilibrium. Thus, when the state is Θ1,

Player A plays (θ1, π(θ1)); when the state is Θ0, A mixes so that µT (hT−ΛO ∪ {(θ1, π(θ1))}) =

max{µT−λ(hT−ΛO ), µVA}. Moreover, Player O plays with probability one at T .

Proof. From Lemma 7, I obtain have that P[σT,Θ1

A (hTA) = (θ1, π(θ1))] = 1. When the state

is Θ0, clearly A does not propose (θ1, x) with x < π(θ1). On the other hand for every ε > 0,

uA((θ0, π(θ0)− ε)|Θ0, hT−Λ) = π(θ0)− ε+VA(0). This is because B prefers to accept such proposal,

31The logic is the same as in Lemma 7. If B rejects, he obtains

VB(µT (hT−ΛO ∪ {(θ1, π(θ1)− ε), r})) ≥ VB(µT (hT−Λ

O ∪ {(θ1, π(θ1)− ε)})) ≥ ε+ VB(1),

where the inequalities hold for ε small enough.

38

independently on whether O waits or takes the action before. Thus, if π(θ0) ≥ ∆VA. There is a

separating equilibrium, where A proposes (θ, π(θ)) for θ ∈ {θ0, θ1}. B accepts each proposal. And

O plays accordingly.

When π(θ0) < ∆VA, we have that in equilibrium uA((θ0, π(θ0))|Θ0, hT−Λ) ≤ uA((θ1, π(θ1))|Θ0, h

T−Λ).

That is, π(θ0) +VA(0) ≤ VA(µT (hT−ΛO ∪{(θ1, π(θ1))}). Therefore, if µT (hT−Λ

O ∪{(θ1, π(θ1))}) > µA,

A proposes (θ1, π(θ1)) with probability one when the state is Θ0. If µT (hT−ΛO ∪{(θ1, π(θ1))}) = µA,

A mixes between both proposals. In both cases O takes the action immediately, and B accepts

whenever is feasible.

A.2 Proof of Section 4.2

Proof of Lemma 4. The proof is by backward induction for t = T, . . . , 0.

I prove the base case in Lemma 8. For the inductive step, consider a history htO such that

(θt, xAt ) = (θ1, π(θ1)) and µt(h

tO) ≥ µA. Under that history, if B rejects with probability one, then

µt(htO ∪{r, (θ1, π(θ1))}) ≥ µA. Henceforth, by the inductive hypothesis, the equilibrium is a strong

posturing equilibrium. If Player B accepts so that µt(htO ∪ {r, (θ1, π(θ1))}) < µA, I claim that

Player B has to be indifferent. If he strictly prefers to accept, then µt(htO ∪{r, (θ1, π(θ1))}) = 0. In

such case, Player B prefers to reject when the state is Θ1. Hence, Player B’s continuation payoff

by rejecting is exactly e−rΛVB(1). However, this cannot be true. By rejecting the proposal and

waiting to T he obtains at least VB(µA). By the assumption e−rΛVB(µA) > VB(1), this is profitable.

Therefore, Player B rejects when he observes (θ1, π(θ1)). Thus, µt+Λ(htO ∪ {r, (θ1, π(θ1))}) ≥ µA.

From the inductive hypothesis there is strong posturing and t + Λ. The same argument as in

Lemma 8 shows that there is strong posturing at t as well.

Lemma 8 Consider T < ∞ and a (σ, µ) monotone equilibrium, satisfying forward induction.

Suppose that there is some history hTO = hT−ΛO ∪ {(θT , xAT ), w, r} and µT (hT−Λ

O ) ≥ µA. Then, if

Λ < Λ for some Λ > 0, the equilibrium is a strong posturing equilibrium.

Proof.

The proof is backward induction.

Using Lemma 7, I conclude that µT (hTO) ≥ µA. Given that µT (hT−ΛO ) ≥ µA, Bayes’ rule imples

that in the subgame at t = T , there is strong posturing in the equilibrium.

Suppose that this result holds for the subgame from period t+ Λ, I want to show that there is

strong posturing at t.

39

Let (θ, x) ∈ supp(σt,Θ1

A ) and suppose that either θ = θ0 or x < π(θ1). If θ = θ0, then B

accepts and Player A obtains no more than π(θ0) + VA(0). I claim that if he deviates and offer

(θ1, π(θ1) − ε) for small ε, he obtains at least π(θ1) − ε + e−rΛVA(0). Suppose that this is not

the case. Therefore, µt(ht−ΛO ∪ {(θ1, π(θ1) − ε)}) ≤ µΛ < 1, where µΛ is the highest belief where

Player O weakly prefers to wait (See Equation 11 for a formal definition). Hence, by rejecting

(θ1, π(θ1)− ε Player B obtains at least e−rΛVB(µΛ) (by the inductive hypothesis) which is strictly

greater than the payoff by accepting, ε+e−rΛVB(1), if ε small enough. By the inductive hypothesis,

Player O does not learn by waiting and takes the action immediately. Hence, Player A obtains at

least π(θ1)− ε+ e−rΛVA(0). Because, π(θ1) > π(θ0), we have if Λ and ε are sufficiently small then

π(θ0) +VA(0) < π(θ1)− ε+ e−rΛVA(0).32 Therefore, A prefers to deviate and propose (θ1, π(θ1)− ε)over (θ, x). This is a contradiction.

For the case θ = θ1 and x < π(θ1), notice that µt(ht−ΛO ∪ {(θ, x)}) < 1: otherwise, by mono-

tonicity, µt(ht−ΛO ∪ {(θ1, π(θ1,−ε)}) = 1, and Player A deviates to (θ1, π(θ1,−ε)) when the state

is Θ1. Because µt(ht−ΛO ∪ {(θ, x)}) < 1, this implies that A proposes (θ, x) when the state is Θ0.

Because µt−Λ(ht−ΛO ) ≥ µA, using Bayes’ rule we can assume that µt(h

t−ΛO ∪ {(θ, x)}) ≥ µA. Hence,

if B rejects, the subgame has strong posturing. I assert that B rejects (θ, x) with probability one.

Suppose not, then uA((θ, x)|Θ0, htA) < e−rΛπ(θ0) + VA(µt(h

t−ΛO ∪ {(θ, x)})). However, if A deviates

and proposes (θ1, π(θ1)), tomorrow there is strong posturing. Thus, B rejects for sure if O waits

(e−rTVB(µa) > VB(1)). Consequently, O takes the action immediately. Therefore,

uA((θ1, π(θ1)|Θ0, htA) = e−rΛπ(θ0) + VA(µt(h

t−ΛO ∪ {(θ1, π(θ1))}))

≥ e−rΛπ(θ0) + VA(µt(ht−ΛO ∪ {(θ, x)})) > uA((θ, x)|Θ0, h

tA).

Hence, A prefers to propose (θ1, π(θ1)) over (θ, x). This is a contradiction. Therefore, B rejects for

sure when he observes (θ, x), and consequently, O takes the action immediately. However, A would

deviate when the state is Θ1 and propose (θ1, π(θ1)− ε) for ε small enough. By the same argument,

there is strong posturing if B rejects such proposal and, hence, O takes the action immediately.

Therefore,

uA((θ1, π(θ1)− ε|Θ1, htA) = π(θ1)− ε+ VA(µt(h

t−ΛO ∪ {(θ1, π(θ1)− ε)}))

≥ x+ VA(µt(ht−ΛO ∪ {(θ, x)})) = uA((θ, x)|Θ1, h

tA).

32Because p > µA, this means that π(θ0) < ∆VA < π(θ1).

40

This is a contradiction.

From the above, I conclude that P[σt,Θ1

A (htA) = (θ1, π(θ1))] = 1. If Player B after observing

(θ1, π(θ1)) and Player O waits decides to accept, he obtains er−ΛVB(0). If he rejects, he obtains

e−rΛVB(µt = t(htO)). From Bayes’ rule, if Player B rejects µt+Λ(htO ∪ {r, (θ1, π(θ1))}) < µt+Λ(htO ∪{a}).33 Hence, by the inductive hypothesis, Player O does not learn by waiting. Hence, he takes the

action immediately. Furthermore, using Bayes’ rule µt(htO) = µt+Λ(htO ∪ {r, (θ1, π(θ1))}), which by

inductive hypothesis, equals µT (hT−ΛO ) ≥ µA. Hence, uA((θ1, π(θ1))|htO,Θ0) ≥ e−rΛπ(θ0)+V (µA) >

π(θ0) + VA(0) ≥ uA((θ, x)|htO,Θ0) if (θ, x) 6= (θ1, π(θ1)). I conclude that there is strong posturing

at t.

A.3 Proofs of Section 5

Proof of Proposition 2. Suppose that prior to time t Player B has rejected and O has not

taken the action yet. The strategy for Player A is

σt,Θ0

A (htA) = (θ0, π(θ0)), σt,Θ1

A (htA) =

(θ1, xB) if π(θ1)(1− e−rΛ) > e−rΛ(∆VA −∆VB)

(θ1, π(θ1)− e−rΛ[e−r(T−t−Λ)VB(0)− VB(1)

]+) if π(θ1)(1− e−rΛ) ≤ e−rΛ(∆VB −∆VA).

Regarding the strategy for Player B. If Player O waits at t, then P[σt,Θ0

B (htB) = a] = 1 accepts if

θt = θ0,P[σt,Θ0

B (htB) = r] = 1 if θt = θ1, P[σt,Θ1

B (htB) = a] = 1 if xAt ≤ σt,Θ1

A (htA), P[σt,θ1B (htB) = r] = 1

if xAt > σt,Θ1

A (htA). When Player O takes the action at t, Player B accepts every feasible proposal.

Regarding the strategy for Player O, P[σtO(htO) = w] = 1.

For a history where there is an agreement prior t and Player O has not taken the action,

P [σtO(htO) = 0] = 1 if θτa = θ0 and P [σtO(htO) = 1] = 1 if θτa = θ1.

Finally for a history where Player O has taken the action before t, the equilibrium follows the

strategies from the bargaining game with one side proposal (see Section 4.4. in Fudenberg and

Tirole (1991)).

For history htO of rejections, the beliefs are:

µt(htO) =

0 if t > T or π(θ1)(1− e−rΛ) > e−rΛ(∆VB −∆VA)

p if t ≤ T or π(θ1)(1− e−rΛ) ≤ e−rΛ(∆VB −∆VA)..

33Because µt(htO) > 0, Bayes’ rule implies that µt(h

tO) > 0.

41

For a history htO where there is an agreement before t, the beliefs are

µt(htO) =

0 if θτa = θ0

1 if θτa = θ1

By simple inspection, the beliefs are consistent. Regarding sequential rationality, by the con-

struction of the strategies, I only need to consider histories htO where Player B has rejected from

0, . . . , t−Λ and O has not taken the action yet. Notice that if Player A proposes less than expected,

he obtains the same externality payoff but less from the negotiation pie. If he offers more than

σt,Θ1

A (ht), B rejects, and he has to wait a period to obtain less from the negotiation pie and less

from the negotiation outcome. Thus, Player A is sequentially rational. For Player B the same

reasoning applies. For Player O, by the value of information assumption, it is optimal for Player

O to wait a period and learn the state of the world. Thus, every player is sequentially rational.

Regarding belief stability, when π(θ1)(1−e−rΛ) ≤ e−rΛ(∆VA−∆VB) notice that same strategies

are sequentially rational if the belief at a history htO is any µ ∈ (µΛ, µΛ) (See Equations (11)and (12)

for a definition). Thus, the equilibrium is belief stable. When π(θ1)(1−e−rΛ) > e−rΛ(∆VA−∆VB),

consider µt(htO) = ε and let γ = e−rΛ(VB(0) − VB(ε)). Then, for ε small enough, consider the

strategies as σt,Θ1

A (htA) = (θ1, xB +γ), P[σt,Θ1

B (htB) = a] = 1 if xAt ≤ xB +γ and P[σt,Θ1

B (htB) = r] = 1

if xAt > xB + γ, and the other set of strategies defined as before. Then, such strategies, are

sequentially rational given the perturbed beliefs. Taking ε → 0, I obtain the equilibrium payoffs

for each player. Therefore, the assessment is belief stable.

Regarding the forward induction argument. Consider an off-path history where Player B has

rejected until t and Player O has not taken the action yet. From the Step 0. and Step 8. of

Lemma 9, I obtain that µt+Λ(htO∪{(θ, x)}) = 0 when θt = θ0 and µT+ΛO (hTO∪{r}) = 0, respectively.

Let

µΛ

= inf{µ ∈ [0, 1] s.t. Z(µ) < e−rΛ (µZ(1) + (1− µ)Z(0))},

the smallest belief where Player O prefers to wait a period. I define

η = maxµ∈[0,µ

Λ]VA(µ) + VB(µ)− (VA(0) + VB(0)).

From Lemma 10 I get that if π(θ1)(1− e−rΛ) > (∆VB−∆VA) + η, by forward induction, µt+Λ(htO ∪

42

{r}) = 0. However, when π(θ1)(1− e−rΛ) ≤ e−rΛ(∆VB−∆VA) +η, a belief µ ≤ µΛ

exists such that

π(θ1)(1− e−rΛ) ≤ e−rΛ(VB(µ)− VB(1)) + e−rΛ(VA(µ)− VA(1)) ⇐⇒ π(θ1)− e−rΛ(VB(µ)− VB(1)) + VA(1) ≤ e−rΛπ(θ1) + e−rΛVA(µ).

Because is sequentially for PlayerO to play µ at htO∪{r}. Then under that strategy, Player A prefers

to propose (θ1, π(θ1)), that B rejects, and to reach agreement at t + Λ. Thus, µ ∈ D(Θ1, r|µ, htO)

for t < T . Also if e−rΛ ≥ π(θ0)∆VA

, I obtain that π(θ0)(1− e−rΛ) > e−rΛ(VA(µ)− VA(0)). Thus, Player

A prefers to propose a non-feasible offer when the state is Θ0. Thus, µ ∈ D(Θ0, r|µ, htO) and the

forward induction argument does not refine the off-path beliefs.

I conclude that the assessment (σ, µ) is a PBE satisfying forward induction and belief stability.

Proof of Theorem 2

The proof is a consequence of a general result. For this purpose, I define

µΛ = sup{µ ∈ [0, 1] s.t. Z(µ) < e−rΛ (µZ(1) + (1− µ)Z(0))} (11)

µΛ

= inf{µ ∈ [0, 1] s.t. Z(µ) < e−rΛ (µZ(1) + (1− µ)Z(0))} (12)

η = maxµ∈[0,µ

Λ]VA(µ) + VB(µ)− (VA(0) + VB(0)). (13)

The beliefs µΛ and µΛ

are the threshold beliefs such if at some point µt(htO) ∈ [0, 1] \ [µ

Λ, µΛ],

Player O takes the action without taking into account what happens in the negotiation. Because

Z is strictly convex, if µ ∈ (µΛ, µΛ) Player O prefers to delay a period and learn the state rather

than taking the action µ. Notice that the information-value assumption implies that p ∈ (µΛ, µΛ).

Theorem 3 (Theorem 2 extended) Suppose that Assumption 1 holds and that T < ∞. A

Λ > 0 exists such that if Λ < Λ every equilibrium satisfying forward induction and belief stability is

a separating equilibrium. Furthermore, if π(θ0) < ∆VA the equilibrium path is uniquely determined.

Lemma 11 shows that limΛ→0 µΛ= 0. Hence, is straightforward to conclude that Theorem 2 is

a direct consequence of Theorem 3. Now I proceed to demonstrate Theorem 3.

Proof of Theorem 3. First, because p ∈ (µΛ, µΛ) using Lemma 9 I obtain that for Λ < Λ every

equilibrium that satisfies forward induction is a separating equilibrium. Moreover, Player A offers

(θ0, π(θ0)) when the state is Θ1, and Player B always accepts any proposal.

43

When π(θ1)(1 − e−rΛ) > e−rΛ(∆VB − ∆VA) + η and π(θ0) < ∆VA, Lemma 10 shows that in

the case of a rejection, Player O believes that the state is Θ1. Hence, after a history of rejections,

Player A proposes (θ1, xB) when the state is Θ1. Player B accepts if the proposal is such that

xAt ≤ xB, and rejects otherwise. In particular, this condition holds when ∆VA ≥ ∆VB. This shows

that the equilibrium is unique.

When π(θ1)(1 − e−rΛ) > e−rΛ(∆VB − ∆VA) − η, I claim that for a history of rejections ht),

we have that µt(htO) ∈ (µ

Λ, µΛ) and O waits with probability one at t. This claim implies that

on the equilibrium path, B accepts every proposal x ≤ π(θ1) − e−rΛ[e−r(T−t−Λ)VB(0)− VB(1)

]+.

Therefore, A proposes π(θ1)− e−rΛ[e−r(T−t−Λ)VB(0)− VB(1)

]+which proves the last statement of

the proposition.

I know proceed to prove the claim. Suppose that µt(htO) > µΛ. Because µΛ → 1 as Λ → 0

and π(θ0) < ∆VA, a Λ exists such if Λ < Λ we have that ı(θ0) > VA(1)− VA(µΛ) and e−rΛπ(θ0) +

VA(µt(htO) > π(θ0) + VA(0). Thus, Player A prefers to deviate by proposing an unfeasible offer.

This violates Lemma 9. If µt(htO) < µΛ, using that π(θ1)(1 − e−rΛ) > e−rΛ(∆VB − ∆VA) − η,

we have that π(θ1) − e−rΛVB(µt(htO) + e−rΛVA(1) < e−rΛπ(θ1) + e−rΛVA(µt(h

tO). Thus, A prefers

to propose (θ1, π(θ1)) at t − 1, and that B rejects. This violates Lemma 9. From the previous

discussion, we have that if µt(htO) ∈ {µ

Λ, µΛ}, Player O has to wait with positive probability at

t. Because the equilibrium is belief stable, the same argument I use in the inductive hypothesis of

Lemma 9 implies that O waits if µt(htO) = µΛ. This shows the assertion, which finishes the proof.

Lemma 9 Suppose Assumption 1 holds. A Λ > 0 exists such that if Λ < Λ, then every equilibrium

satisfying forward induction is a separating equilibrium. Furthermore, if π(θ0) < ∆VA the for a

history hT+ΛO where B always rejects, the belief satisfy µT+Λ(hT+Λ

O ) = 0.

Proof. The proof is by (backward) induction from t = T to t = 0. To simplify notation, let htOthe history where Player B rejects from 0 to t − Λ and Player O has not taken the action yet. I

denote µt = µt(htO) and βt = P[σtO(htO) 6= w].

I demonstrate the lemma by showing the following statement. Consider a history htO where

Player B rejects from 1, . . . , t and Player O has not taken the action yet. If µt ∈ (µΛ, µΛ) then

Player B accepts in every state and reveals the state θ,and Player O waits to t + Λ. Notice that

this statement clearly proves Lemma 9 since p ∈ (µΛ, µΛ). Moreover, Player B obtains e−rΛVB(0)

when the state is Θ0 and a payoff between e−r(T−t−Λ)VB(0) and e−rΛVB(0) when the state is Θ1.

In what follows, I fix a PBE (σ, µ) satisfying forward induction.

44

Base case

Fix a history hTO such that no agreement has been reached prior to T . I claim that if µT ∈ (µΛ, µΛ)

then A proposes (θ0, π(θ0)) when the state is Θ0 and proposes (θ1, π(θ1)− e−rΛ(VB(µT )− VB(1)))

when the state is Θ1. Player B’s payoff is e−rΛVB(0) when Θ = Θ0 and e−rΛVB(µT ) when Θ = Θ1.

Player O learns θ and waits with probability one. Moreover, if π(θ0) < ∆VA then µT+Λ = 0.

I split the argument into the following steps.

I claim that if βT < 1, Player B accepts with probability one when the state is Θ0. The

proof is by contradiction. For the following six steps, suppose that Player B rejects with positive

probability when Θ = Θ0.

Step 1. I claim that µT+Λ > 0. If µT+Λ = 0, because B rejects with positive probability when

Θ = Θ0, we have that uA(hTA|Θ0) < π(θ0) +βTVA(µT ) + (1−βT )e−rΛVA(0). Consider the deviation

where A proposes (θ0, π(θ0)−ε). Then, conditional that O waits, B gets ε+e−rΛVB(0) by accepting

the offer. This is strictly greater than B’s payoff by rejecting. Thus, B accepts (θ0, π(θ0) − ε).

Taking ε small enough we have that uA(hT−ΛA ∪ {(θ0, π(θ0) − ε)}|Θ0) = π(θ0) − ε + βTVA(µT ) +

(1− βT )e−rΛVA(0) > uA(hTA|Θ0). I conclude that A prefers to deviate by proposing (θ0, π(θ0)− ε),which is a contradiction. Hence, µT+Λ > 0.

Step 2. I claim that B rejects with positive probability when the state is Θ1, and that βT > 0.

Because µT+Λ > 0 (Step 1) by Bayes’ rule B rejects when the state is Θ1. If βT = 0, then A strictly

prefers to propose (θ1, xB − ε). This proposal is accepted by B. This implies, by Bayes’ rule that

µT+Λ = 0. This contradicts Step 1.

Step 3. When the state is Θ1, I assert that A proposes (θ1, π(θ1)) and B rejects it. From Step

2, a proposal (θ, x) exists such that B rejects it when Θ = Θ1. Suppose that B accepts (θ, x) with

positive probability, then we have that θ = θ1 and x ≥ xB. Hence, B reveals that the state is Θ1

when he accepts (θ, x). Because B is rejecting (θ, x) with positive probability, A’s payoff is strictly

less than x+ βTVA(µT ) + (1− βT )e−rΛVA(1). However, for ε > 0 small enough, A has a profitable

deviation by proposing (θ1, x − ε): B accepts with probability one, and there is no destruction

of the gains from the negotiation. Therefore, B rejects (θ, x) with probability one. From Step 2,

notice that pT > 0. From the optimality of A’s proposal this implies that (θ, x) = (θ1, π(θ1)).

Step 4. I claim that the following inequality holds

π(θ1)− e−rΛ(VB(µT+Λ)− VB(1)) + βTVA(µT ) + (1− βT )e−rΛVA(1) ≤ βT (π(θ1) + VA(µT )) + (1− βT )e−rΛVA(µT+Λ). (14)

Indeed, if A proposes (θ1, π(θ1)− e−rΛ(VB(µT+Λ)− VB(1))− ε), B accepts. Step 3 implies that

45

A prefers to proposes (θ1, π(θ1)) rather than proposing (θ1, π(θ1) − e−rΛ(VB(µT+Λ) − VB(1)) − ε),for every ε > 0. Taking ε→ 0, I derive Inequality (14).

Step 5. I claim that the following inequality holds

π(θ0) + βTVA(µT ) + (1− βT )e−rΛVA(0) ≤ βT (VA(µT )) + (1− βT )e−rΛVA(µT+Λ) ⇐⇒ (1− βT ) ≥ π(θ0)

e−rΛ(VA(µT+Λ)− VA(0)). (15)

By the same reasoning as in Step 3, I get that if the state is Θ0, then Player B accepts any

offer (θ0, x) for x < π(θ0). In particular, if A proposes (θ0, π(θ0)− ε), B accepts. Taking ε → 0, I

obtain the desired inequality.

Step 6. I claim the Inequality (14) and Inequality (15) are mutually exclusive. Indeed, replacing

Inequality (15) into Inequality (14), I get that

(1− βT )(π(θ1) + e−rΛ∆VA) ≤ e−rΛ(VB(µT+Λ)− VB(1)) + π(θ0) .

Because βT > 0 (Step 3), Inequality (15) implies that e−rΛ(VA(µT+Λ) − VA(0)) > π(θ0). Assump-

tion 1 (a.) implies that e−rΛ(VB(µT+Λ)−VB(1)) ≤ π(θ0). Hence, (1−βT )(π(θ1)+e−rΛV ) ≤ 2π(θ0).

However, this contradicts

(1− βT )(π(θ1) + e−rΛ∆VA) > 2(1− βT )(e−rΛ∆VA) ≥ 2π(θ0)(e−rΛ∆VA)

e−rΛ(VA(µT+Λ)− VA(0))> 2π(θ0) .

The first inequality comes from π(θ1) > ∆VA. The second inequalities comes from Inequality (15).

Therefore, if βT < 1, Player B accepts with probability one when Θ = Θ0.

Step 7. Suppose that µT ∈ (µΛµΛ), I claim that βT < 1. If βT = 1, then by optimality of

A we have that σT,ΘA (hTA) = (θ, π(θ)). Consider a deviation, where O waits instead. If µT+Λ > 0,

then B accepts (θ0, π(θ0)). Hence, Player O learns θ by waiting. Therefore, waiting is a profitable

deviation for Player O. If µT+Λ = 0, Player B strictly prefers to reject when Θ = Θ1. This violates

Bayes’ rule for µT+Λ.

From Step 7, I conclude that if µT ∈ (µΛ, µΛ), then Steps 1- 6 holds. Hence, Player B does not

reject when Θ = Θ0. Thus, A offers (θ0, π(θ0)) when Θ = Θ0. By the same logic as Step 3, B does

not reject when Θ = Θ1. Hence, O learns θ and βT = 0. B’s payoff is less or equal than e−rΛVB(0)

for every Θ ∈ {Θ0,Θ1}.Step 8. I assert that if π(θ0) < ∆VA then µT+Λ = 0 and B’s payoff is e−rΛVB(0). Consider a

belief µ for Player O at T + Λ after observing a rejection. From Step 1 to Step 7, this implies that

βT = 0. Thus, when state is Θ1, Player A strictly prefers to propose (θ1, xB − ε), and obtaining

46

xB − ε + e−rΛVA(1), rather than making a proposal that is going to be rejected, and obtaining

(e−rΛVA(µ)). Hence, D(Θ1, r|µ, hT ) = ∅. On the other hand, notice that π(θ0) + e−rΛVA(0) <

e−rΛVA(1). Therefore, µ ∈ D(Θ0, r|µ, hT ) for µ sufficiently close to 1. Using forward induction,

I conclude that µT+Λ = 0. Hence, Player A proposes (θ1, xB) when the state is Θ1. Therefore,

Player B obtains e−rΛVB(0).

This concludes the base case.

Inductive Step

Consider a history htO where Player B rejects from 0 to t−Λ and Player O has not taken the action

yet and that µt ∈ (µΛ, µΛ).

I claim that uB(htB ∪ {r}|Θ) ≤ e−rΛVB(0) for every Θ ∈ {Θ0,Θ1}. Consider a belief µ instead

of µt+Λ at t+ Λ. For µ < µΛ

or µ > µΛ, Player O does not wait and takes the action immediately.

Thus, B gets e−rΛVB(µ) ≤ e−rΛVB(0) at period t. If µ ∈ (µΛ, µΛ) by the induction hypothesis

B gets a payoff of e−2rΛVB(0) at time t. Because the equilibrium is belief stable, I conclude that

uB(htB ∪ {r}|Θ) ≤ e−rΛVB(0).

To conclude the proof, I divide the argument in three cases.

Case 1. That µt+Λ ∈ (µΛ, µΛ). I claim that for Θ = Θ0, A proposes (θ0, π(θ0)) and B accepts

it. Using the inductive hypothesis, if Player B rejects he obtains a payoff that is less or equal than

e−r2ΛVB(0). Therefore, Player B accepts (θ0, π(θ0)) and reveals θ to O. On the other hand, the

continuation payoff for Player A after a rejection is e−rΛ(π(θ0) + e−rΛVA(0)) < π(θ0) + e−rΛVA(0).

Therefore, when the state is Θ0, Player A proposes (θ0, π(θ0)) and B accepts it. Consequently,

when the state is Θ1, I assert that Player A proposes (θ1, x) with x ≥ xB and B accepts it. Suppose

that with positive probability Player B rejects such proposal. Then, by Bayes’ rule, µt+Λ = 1. This

contradicts the assumption that µt+Λ ∈ (µΛ, µΛ). Summarizing, Player B accepts with probability

one when Θ = Θ1. From the optimality of A, Player A proposes (θ1, π(θ1) − uB(htB ∪ {r}|Θ1).

Because, π(θ1)− uB(htB ∪ {r}|Θ1) > xB, Player B gets a payoff strictly less than e−rΛVB(0).

Case 2. That µt+Λ ≤ µΛ. I claim that Player A proposes (θ0, π(θ0)) when the state is Θ0, then

Player B accepts it, getting a payoff of e−rΛVB(0). From the same argument as the one used in

the Step 7 of the base case, I derive that Player O waits with positive probability at t, i.e., βt < 1.

Consider the proposal (θ0, π(θ0)− ε) when the state is Θ0. Because uB(htB ∪ {r}|Θ0) < e−rΛVB(0),

Player B strictly prefers to accept it since it reveals that the state is Θ0. Thus,

uA(htA∪{(θ0, π(θ0)−ε)}|Θ0)−uA(htA∪{r}|Θ0) > (1−βt)(π(θ0)+e−rΛVA(0)−e−rΛ(π(θ0)+e−rΛVA(µΛ)).

47

From Lemma 11, I obtain that

limΛ→0

e−rΛVA(µ

Λ))− V (0)

Λ= V ′A(0)

rZ(0)

Z(1)− Z(0)− Z ′(0).

Using Assumption 1, I conclude that

limΛ→0

e−rΛVA(µ

Λ))− V (0)

Λ= V ′A(0)

rZ(0)

Z(1)− Z(0)− Z ′(0)< rπ(θ0).

Because limΛ→0π(θ0)−e−rΛπ(θ0)

Λ= rπ(θ0), a Λ exists such that if Λ < Λ, then

π(θ0)+e−rΛVA(0)−e−rΛ(π(θ0)+e−rΛVA(µΛ)) > 0 =⇒ uA(htA∪{(θ0, π(θ0)−ε)}|Θ0)−uA(htA∪{r}|Θ0) > 0.

Therefore, Player A strictly prefers to propose (θ0, π(θ0) − ε) over an unfeasible proposal, for ε

small enough. In such case, Player B accepts with probability one when the state is Θ0 and gets

e−rΛVB(0). Hence, there is no rejection when the state is Θ0. Because µt+Λ < µt, I conclude that

µt+Λ is an off-path belief. Therefore, Player B accepts with probability one when the state is Θ1.

Case 3. That µt+Λ ≥ µΛ. If there is rejection with positive probability, because µt+Λ > µt

I conclude that, with positive probability, there is an agreement at t when the state is Θ0 and

there is a rejection when the state Θ1. Consider a deviation where A proposes (θ1, x) where

x = π(θ1)− e−rΛ(VB(µt+Λ)− VB(1)) at period t when Θ = Θ1. Clearly, B accepts. Moreover,

uA(htA ∪ {(θ1, x)}|Θ1)− uA(htA ∪ {r}|Θ1) > π(θ1)(1− e−rΛ)− e−rΛ(VB(µt+Λ)− VB(1)) ≥ π(θ0)(1− e−rΛ)− e−rΛ(VB(µt+Λ)− VB(1)).

Using that µt+Λ ∈ [µΛ, 1), Lemma 11 and Assumption 1, the same procedure as in Case 2 allow

me to conclude that Player A prefers to deviate and propose (θ1, x) at t and that Player B accepts

with probability one. This is a contradiction.

Lemma 10 Suppose that π(θ1)(1−e−rΛ) > e−rΛ(∆VB−∆VA)+η, π(θ0) < ∆VA and that Assump-

tion 1 holds. A Λ exists such if Λ < Λ and (σ, µ) is an equilibrium satisfying forward induction,

then µt+Λ(htO ∪ {r}) = 0 for every history htO where B rejects until t.

Proof. The proof is by (backward) induction from the last period t = T + Λ to t = 0. The base

case T + Λ corresponds to Step 8. in Lemma 9.

For the inductive hypothesis, consider a belief µ at the history ht+ΛO = htO ∪ {r}. For µ > µΛ, I

assert that Player A prefers to propose an unfeasible offer when the state is Θ0. Indeed, take Λ so

48

for Λ < Λ we have e−rΛ ≥ π(θ0)∆VA

. Then,

utA(htA ∪ {(θ0, π(θ0))}|Θ0) = π(θ0) + e−rΛVA(0) < e−rΛπ(θ0) + e−rΛVA(µ) = utA(htA ∪ {r}|Θ0).

Therefore, µ ∈ D(Θ0, r|µ, htO).

I assert that D(Θ1, r|µ, htO) = ∅. Indeed, for µ ∈ (µΛ, µΛ), and using the inductive hypothesis

I obtain that µt+2O (htO ∪ {r} ∪ {r}) = 0. Thus,

utA(htA ∪ {r}|Θ1) = e−rΛ(xB + e−rΛVA(1)) < xB + e−rΛVA(1) = utA(htA ∪ {(θ1, xb)}|Θ1).

I conclude that µ /∈ D(Θ1, r|µ, htO) if µ ∈ (µΛ, µΛ).

Consider µ ∈ [0, 1] \ [µΛ, µΛ], the following inequality holds when Λ < Λ:

π(θ1)− e−rΛ(VB(µ)− VB(1)) + VA(1) > e−rΛπ(θ1) + e−rΛVA(µ) ⇐⇒ π(θ1)(1− e−rΛ) > e−rΛ(VB(µ)− VB(1)) + e−rΛ(VA(µ)− VA(1)).

(16)

Indeed, for µ > µΛ, Lemma 11 and Assumption 1 implies that for Λ < Λ, π(θ1)(1 − e−rΛ) >

e−rΛ(VB(µ)− VB(1)). For µ < µΛ, notice that

π(θ1)(1−e−rΛ) > e−rΛ(∆VB −∆VA)+η =⇒ π(θ1)(1−e−rΛ) > e−rΛ(VB(µ)−VB(1))+e−rΛ(VA(µ)−VA(1)).

Using Inequality (16), I conclude that

utA(htA ∪ {r}|Θ1) = e−rΛ(π(θ1) + e−rΛVA(µ)) < π(θ1)− e−rΛ(VB(µ)− VB(1)) + e−rΛVA(1)

= utA(htA ∪ {(θ1, xB)}|Θ1).

Thus, µ /∈ D(Θ1, r|µ, htO).

Finally, for µ ∈ {µΛ, µΛ}, the payoff the Player A obtains in a rejection is a convex combination

of the previous two cases. Therefore, D(Θ1, r|µ, htO) = ∅.The forward induction criterion implies that µt+Λ

O (htO ∪ {r}) = 0.

Lemma 11 The following holds limΛ→0µ

Λ

∆= rZ(0)

Z(1)−Z(0)−Z′(0)and limΛ→0

1−µΛ

∆= rZ(1)

Z(0)−Z(1)−Z′(1).

Proof. Because Z is continuous, I conclude that

Z(µΛ) = e−rΛ

(µ

ΛZ(1)− (1− µ

Λ)Z(0)

).

49

Using that Z is strictly convex I conclude that limΛ→0 µΛ→ 0. Also,

Z(µΛ)− Z(0) = (e−rΛ − 1)Z(0) + e−rΛ(Z(1)− Z(0))µ

Λ. (17)

From the convexity of Z I obtain the following inequality:

Z ′(0)µΛ≤ Z(µ

Λ)− Z(0) ≤ Z ′(µ

Λ)µ

Λ.

Plugging this inequalities on Equation (17), I obtain that

µΛ

Λ

(Z ′(0)− e−rΛ(Z(1)− Z(0))

)≤ e−rΛ − 1

ΛZ(0) ≤

µΛ

Λ

(Z ′(µ

Λ)− e−rΛ(Z(1)− Z(0))

).

Taking Λ → 0, and using that limΛ→0 µΛ= 0 and Z ′ is continuous, concludes the proof for µ

Λ.

The case for µΛ is analogous, and is therefore omitted.

50

Documents

Posturing in Bargaining to In uence Outsiders...proposal game that ends after a (possibly in nite) deadline is reached. The set of feasible allocations depends on the state of the