Upload
nguyenkhanh
View
217
Download
1
Embed Size (px)
Citation preview
The Agency Model and MFN Clauses
JUSTIN P. JOHNSON†
January 25, 2017
Abstract. I provide an analysis of vertical relations in markets with imperfect
competition at both layers of the supply chain and where exchange is intermediated
either with wholesale prices or revenue-sharing contracts. Revenue-sharing is ex-
tremely attractive to firms that are able to set the revenue shares but often makes
the firms that set retail prices worse off. This is so whether revenue-sharing low-
ers or raises industry profits. These results are strengthened when a market moves
from “the wholesale model” of sales to “the agency model” of sales, which results
in retailers setting revenue shares and suppliers setting retail prices. I also show
that retail price-parity restrictions raise industry prices. These results provide a
potential explanation for why many online retailers have adopted the agency model
and retail price-parity clauses.
In this article I examine two business practices that are used especially in online markets.
The first of these is the “agency model” of sales, which refers to a supply chain in which
suppliers (rather than retailers) set final retail prices and in which sales revenue is split
between suppliers and retailers according to endogenously determined shares. The second
business practice is “retail price-parity” restrictions such as most-favored nation clauses
(MFNs), which stipulate that the retail price set by a supplier through one retailer can be
no higher than the retail price set by that supplier through a competing retailer.1
Advances in technology have made it easier to implement and enforce these practices, and
the agency model and retail price-parity restrictions are worthy of study precisely because
they are becoming fairly common in online markets. For example, the online retailer Amazon
uses the agency model in operation of its exchange for a vast variety of retail goods called
the “Amazon Marketplace,” which accounts for over 40% of the products it sells globally.
Amazon’s online competitor, eBay, also uses the agency model when merchants sell goods
I thank Rabah Amir, Aaron Edlin, Justin Ho, Thomas Jeitschko, Jin Li, Sridhar Moorthy, Daniel O’Brien, MarcoOttaviani, Tony Curzon Price, Andrew Rhodes, Thomas Ross, Henry Schneider, Yossi Spiegel, Michael Waldman,Glen Weyl, Julian Wright, and anonymous referees for extensive comments. I also thank participants at the 2012MaCCI Summer Institute in Competition Policy, the Fifth Annual Searle Conference on Antitrust Economics andCompetition Policy, the Fifth Annual Searle Conference on Internet Search and Innovation, the 2014 InternationalIndustrial Organization Conference, the 2013 Strategy Conference at Harvard Business School, the 2014 Workshopon Platforms at the National University of Singapore, the 2015 SICS Conference at the University of Berkeley, andthe EU Directorate General for Competition (Brussels).†Johnson Graduate School of Management, Cornell University. Email: [email protected], MFNs that so restrict retail prices only make sense when suppliers indeed set final retail prices, as underthe agency model. These clauses differ from the well-studied “wholesale MFNs,” which instead stipulate that asupplier must offer a retailer wholesale price terms no worse than those offered to a competing retailer.
2
using the fixed-price rather than the auction format (eBay now emphasizes the fixed-price
format over the auction format and it accounts for more than half of eBay’s sales). Products
as diverse as e-books, applications for mobile devices such as smartphones and tablets, car
insurance, and hotel reservations have been sold online using the agency model.
Moreover, in many of the markets just mentioned, MFN contracts have been used. In
particular, Amazon has used such contracts for its Amazon Marketplace (both in the EU
and the US), and (separately) in its e-book store. Apple imposed such restrictions when
it entered the e-book market in 2010. Price-comparison websites have used MFNs in their
dealings with private motor insurance providers, and online travel agencies such as Expedia
and Booking.com have used them. Pointedly, in each of these markets, MFNs have been the
subject of regulatory scrutiny, either in the EU, the US, or both.
So motivated, I seek to better understand the agency model and retail price-parity clauses.
My analysis encompasses not only bilateral monopoly but also bilateral imperfect compe-
tition (in which there is imperfect market power at both stages of supply), and allows for
flexibility in which firms in the supply chain set the terms of trade (given either by a wholesale
price or the split of revenue between firms) and which set retail prices.
My main message is that a shift from the wholesale model to the agency model benefits
retailers and consumers, raises overall surplus, but harms suppliers. That is, beginning from
a situation in which suppliers set wholesale prices and retailers set retail prices, a move to
a system in which retailers set revenue shares and suppliers set retail prices leads to an
increase in the profits of retailers, a decrease in the profits of suppliers, and an increase in
consumer surplus. In this sense, the agency model is a weapon that retailers can use against
suppliers, and to the benefit of consumers. However, I also show that retail price-parity
clauses, when used in conjunction with the agency model, work to raise prices and therefore
harm consumers.
Although I am motivated by the desire to understand the consequences of a shift from the
wholesale model to the agency model, my analysis is more general. I consider four distinct
business models, categorized by whether (i) a wholesale price or instead revenue-sharing is
used to transfer money between suppliers and retailers, and (ii) whether retailers or instead
suppliers set the retail price. Table 1 lays out these four business models. For any business
model, one firm sets the “terms of trade” given either by a wholesale price or a revenue-
sharing term, and the other firm sets the retail price. In addition to the wholesale model and
agency model, another common business model is the franchise model in which a supplier
sets a revenue-sharing term and the retailer sets the retail price.2 Less common is what I
call the consignment model, in which the retailer sets a wholesale price that it pays to the
supplier for each unit sold and in which the supplier sets the retail price.
2Many franchise operations including restaurants involve such contracts.
3
Wholesale Price Used Revenue-Sharing Used
Retail price set by retailer “wholesale model” “franchise model”
Retail price set by supplier “consignment model” “agency model”
Table 1. Four distinct business models involving a retailer and a supplier.The firm that is not setting the retail price is setting the terms of trade givenby either a wholesale price or instead a revenue share.
Before describing my results in more detail, I note that double-markups remain under
revenue-sharing, although they are more subtle. In particular, when firms setting revenue
shares keep some revenue for themselves, this is one markup. The reason is that it causes
firms setting retail prices to receive only a share of revenue but bear the entirety of their
own costs, causing them to act as if their costs are higher. The second markup is that which
the firms setting retail prices impose over these perceived marginal costs.
I now describe my leading results, building to the main message already put forth. First,
under an array of conditions that encompasses the case of bilateral monopoly, when revenue-
sharing is used each firm prefers to be the one that sets revenue shares rather than being
the one that sets retail prices, regardless of the shape of demand. That is, there is first-
mover advantage under the revenue-sharing model so that using the terminology of Table 1
the retailer always prefers the agency model over the franchise model whereas the supplier
always prefers the franchise model over the agency model. This result is interesting in part
because under wholesale pricing whether it is better to set the terms of trade (given by a
wholesale price) or instead the retail price depends on the log-curvature of demand.
Second, despite the fact that double markups do exist under revenue-sharing, the equilibrium
retail price tends to be lower than under wholesale pricing. This is true even when there is
imperfect competition at both layers of the supply chain (for the simpler case of bilateral
monopoly this result has been known since Bishop (1968)). An interesting corollary of this
result is that revenue-sharing may lead to lower industry profits. The reason is simply that
with imperfect bilateral competition the retail price under wholesale pricing may be lower
than that which maximizes industry profits (unlike with bilateral monopoly). When this
is the case, a move to revenue-sharing further lowers the retail price and therefore further
lowers industry profits.
Third, I show that the firms that set retail prices are often harmed by a shift to revenue-
sharing. For firms that set the terms of trade, however, under bilateral imperfect competition
their profits tend to increase with revenue-sharing; this extends an existing result concerning
bilateral monopoly. Interestingly, these results on firms’ profitability are true whether a shift
to revenue-sharing leads to higher or lower industry profits. For example, when bilateral
imperfect competition is at least moderately tough, revenue-sharing lowers industry profits
4
and yet benefits the firms setting the terms of trade while harming the firms that set retail
prices. But, when bilateral competition is not too tough, a move to revenue-sharing raises
industry profits and yet still harms the firms that set retail prices, while making the firms
that set revenue-shares better off. When a move to revenue-sharing is combined with a
change in which firm sets the retail price, these profitability results continue to hold: a shift
from the wholesale model to the agency model tends to make retailers and consumers better
off, but suppliers worse off.
My results provide a potential explanation for why retailers such as Amazon, Apple, and
eBay might benefit from using the agency model rather than the wholesale model, and why
they would wish to set the revenue-sharing terms themselves.
The intuition for all of the above results is as follows.3 Suppose that the firm setting retail
prices pays the other firm a wholesale price per unit sold. When this firm raises the retail
price by one, its own per-unit margin therefore increases by one. In contrast, when revenue-
sharing is in effect, an increase of one in the retail price is only partially captured by the
firm setting that price. That is, its own per-unit margin increases by less than one because
it must share the extra revenue from the price increase with the other firm. This effect
diminishes the effectiveness of price increases under revenue-sharing. Thus, compared to the
wholesale-price based model of sales, revenue-sharing puts the firm setting retail prices in a
difficult position, a fact that the firm setting the revenue share exploits to its own benefit
and the other firm’s harm.
My final main result is that retail price-parity restrictions in the form of retail MFN contracts
tend to raise industry prices. In particular, MFNs kill a retailer’s incentives to compete in the
terms of trade that it offers suppliers. The reason is that a retailer who raises the commission
it charges (or offers suppliers a lower revenue share) knows that the price set through its
store will not increase relative to that at other stores. Indeed, the essence of price parity
is that affected suppliers must raise the price that they set through all stores if they are
to raise prices at all. This means that suppliers cannot asymmetrically adjust their prices
to divert demand towards retailers offering more-attractive contractual terms. In turn, this
effect encourages retailers to charge higher fees to suppliers, which in equilibrium raises the
final retail price and harms consumers.
The prospect that MFNs might lead to higher retail prices—and negate any positive effects
from a switch to the agency model—suggests that such contracts ought to be carefully
scrutinized. Indeed, concerns have been expressed in multiple antitrust investigations in
3Note that there are no “delegation effects” driving my results, as is clear from inspection of the model presented inlater sections. Such work on the delegation effects of vertical structure includes, for example, Bonanno and Vickers(1988), Lin (1990), and Rey and Stiglitz (1995). They argue, respectively, that vertical separation, exclusivityprovisions, and exclusive territory restrictions may raise retail prices by softening supplier competition. My resultshold in a generalized model of bilateral monopoly in which such softening effects are necessarily absent.
5
the US and the EU, as I discuss in more detail later. However, I also discuss potential
pro-competitive effects of price-parity clauses.
I now briefly discuss some of the related literature. The effect of revenue-sharing alone has
been investigated both in the economics and operations research literatures (for example,
Mathewson and Winter (1985), Dana and Spier (2001), Cachon and Lariviere (2005), and
Krishnan et al. (2004)). These articles are quite different in focus and consider neither
supplier retail-price setting nor market power at both the manufacturing and retailing stages.
There are several related contributions to the economics of platforms. Gans (2012) argues
that most-favored nation contracts can resolve a holdup problem. Boik and Corts (2016) and
Edelman and Wright (2015) argue as I do that retail price-parity clauses may harm consumers
but do not consider differences between the agency model and wholesale models.4 Johnson
(2013) considers the agency and wholesale markets in an environment that is dynamic but
otherwise simpler and focused on different issues. Johansen and Verge (2016) argue such
restraints may be beneficial to consumers in certain circumstances, leading to lower prices.
Foros et al. (2015) investigate aspects of revenue-sharing and MFNs but focus on exogenously
given contracts, whereas such contracts are endogenous in my model. Hagiu and Wright
(2015) consider the choice between what they call a “marketplace” (the agency model in
my terminology) and a “reseller” (the wholesale model in my terminology), emphasizing the
importance of non-contractible decisions of suppliers. Gaudin and White (2014a) compare
the wholesale and agency models, studying a separate set of questions from this paper.
They show, in the context of the electronic book market, that the parallel sale of an e-
reading device by the retailer significantly influences this comparison. Gaudin and White
(2014b) consider taxation in a model that uses a conduct-parameter approach similar to
what I use. As I do, they identify certain circumstances under which revenue-sharing may
lower retail prices; I discuss the details later. Wang and Wright (2017) and Loertscher and
Niedermayer (2015) argue that revenue-sharing may allow for optimal price discrimination
against a heterogeneous group of downstream firms.
My paper is organized as follows. Section 1 analyzes bilateral monopoly, which is extended
in Section 2 to bilateral imperfect competition. Section 3 assesses price-parity clauses and
Section 4 concludes.
1. Bilateral Monopoly
In this section I analyze the revenue-sharing and wholesale pricing models of bilateral trade.
There are two firms, D and U , and end consumers characterized by the strictly decreasing
4The key mechanism in Edelman and Wright (2015) is that intermediaries make excessive investments in benefitsfor buyers which, along with price-parity, causes too many consumers to use intermediary services (such as a creditcard). However, the effect I identify also exists in their model, although I show it for a more general demand system.
6
demand curve Q(p), where p is the price of some good. For each unit of the good demanded
by consumers, firm U bears a constant marginal cost cU > 0 and firm D bears a constant
marginal cost of cD > 0.
A useful measure of the sensitivity of demand is
λ(p) = −Q(p)
Q′(p)> 0.
If demand is generated by some underlying distribution of consumer valuations, then λ(p)
is Mills’ ratio or the inverse hazard rate of that distribution.5 To ensure the quasi-concavity
of certain profit functions below, I assume that both λ(p) and λ(p)(2 − λ′(p)) have slopes
strictly less than one. Other assumptions will be introduced later to ensure equilibria are
well-behaved.
As is already known, and will be further demonstrated, properties of λ(p) are very important
in price theory.6 One such property is the sign of λ′(p), which determines the log-curvature of
demand (that is, the curvature of log(Q(p))): log-concavity, log-convexity, and log-linearity
are, respectively, equivalent to λ′(p) being (globally) negative, positive, or zero.7 As shown
by Bagnoli and Bergstrom (2005), many common distributions generate demand that is log-
concave or log-convex.8 Canonical illustrations of demand include linear, exponential, and
constant-elasticity, which are, respectively, log-concave, log-linear, and log-convex.9
Whether considering the use of wholesale prices or revenue-sharing, there is a distinct order
of moves with one firm setting the terms of trade governing exchange between the two firms
and the other firm then setting the retail price. To be as general as possible, I will consider
how the order of moves—and a change to this order—influences market outcomes, and so it
is necessary to introduce additional notation. To this end, agree that “firm 1” is the firm
5In particular, suppose consumer valuations θ are distributed according to F (θ) with density f(θ). At price p aconsumer θ buys if θ ≥ p, so that Q(p) ∼= 1−F (p) and Q′(p) ∼= −f(p), for some common constant of proportionalityequal to the size of the market. Thus,
λ(p) =1− F (p)
f(p),
which is Mills’ ratio of F .6For example, as shown by Amir et al. (2004), whether λ(p) is decreasing or instead increasing determines whetherthe pass-through rate of the change in a monopolist’s marginal cost is less than one or not. Similarly, the curvatureof λ(p) determines whether this pass-through rate is increasing or not.7For example, Q(p) is log-concave if
d2 log(Q(p))
dp2≤ 0⇐⇒ d
dp
Q′(p)
Q(p)= − d
dp
1
λ(p)=λ′(p)
λ(p)2≤ 0⇐⇒ λ′(p) ≤ 0.
8For example, demand is log-concave when the underlying distribution of consumer valuations is uniform, normal,and certain parameterizations of beta and gamma, just to name a few. Log-convex demand can arise from the weibullor power distribution, and some parameterizations of the gamma distribution.9Linear demand Q(p) = 1− p arises from a uniform distribution of consumer valuations, exponential demand Q(p) =
e−λp, λ > 0, arises from an exponential distribution, and constant-elasticity demand Q(p) = p−1/σ, σ ∈ (0, 1), arisesfrom a Pareto distribution.
7
that sets the terms of trade, and “firm 2” is the firm that sets the retail price, with marginal
costs c1 and c2, respectively. This approach allows me to consider each of the four business
models categorized in Table 1.
1.1. Wholesale pricing. With wholesale pricing, one firm sets a wholesale price w and
then the other firm sets p, taking w as given (and paying w to the first firm for each unit
sold). As noted above, I wish to preserve generality and so I do not specify whether it is U
or instead D that sets w, but instead denote this firm by 1, with marginal cost c1, and the
retail price-setting firm by 2, with marginal cost c2. In the terminology set forth in Table 1,
if the upstream firm sets w and the downstream firm sets p, then the wholesale model is in
effect (and c1 = cU and c2 = cD), whereas if the downstream firm sets w and the upstream
firm sets p, then the consignment model is in effect (and c1 = cD and c2 = cU).
Taking w as given, firm 2 chooses p to maximize
(p− w − c2)Q(p).
Recalling that λ(p) = −Q(p)/Q′(p), the first-order condition can be written as
p− w − c2 = λ(p). (1)
Now consider firm 1. Although this firm chooses w, a useful alternative view is that it chooses
p, with the per-unit transfer w equilibrating according to Equation (1): w = p− c2 − λ(p).
Thus, a unit increase of p results in an equilibrating increase of w equal to 1 − λ′(p).10
(Reciprocally, (1− λ′(p))−1 represents the pass-through rate of a monopolist responding to
a unit increase in its costs.)
Under this alternative formulation, firm 1’s profit (w − c1)Q(p) can be written as
[p− c1 − c2 − λ(p)]Q(p),
with first-order condition
p− c1 − c2 = λ(p) + λ(p)(1− λ′(p)). (2)
If pw is the equilibrium price then, because λ(pw) is firm 2’s margin, firm 1’s margin is
λ(pw)(1 − λ′(pw)). Of course, because of the familiar double-marginalization problem, pw
exceeds the price that maximizes joint industry profits.11 Note that the equilibrium retail
price pw (which solves Equation (2)) does not depend on which firm (D or U) is setting the
wholesale price and which firm is setting the retail price.
10An increase in w will always raise the price set by firm 2. Also, 1− λ′(p) > 0, evaluated at the price firm 2 wouldchoose facing some given value of w, represents the second-order condition of firm 2.11Because (i) firm 1 always chooses w > c1, (ii) firm 2 acts as a monopolist with effective marginal costs of w + c2,and (iii) the price a monopolist chooses always increases in its effective marginal costs, it follows that pw exceeds theprice that maximizes industry profits.
8
1.2. Revenue-sharing. I now consider revenue-sharing contracts. Firm 1 sets the terms of
trade, which are given by a revenue share r ∈ (0, 1) that stipulates the share of sales revenue
that firm 2 receives; firm 1 keeps the remaining 1− r share. Firm 2, taking r as given, then
chooses p. In the terminology set forth in Table 1, if U sets r and D sets p, then the franchise
model is in effect (and c1 = cU and c2 = cD), whereas if D sets r and U sets p, then the
agency model is in effect (and c1 = cD and c2 = cU).
Consider firm 2. Given that it keeps a share r of total revenue, it chooses p to maximize
(rp− c2)Q(p) = r(p− c2
r
)Q(p).
Optimization requires that
p− c2
r= λ(p)⇐⇒ rp− c2 = rλ(p). (3)
Thus, firm 2 prices just as it would under wholesale pricing if it were facing a combined
wholesale price and marginal cost equal to c2/r > c2. In other words, because 2 keeps only
a share of revenue but bears all of its costs of production, it acts as if its marginal cost is
higher than c2. I will refer to c2/r as the “perceived marginal cost” of firm 2.
Now consider firm 1. Its profits are
[(1− r)p− c1]Q(p).
Although firm 1 chooses r, as in the analysis of wholesale pricing it is equivalent and conve-
nient to suppose that firm 1 chooses p, with the terms of trade r then equilibrating according
to Equation (3). That equation can be rearranged to show that
r =c2
p− λ(p). (4)
Because r ∈ (0, 1), firm 1 is constrained to pick a price p exceeding that which firm 2 would
set if firm 2 received all the revenues. That is, firm 1 can only select p such that p > c2+λ(p),
ensuring the value of r defined above is in fact between zero and one. Also note that lower
values of r are necessarily associated with higher values of p, because lower values of r increase
firm 2’s perceived marginal cost c2/r. Using Equation (4), firm 1’s objective function can be
written as [p− c1 − c2 − c2
λ(p)
p− λ(p)
]Q(p). (5)
Differentiating and rearranging, the associated optimality condition is
p− c1 − c2 = λ(p)
[1 + c2
p(1− λ′(p))(p− λ(p))2
]. (6)
I assume that, in the admissible region p > c2 + λ(p), there is a unique solution pa to this
first-order condition and that it uniquely maximizes firm 1’s profits. I use the notation pa
9
when revenue-sharing is in effect, regardless of which firm is setting the retail price. Note,
however, that unlike the equilibrium price pw when wholesale prices are used, the equilibrium
price under revenue-sharing depends on which firm (D or U) is setting r and which is setting
the retail price (unless cD = cU , in which case move order does not alter the price under
revenue-sharing). However, this is taken into account in the expression above, simply by
setting c1 and c2 equal to cD or cU depending on the move order.
Lemma 1. Under revenue-sharing, the equilibrium retail price strictly exceeds the price that
maximizes industry profits. Equivalently, the equilibrium revenue-sharing term r is such that
c2
r> c1 + c2.
Lemma 1 says that there is a double-marginalization problem when revenue-sharing is in
effect: both firms impose a markup and joint firm profits are not maximized. Firm 2 imposes
a markup over its perceived marginal cost c2/r > c2, and firm 1 imposes a markup by choosing
r < 1, thereby inflating firm 1’s perceived marginal cost. Moreover, the equilibrium value of
r is such that c2/r > c1 + c2, meaning that firm 2’s perceived marginal costs exceed the sum
of its costs and firm 1’s costs.
1.3. Comparing sales models. Having laid out the basics of the different sales models,
here I provide a number of results. I begin by showing that moving first is advantageous
under revenue-sharing. That is, being the firm that sets the terms of trade r is preferable to
being the firm that sets the retail price p.
Although it may seem intuitive that it is better to be the first-mover, in fact this feature
is particular to revenue-sharing: with wholesale pricing, whether it is preferable to move
first or instead second depends on the log-curvature of demand. More specifically, under
wholesale pricing a firm prefers to move first if demand is log-concave but prefers to move
second if demand is log-convex. This observation regarding wholesale pricing has previously
been reported by Weyl and Fabinger (2013) and Adachi and Ebina (2014).12 An intuitive
way of seeing the result is by noting that the rate at which wholesale price increases are
passed through is given by (1−λ′(p))−1, which is less than one whenever λ′(p) < 0 (which is
equivalent to log-concavity), and greater than one whenever λ′(p) > 0 (which is equivalent
to log-convexity).13
12More precisely, Weyl and Fabinger (2013) provide conditions (pp. 562–563) from which this result can be derived.Adachi and Ebina (2014) provide the condition given above on λ′(pw) and consider several parametric examples thatyield different signs for this value. This argument is also closely related to the observation by Amir et al. (2004) thatthe pass-through rate of a monopolist is determined by the log-curvature of demand.13A more complete argument is as follows. The equilibrium price pw is invariant to the move order, which meansthat industry profits are as well. Because the margin of firm 1 is λ(pw)(1− λ′(pw)) whereas the margin of firm 2 isλ(pw), the definitions of log-concavity and log-convexity imply the result.
10
The intuition for why moving first is preferable under revenue-sharing is essentially that,
although doing so would not be optimal, the firm setting the revenue share could choose to
induce the price that maximizes industry profits and even at this price it would seize a larger
share of industry profits than it would as a second mover. Because actual industry profits
must be lower than maximized industry profits, moving first is best. This logic is fleshed out
in the proof of Proposition 1, found in the Appendix.
Proposition 1. Under revenue-sharing, each firm prefers to be the first-mover.
Proposition 1 is consistent with the observation that popular restaurant chains such as
McDonald’s set the revenue-sharing term rather than retail prices when dealing with non-
company owned outlets.14 That is, it provides one reason why McDonald’s (the upstream
firm) prefers the franchise model over the agency model.
I now assess the effect of revenue-sharing on the retail price. Although Lemma 1 indicates
that revenue-sharing does not eliminate the double-marginalization problem, it turns out
that under a certain condition on demand this pricing externality is nonetheless mitigated.
This condition involves the elasticity of λ(p), given by pλ′(p)/λ(p). Although I state the
condition in these terms, it is equivalent to (and better known as) Marshall’s Second Law of
Demand: pQ′(p)/Q(p) is decreasing in p.
Lemma 2 (Bishop (1968)). If the elasticity of Mills ratio is less than one (pλ′(p)/λ(p) ≤ 1),
which is equivalent to decreasing elasticity pQ′(p)/Q(p), then the equilibrium retail price
under revenue-sharing is strictly less than that under wholesale pricing (pa < pw). This
result holds even if the move order differs depending on whether wholesale pricing or revenue-
sharing is used.
In essence, Lemma 2 has been known since at least the work of Bishop (1968), although
several recent contributions including Gaudin and White (2014a) and Llobet and Padilla
(2014) also report this result.15 Gaudin and White (2014b) provide a generalization, which
I discuss later. In the relevant portion of his analysis, Bishop considered a government
maximizing its tax revenues and showed that this leads to a lower final retail price under
an ad valorem tax than under a per-unit tax. Interpreting the government as firm 1, this
more or less implies the result above. A small innovation here is that is that I have shown
the result is true regardless of the move order and have allowed both firms to have positive
marginal costs (whereas Bishop assumes that the government bears no costs).
14There are, of course, other possible explanations.15These papers do more than simply describe how prices differ under the two sales models. For example, Gaudinand White (2014a) broadly compare the agency and wholesale models in the presence of a complementary hardwaredevice such as an e-reader.
11
Note that the sufficient condition pλ′(p) ≤ λ(p) from Lemma 2 readily admits some log-
convex demand functions (most obviously the constant-elasticity one), in addition to log-
linear and all log-concave ones. Indeed, rewriting the elasticity of Mills’ ratio in terms of the
underlying demand function Q(p), the condition is equivalent to
pQ′(p)
Q(p)− pQ′′(p)
Q′(p)≤ 1.
The left-hand side is the difference between the elasticity of demand and the elasticity of the
slope of demand (this second term is also called the adjusted-concavity of demand), and the
condition for log-concavity is that this is less than zero. Thus, the condition clearly admits
some log-convex demand functions. For other log-convex demand functions the condition
may be violated and Gaudin and White (2014a) show by way of example that this may lead
to prices that are higher under revenue-sharing.
I now explore how a move to revenue-sharing influences the profitability of the firms, sup-
posing that the same firm that sets the terms of trade under wholesale pricing also sets them
under revenue-sharing. My focus will be on how revenue-sharing influences the profits of
the firm that sets the retail price. The reason for focusing on firm 2’s profits is that the
taxation literature already indicates how firm 1’s profits will change with revenue-sharing.
In particular, in a setting where the government sets a tax scheme and then a monopolist
chooses a price, it is well-known that an ad-valorem tax can raise at least as much revenue
as a per-unit tax (Suits and Musgrave (1953)). Interpreting firm 1 as the government and
setting the target tax revenue equal to firm 1’s equilibrium profit under wholesale pricing,
this existing result ensures that the firm that sets terms of trade earns higher profits when
revenue-sharing is in effect.16 In the next section I will significantly generalize this result by
showing it holds even with bilateral imperfect competition.
Assuming that a given firm will set the retail price regardless of whether revenue-sharing or
wholesale prices are used, how are its profits affected by a move to revenue-sharing?
Proposition 2. Suppose that the order in which firms D and U move is the same under
revenue-sharing as under wholesale pricing (so that the identities of firm 1 and firm 2 are
fixed). Then a sufficient condition for firm 2 to earn lower profits under revenue-sharing
than under wholesale pricing is that demand is either log-linear (λ(p) = λ > 0 for each p)
or constant-elasticity (λ(p) = σp for σ ∈ (0, 1)).
That firm 2 might earn lower profits under revenue-sharing is very different from the pre-
dictions of the taxation literature. In particular, Skeath and Trandel (1994) show that, for
16To see that firm 1 is better off with revenue-sharing, observe that under revenue-sharing firm 1 could instead choosep = pw and if it did then firm 2 would receive a margin of rλ(pw) (using Equation (3)), whereas firm 2 firm wouldreceive a margin of λ(pw) under wholesale pricing. Thus, firm 1 could assure itself a higher margin and the samelevel of demand under revenue-sharing than under wholesale pricing.
12
a target level of tax revenues, a government can, without harming consumers, find an ad-
valorem tax rate that makes the monopolist better off than with a per-unit tax. This is
essentially the opposite of what happens in the present context.
It is somewhat surprising that firm 2 is made worse off by revenue-sharing. After all, industry
profits increase with a move to revenue-sharing, as follows from Lemmas 1 and 2 (noting
that the cases of log-linear and constant-elasticity demand are covered by these lemmas).
Together, they show that the retail price is closer to the industry-profit-maximizing price
under revenue-sharing than under wholesale pricing. One might have thought that firm 2
would enjoy a share of these increased joint profits, but this is not the case.
The log-linear case is a natural one to consider, because with wholesale pricing firm 1 and
firm 2 receive equal profits. But even with log-concave demand firm 2 can be worse off
under revenue-sharing. This is notable because log-concavity implies that firm 2 is at a
disadvantage to firm 1 under wholesale pricing, as discussed prior to Proposition 1, and
so one might have thought that firm 2 would tend to prefer revenue-sharing. To see that
this need not be, suppose that demand is linear, Q(p) = 1 − p, and firms have equal costs,
c1 = c2 = c. For the range of costs that lead to positive equilibrium output (that is, c < 1/2),
the profits of firm 2 under the two sales models are shown in the right-hand panel of Figure
1; the left-hand panel of that figure illustrates the profits of firm 1.17 As is evident, firm 2 is
worse off under revenue-sharing (and firm 1 is better off).
0 0.1 0.2 0.3 0.4 0.5
0
0.05
0.1
0.15
0.2
Marginal cost c = c1 = c2
Fir
m1’s
pro
fit
Wholesale pricingRevenue-sharing
0 0.1 0.2 0.3 0.4 0.5
0
0.02
0.04
0.06
Marginal cost c = c1 = c2
Fir
m2’s
pro
fit
Wholesale pricingRevenue-sharing
Figure 1. Comparison of the profits of firms 1 and 2 under the two salesmodels, with linear demand Q(p) = 1− p and equal costs c = c1 = c2.
17To be precise, this figure illustrates profits for firm 1 and firm 2, under the assumption that the same firm i ∈ {D,U}moves first under both wholesale pricing and revenue-sharing. Also note that although linear demand is log-concave,log-concave demand by itself does not guarantee that firm 2 is worse off under revenue-sharing.
13
However, for other log-concave demand functions, firm 2 is better off under revenue-sharing—
it benefits from the increase in overall demand brought about by the retail price reduction
that occurs under the move to revenue-sharing. For example, when the underlying consumer
valuations are distributed according to a beta distribution with parameters ν = 3 and ω = 1,
it can be shown that firm 2 is better off with revenue-sharing.18 Indeed, at these parameters
the underlying density function is log-concave, which implies that the induced demand func-
tion is log-concave. Thus, Proposition 2 does not fully generalize to all log-concave demand
or even to all demand generated from log-concave density functions. However, as I show in
Proposition 3 below, when considering both a shift to revenue-sharing and a change in the
order in which firms move, the results do generalize.
At a slightly technical level, insight into Proposition 2 and its limitations is provided by
considering log-linear demand. In this case, it can be shown that even if firm 2 chose the
price pa (that would obtain under revenue-sharing) when wholesale pricing is in effect, its
resulting margin would be higher than it is under revenue-sharing (in which the same price pa
is the equilibrium outcome). This means firm 2, even behaving suboptimally under wholesale
pricing, could obtain a higher margin under wholesale pricing than it does behaving optimally
under revenue-sharing. Because it would sell the same number of units in either case, it
must prefer wholesale pricing. For this approach with log-linear demand, the fact that the
markup that firm 1 would choose under wholesale pricing is given by λ(pw)(1− λ′(pw)) = λ
(a constant) greatly simplifies the comparison with revenue-sharing. That is, under more
general demand the markup that firm 1 would choose under wholesale pricing need not be
readily comparable to that under revenue-sharing and indeed as mentioned the result need
not hold for certain demand functions.
So far my discussion and results have centered on the preferences of market players over
changes in either the move order (whether they set retail prices or terms of trade) or whether
revenue-sharing or instead wholesale prices are used. However, it is of interest to know how
players evaluate a change in both the move order and the mechanism by which surplus
is transferred. The reason is that recently there have been many real-world markets that
have undergone a change from the wholesale model to the agency model. Recall that this
corresponds to moving from a situation where firm U sets w and D then sets p to a situation
where D sets r and U then sets p.
Proposition 3. If demand is log-concave, log-linear, or constant-elasticity, then a shift from
the wholesale model to the agency model raises the profits of D but lowers the profits of U .
18The density function of the beta distribution with parameters ν and ω evaluated at x is given by
xν−1(1− x)ω−1
B(ν, ω)
where B(ν, ω) is the beta function. For a broad range of symmetric costs that I simulated, firm 2 was always betteroff with revenue-sharing.
14
From a practical standpoint, Proposition 3, along with its generalization to bilateral imper-
fect competition in the next section, provides a potential explanation for why online firms
such as Amazon and eBay may be switching to the agency model, and also of course suggests
suppliers need not be beneficiaries of this change. This presumes, of course, that the demand
curves in those markets are reasonably approximated by those covered by Proposition 3.19
The most interesting cases are where there are conflicting forces from the combination of
change in move order and change in business model. For example, continuing to maintain
the focus on retailers, consider D, suppose demand is constant-elasticity and that initially
the wholesale model is in place. Then (from the discussion prior to Proposition 1), D earns
higher profits than U does, and so a shift in only the move order would make D worse off.
However, as the current proposition shows, completing the move to the agency model by
also imposing revenue-sharing would make D better off than it was originally. Thus, the
overall intuition for this result is that the advantage of revenue-sharing for the firm setting
the terms of trade is so strong that it overwhelms any other effect. This resonates well with
my earlier result that under revenue-sharing it is always advantageous to set the terms of
trade (Proposition 1).
2. Bilateral Imperfect Competition
In this section I extend the model and analysis above to a setting in which there is imperfect
competition at both layers of the supply chain.20 Rather than specify any particular model
of such competition, I take an approach based on conduct parameters similar to that in the
study of pass-through under wholesale pricing by Weyl and Fabinger (2013), and which is
related to work by Gaudin and White (2014b).21 This allows my results to be stated in
terms of properties of the aggregate demand function in the industry, thus providing a close
analogue to the study of bilateral monopoly above.
2.1. A conduct-parameter approach. Given the work above, it is easier here to adopt an
approach that is slightly more compact. Let Q(p) denote the aggregate demand of consumers,
where one interpretation is that there are multiple substitute products each sold at common
price p, that there are multiple substitute retailers, and that demand is symmetric at p. As
before, λ(p) = −Q(p)/Q′(p). All derivatives in this section measure the effect of an increase
19Note also that all of the demand cases covered by this proposition are also covered by Lemma 2.20Related work that considers market power at multiple stages of production includes Salinger (1988), Reisinger andSchnitzer (2012), and Kourandi and Vettas (2012).21In an earlier draft of this paper, I specified a complete micro-foundation for such competition, and showed thatmoving from the wholesale model to the agency model could lower retail prices yet also make the downstream firmsbetter off at the expense of upstream firms. A downside of that model was that only very limited forms of demandcould tractably be considered. In particular, that model effectively assumed pass-through rates equal to unity.
15
in the common price p of the retail prices of all products sold through all stores. Thus, Q′(p)
measures the effect on aggregate demand from a small increase in all prices.
My analysis proceeds as follows. First, I specify how layer 2 reacts to arbitrary terms of
trade r or w, that is, which retail price p is set by the second layer given such terms of trade.
Second, I specify how the first layer determines these terms of trade, taking as given the
response of the second layer.
Suppose that (symmetric) terms of trade have been set by the first layer: all firms offering
contracts offer either the same revenue share r or the same wholesale price w. Let m2(p)
denote the per-unit margin of the second layer, given that all firms of that layer are setting
the same price p. Under wholesale pricing m2(p) = p−w−c2 whereas under revenue-sharing
m2(p) = rp− c2. The aggregate profit of the second layer is m2(p)Q(p), with derivative
m′2(p)Q(p) +m2(p)Q′(p).
For arbitrary r or w, I define the value of p set by layer 2 to be that which implicitly solves
m2(p)
m′2(p)= θ2λ(p) =⇒ m2(p) = θ2m
′2(p)λ(p), (7)
where θ2 ∈ (0, 1) is an exogenous constant that represents the conduct parameter of layer 2.
Computing the appropriate derivatives m′2(p), under wholesale pricing this says that m2(p) =
p − w − c2 = θ2λ(p) whereas under revenue-sharing (and using m′2(p) = r) this says that
m2(p) = rp− c2 = rθ2λ(p).
Note that if θ2 = 1 then these equations are precisely what was derived earlier for the optimal
response of firm 2 in the case of bilateral monopoly (see Equations (1) and (3)). On the other
hand, if θ2 = 0 then these margins are zero. Thus, varying θ2 spans the range of outcomes
from perfect competition in layer 2 to perfectly collusive behavior in layer 2.22
Now consider the profits of layer 1. As before, it can be supposed that this layer sets p, with
the terms of trade with layer 2 then equilibrating. For any final price p, the per-unit margin
of layer 1 is denoted by m1(p), and satisfies
m1(p) = p− c1 − c2 − m2(p), (8)
where m2(p) is the margin that layer 2 must be receiving if p is the price layer 2 selects.
From the work above, for a given p the margin of the second layer is m2(p) = θ2λ(p) under
wholesale pricing. Under revenue-sharing this margin is rθ2λ(p), but using the equilibrium
condition given just above for layer 2 (that rp− c2 = θ2rλ(p)), r is given by
r =c2
p− θ2λ(p),
22In line with assumptions from the analysis of bilateral monopoly, I suppose that the right-hand side of Equation (7)intersects the left-hand side once and from above, and that layer 2’s profits are quasiconcave in p for any given termsof trade.
16
a close analogue to Equation (4). Thus eliminating r, under revenue-sharing
m2(p) =θ2c2λ(p)
p− θ2λ(p). (9)
For arbitrary p, the overall profitability of the first layer is simply m1(p)Q(p), with derivative
m′1(p)Q(p) +m1(p)Q′(p).
I define the equilibrium value of p to be that which satisfies
m1(p)
m′1(p)= θ1λ(p) =⇒ m1(p) = θ1m
′1(p)λ(p), (10)
where θ1 ∈ (0, 1) is an exogenous constant that represents the conduct parameter of layer 1.
As θ1 ranges from zero to one the resulting price p ranges from that which gives layer 1 no
margin to that which maximizes the profits of layer 1 (given how layer 2 will respond).23
Noting that m′1(p) = 1−m′2(p) (using Equation (8)) and computing the value of m′2(p) under
both wholesale pricing and revenue-sharing, equations characterizing the industry price are
readily derived. As before, let pw and pa denote the equilibrium retail prices when wholesale
prices or instead revenue-sharing is used.
Lemma 3. Under bilateral imperfect competition with conduct parameters θ1 and θ2, the
retail price pw under wholesale pricing satisfies
pw − c1 − c2 = θ2λ(pw) + θ1λ(pw)(1− θ2λ′(pw)), (11)
and the retail price pa under revenue-sharing satisfies
pa − c1 − c2 = θ1λ(pa) + θ2c2λ(pa)
[pa(1− θ1λ
′(pa)) + λ(pa)(θ1 − θ2)
(pa − θ2λ(pa))2
]. (12)
If θ1 = θ2 = 1, these equations reduce to those derived for the bilateral monopoly case
(Equations (2) and (6)). And, for θ1 = θ2 = 0, both of these industry margins equal zero.
Finally, note that pw is, once again, independent of which firm moves first, as can be seen
by expanding the right-hand side of Equation (11).
I will now show that, under broadly similar conditions on the shape of demand, key results
derived for bilateral monopoly continue to hold under bilateral imperfect competition. I
begin by investigating the desire of D or U to move first when revenue-sharing is in effect.
Denote by θD and θU the underlying conduct parameters of the two layers.
Proposition 4. Under revenue-sharing, each firm prefers to be the first-mover if either of
the following conditions hold:
23In line with assumptions from the analysis of bilateral monopoly, I suppose that the right-hand side of Equation(10) intersects the left-hand side once and from above (in the range where p > c2 + θ2λ(p)), and that layer 1’s profitsare quasiconcave in p.
17
(1) market power is balanced (θD = θU),
(2) aggregate demand Q(p) is log-linear or constant elasticity.
This proposition confirms that moving first under revenue-sharing is particularly appealing
under bilateral imperfect competition. The intuition is clearest when market power is bal-
anced. In that case, although the logic is more involved than in the bilateral monopoly case,
a conceptually similar argument can be made. Loosely speaking, it can be shown that the
first-mover can replicate the retail price that would emerge if it were instead the second-
mover.24 Although doing so is suboptimal, it is possible and moreover would lead to the
same number of units being sold. Similar to the earlier result, it is then possible to explicitly
show that charging this lower price would enable the first-mover to ensure itself a higher
share of revenues than it receives as a second-mover. Thus, the first-mover can ensure itself
a higher margin than it gets when it is the second-mover (while also selling as many units
as it sells as the second-mover).
The next question is what happens to the retail price when revenue-sharing is in effect.
Lemma 4. Under bilateral imperfect competition, if the elasticity of Mills ratio is less than
one (pλ′(p)/λ(p) ≤ 1), then the equilibrium retail price under revenue-sharing is strictly less
than that under wholesale pricing (pa < pw). This result holds even if the move order differs
depending on whether wholesale pricing or revenue-sharing is used.
In prior work, Gaudin and White (2014b) independently derive this result for the case where
there is an upstream monopolist (represented by a government maximizing tax revenues).
I extend their result by showing that even when there is competition at both layers of the
supply chain, and regardless of which layer sets the retail price, revenue-sharing tends to
lower prices.25
However, there is an important qualitative difference compared to the case of bilateral mo-
nopoly or upstream monopoly (as considered by Gaudin and White (2014b)). Recall that in
the bilateral monopoly case a move to revenue-sharing lowers the retail price, which moves
it closer to the price that maximizes industry profits. In contrast, with bilateral imperfect
competition a price reduction starting from pw may lower industry profits. The reason is
simply that, depending on the conduct parameters, pw may be lower than the price that
maximizes industry profits, so that a further price reduction further lowers industry profits.
This possibility can be seen clearly in Figure 2, which graphs equilibrium prices in compar-
ison to the price that maximizes industry profits for a linear-demand specification. That
24The description here is loose in that the layers are compelled to pick prices that satisfy the conditions above, ratherthan being free to select arbitrary prices. See the proof for details.25Anderson et al. (2001) show a related result, which is that a government can always raise as much revenue andreach higher output under an ad-valorem tax rather than a unit tax, when selling to competing downstream firms.However, they do not consider the case where the government seeks to maximize revenue.
18
0 0.2 0.4 0.6 0.8 1
0.2
0.4
0.6
0.8
θ = θ1 = θ2
Pri
cespa
andpw
pw
pa
Figure 2. Comparison of equilibrium prices to the the price that maximizesindustry profits, where this latter price is 0.6 and given by the horizontal line,with linear demand Q(p) = 1 − p and equal costs c1 = c2 = 0.1, and equalconduct parameters θ = θ1 = θ2.
figure shows profits for the two layers when market power is balanced (θD = θU = θ) as this
market power varies from bilateral perfect competition to bilateral monopoly. As is clear,
for low or even moderate levels of θ, industry profits are lower under revenue-sharing than
under wholesale pricing.
Strikingly, even if industry profits decrease under revenue-sharing, revenue-sharing nonethe-
less tends to benefit layer 1 and harm layer 2, as was true in the case of bilateral monopoly.
Similarly, even when industry profits are higher under revenue-sharing, layer 2 is harmed
even though layer 1 benefits. Note that because there is competition amongst both layers,
the classic result from the taxation literature (Suits and Musgrave (1953)) no longer applies
to the assessment of layer 1’s profits.
Proposition 5. Suppose that the order in which layers D and U move is the same under
revenue-sharing as under wholesale pricing. If aggregate demand Q(p) is either log-linear or
constant-elasticity, then:
(1) the layer that sets retail prices (that is, the second-mover) earns lower profits under
revenue-sharing, and
(2) the layer that sets the terms of trade (that is, the first-mover) earns higher profits
under revenue-sharing.
As before, the profitability results can hold for some log-concave demand functions. Suppose
that demand is linear, a canonical log-concave form. Figure 3 shows, in the left-hand panel,
that layer one benefits from revenue-sharing and shows in the right-hand panel that layer
19
two is worse off. Indeed, the decline in profits for layer 2 can be quite dramatic, in some
cases being roughly one-third under revenue-sharing what they are with wholesale pricing.
0 0.2 0.4 0.6 0.8 10
0.05
0.1
Conduct parameter θ = θ1 = θ2
Layer
1’s
pro
fit
Wholesale pricingRevenue-sharing
0 0.2 0.4 0.6 0.8 1
0
0.02
0.04
0.06
Conduct parameter θ = θ1 = θ2L
ayer
2’s
pro
fit
Wholesale pricingRevenue-sharing
Figure 3. Comparison of the profits of industry layers 1 and 2 under whole-sale pricing and revenue-sharing, with linear demand Q(p) = 1−p, equal costsc = c1 = c2 = 0.1, and equal conduct parameters θ = θ1 = θ2.
I now consider a shift from the wholesale model to the agency model. The following result
extends Proposition 3, and explains what happens to the profits of the two layers starting
from the wholesale pricing model in which layer U sets w and then layer D sets p, and
moving to the revenue-sharing model in which layer D sets r and then layer U sets p.
Proposition 6. Under bilateral imperfect competition, if demand is log-concave, log-linear,
or constant-elasticity, then a shift from the wholesale model to the agency model raises the
profits of layer D but lowers the profits of layer U .
2.2. Robustness. The conduct-parameter approach used above has several attractive fea-
tures. First, it allows for a parsimonious representation of bilateral imperfect competition
that is easy to work with both under wholesale pricing and revenue-sharing. Second, the
shape of aggregate demand is central to the analysis, thus providing a close analogue to the
study of bilateral monopoly.
At the same time, the approach above utilizes several strong assumptions. For example,
if static competition in prices between firms offering differentiated products is taken as a
microfoundation, then the conduct parameters may depend on the shape of demand at any
given price rather than being constants. Additionally, the conduct parameter of the layer
that moves first may depend on whether wholesale prices or instead revenue-sharing is used
(although the conduct parameter of the layer that moves second is invariant to this).
20
To investigate whether my results are particularly sensitive, I consider a particular micro-
foundation. Suppose there are two suppliers and two retailers, where each supplier sells
to one retailer—competition occurs across vertical chains. The demand for a given retailer
setting a price p while its rival retailer sets a price p is given by
Q(p, p) =1− β − p+ βp
1− β= 1− p− βp
1− β, for β ∈ [0, 1).
As β becomes close to one the market becomes very competitive and for β = 0 the retailers
are monopolists. The four firms have identical marginal costs c > 0. Contract offers are
private so that, for example, if suppliers move first then they privately offer either a wholesale
price or a revenue share to their downstream partner, which then sets a retail price.
It can be shown that this microfoundation leads to a conduct parameter for the first-mover
that depends on whether revenue-sharing or instead wholesale pricing is in effect. Does this
overturn my earlier results? The equilibrium values are readily computed for different values
of c and β. For a broad range of parameters that I considered, my key results from above
typically, although not always, hold.26
In particular, I found that (i) the equilibrium price under revenue-sharing was always lower
than under wholesale pricing, (ii) the firms that set retail prices were always harmed by a
move to revenue-sharing, but that (iii) when β was large (typically, exceeding 0.8), so that
the market is very competitive and profits are close to zero, the firms that move first could
be worse off under revenue-sharing. Despite (iii), I found that (iv) a move from the wholesale
model to the agency model always benefited downstream firms and harmed upstream firms.
Finally, (v) under revenue-sharing it was always advantageous to move first. Of the five
items above, only (iii) diverges from my theoretical predictions under the conduct-parameter
approach, and then only for extreme parameter values.
3. Most-Favored Nation Clauses
In this section I explore the use of retail price-parity clauses such as MFN clauses. Such
clauses specify that a supplier who is setting retail prices through one retailer cannot charge
a lower price through another retailer, if the first has an MFN clause. Such clauses are
growing in popularity, especially in online markets, and so understanding their effects is
important.27 I show how price-parity clauses can raise retail prices and harm consumers. I
26I let β vary between zero and 0.95 in increments of 0.05, and considered a variety of costs from 0.01 to 0.45(production is only positive for c ≤ 0.5).27One reason for their popularity is likely that violations of such clauses can be detected because retail prices aretypically easily observable, especially relative to more traditional wholesale MFN contracts that stipulate parity inwholesale prices. See Baker (1996) for a broad discussion of such contracts. Also see McAfee and Schwartz (1994)and DeGraba and Postlewaite (1992) for assessments of the ability of such contracts do limit expropriation resultingfrom a supplier’s inability to commit in multilateral negotiations.
21
also describe examples of online agency markets where price-parity clauses have been used.
In the conclusion, I discuss potential pro-competitive effects of such clauses.
Rather than using an approach based on exogenous conduct parameters, I adopt specific
microfoundations. Otherwise, I would have to assume that, for example, price-parity clauses
increase the conduct parameter of certain firms and thereby raise prices. Instead, I will
directly prove that such contracts have the effect of raising prices.
3.1. Retail price-parity clauses raise industry prices. To investigate retail price-parity
clauses, I assume throughout this section that retailers set terms of trade and suppliers set
retail prices. The basic argument for why such clauses raise industry prices is that they
soften competition in the terms of trade between retailers, leading retailers to charge higher
commissions. By effectively raising the costs faced by suppliers, such clauses in turn lead to
higher equilibrium retail prices.
The intuition for why price-parity restrictions soften competition between retailers setting
the terms of trade is as follows. In the absence of such clauses, when a retailer raises
the commission that it charges upstream firms, those upstream firms have incentives to
adjust both the retail price they set at that retailer and the retail price they set at other
retailers. Indeed, suppliers have an incentive to adjust these prices asymmetrically, so that
demand is diverted away from the retailer that has raised its commission. This disciplines
the retailer in question, reducing its incentives to raise commissions. However, price-parity
clauses eliminate the ability of suppliers to divert demand in such a manner, so that such
clauses encourage retailers to raise their commissions.
To build intuition and emphasize the basic technique involved, first I focus on a special case
in which each retailer carries but a single supplier’s product, in which contracts offered by
retailers are privately observed, and in which wholesale pricing is used. Afterwards, I explain
how to generalize.
3.1.1. Price-parity restrictions with exclusive retailers, private contracts, and wholesale pric-
ing. Here I consider a static price-setting model with symmetrically differentiated products
and in which wholesale pricing is used. There are two upstream firms and four downstream
firms. Each upstream firm sells to two downstream firms, each of which is exclusive to a
single upstream firm. Contract offers are private.28
Suppose no MFNs are in effect and consider a given supplier. This supplier is selling its
product through two exclusive retailers. Facing a commission of w from one retailer and w
from the other, this supplier chooses retail prices of p and p, respectively. Let Q(p, p, p, p)
denote the quantity it sells through a downstream retailer given that it charges p at that
28Because each retailer only offers a contract to one supplier, I assume that a supplier who receives an unexpectedoffer from a retailer does not adjust its beliefs about any other offers.
22
retailer and p at the other retailer through which it sells, and given that the other supplier
charges p at each of the two retailers to which it sells. Demand is symmetric across retailers
and satisfies all of the usual properties of differentiated product demand.29 The profits of
the supplier under consideration are
(p− w − cU)Q(p, p, p, p) + (p− w − cU)Q(p, p, p, p). (13)
At a symmetric industry solution, all retailers are setting the same wholesale price w and all
suppliers are setting the same price p through all retailers. As verified in the Appendix, for
such an arbitrary (industry-wide) w, this gives an industry-wide price p characterized by
p− w − cU = − Q(p)
Q1(p) +Q2(p), (14)
where Q(p) is defined as Q(p) = Q(p, p, p, p), and similarly Q1(p) = Q1(p, p, p, p) and
Q2(p) = Q2(p, p, p, p) (note that Q1 6= Q2).30 As I show in the Appendix, this same equation
characterizes the industry-wide price p that would be chosen by suppliers facing an industry-
wide w if MFNs were in effect across the industry. The intuition is simply that, if all retailers
are charging the same wholesale price w, then from the perspective of suppliers it doesn’t
matter if MFNs are in effect or not—suppliers have no incentive to charge asymmetric retail
prices when they face symmetric wholesale prices.
In line with earlier analysis, I assume that the right-hand side of Equation (14) intersects the
left-hand side only once and from above. Let p∗(w) denote the solution to Equation (14),
so that p∗(w) gives the industry-wide retail price that holds given that the industry-wide
wholesale price is w, whether MFNs are in effect or not. Note that p∗(w) is increasing.
I now consider the decisions of retailers. For clarity I reiterate that, for a given retailer,
Q(p, p, p, p) is the demand facing this retailer given that its supplier charges a price p through
it and charges a price p through the other retailer carrying its product, and given that p
is charged by the other supplier through both of its retailers. If this retailer is charging a
wholesale price w, its profits are (w − cD)Q(p, p, p, p). Because contracts are private, when
this retailer selects w only the prices chosen by this retailer’s supplier, p and p, change—that
is, the prices set by the other supplier, given by p, do not change. Nonetheless, both p and
p and their derivatives with respect to this retailer’s wholesale price w are also functions
of the wholesale price w charged by the other retailer carrying this supplier’s product and
of the retail price p set for the other supplier’s product through its retailers. I will thus
29Throughout I assume that the demand system is well-behaved. First-order conditions will uniquely characterizesymmetric equilibrium outcomes, and any needed second-order conditions are satisfied (for example, marginal profitswith respect to any choice variable are decreasing in that choice variable).30The reason that Q1 6= Q2 is that Q1 denotes the change in the demand at a given retailer when its supplier raisesthe price set at that retailer, whereas Q2 is the change in the demand at this retailer when its supplier raises theprice it sets at the other retailer to which it sells.
23
write p(w, w, p, p) and p(w, w, p, p), and write p1(w, w, p, p) and p1(w, w, p, p) to denote the
derivatives with respect to the first argument w.
With no MFNs, the first-order condition governing this retailer’s choice of w is
Q(p, p, p, p) + (w − cD) [Q1(p, p, p, p)p1(w, w, p, p) +Q2(p, p, p, p)p1(w, w, p, p)] = 0. (15)
To be clear, for a retailer that raises its wholesale price w slightly, p1 is the change in the
price set at its own store by its supplier and p1 is the change in the price set through the
other store to which this retailer’s supplier sells. Thus, for this retailer, Q1p1 + Q2p1 is
the total change in demand following an increase in its wholesale price, factoring in how its
supplier will change the price that it charges through its two retailers.
At a symmetric industry solution where all retailers set the same wholesale price w and all
suppliers set the same price p∗(w) through all retailers, the following holds.
w − cD = − Q(p∗(w))
Q1(p∗(w))p1(w,w, p∗(w), p∗(w)) +Q2(p∗(w))p1(w,w, p∗(w), p∗(w)). (16)
Now suppose price-parity is in effect across the industry. This retailer’s profit function differs
in that its supplier must charge the same price pM(w, w, p, p) through both retailers it sells
to, meaning this retailer’s profits are (w− cD)Q(pM , pM , p, p), where pM is a function of the
same variables that p and p are functions of. Differentiating and looking for a symmetric
industry-wide solution in which all retailers set w and all suppliers charge p∗(w) through all
their retailers, the following industry-equilibrium condition holds.31
w − cD = − Q(p∗(w))
Q1(p∗(w))pM1 (w,w, p∗(w), p∗(w)) +Q2(p∗(w))pM1 (w,w, p∗(w), p∗(w)). (17)
The effect of industry-wide price-parity restrictions on the equilibrium wholesale price de-
pends on how the solution to Equation (16) differs from the solution to Equation (17).
Inspecting these equations, it is apparent that these differ to the extent that pM1 differs from
p1 and p1, that is in how such restrictions influence the retail price response of suppliers
to changes in the wholesale price charged by a single retailer. As I show in the Appendix,
p1 > pM1 > p1. This says that price-parity restrictions force a supplier, responding to a
wholesale price change by a single retailer, to compress what its price responses otherwise
would be, which has the effect of causing such a wholesale price increase to result in less de-
mand being diverted from the a retailer raising its wholesale price. This encourages retailers
to raise their wholesale prices, which in equilibrium raises the retail price.
Proposition 7. In the model with exclusive retailers and private contracts, industry-wide
retail price-parity restrictions lead to an increase in retail prices.
31In line with earlier analysis, I assume that the right-hand side of both Equations (16) and (17) intersects theleft-hand side only once, and from above, as the industry-wide wholesale price changes.
24
3.1.2. Price-parity restrictions with multi-product retailers and public contracts. I now gen-
eralize Proposition 7 in a setting with a total of two suppliers and a total of two retailers.
In particular, I show the result is robust to (i) allowing each of two retailers to carry the
products of both suppliers, (ii) allowing public rather than private contracts, (iii) allowing
for either revenue-sharing or wholesale pricing.
It is necessary to introduce slightly different notation. Label the suppliers as 1 and 2, and
the two (in total) retailers as A and B. Let p1 and p2 denote the prices charged at retailer A
by suppliers 1 and 2, respectively, and let p1 and p2 denote the prices charged at retailer B
by suppliers 1 and 2, respectively. Let Π(p1, p1, p2, p2) denote the profits of supplier 1, given
whatever terms of trade suppliers face. Additionally, denote the demand for this supplier’s
good at retailer A by Q(p1, p1, p2, p2). As before, the demand system is symmetric.
The situation is somewhat more complicated than before because the price-adjustment pro-
cess of suppliers (in response to the change in a single retailer’s contract) must capture the
overall strategic adjustment of all suppliers’ prices. That is, because contracts are public,
and also because each supplier sells to each retailer, any change in a single retailer’s contract
will alter all the retail prices.
Given symmetry, it is enough to think about retailer A raising either its wholesale price w or
revenue-share r to both suppliers 1 and 2 (depending on whether wholesale prices or instead
revenue-sharing is in effect), given that retailer B is either offering w or r to both suppliers.
Given that suppliers will behave symmetrically given such terms of trade, it will be that
p1 = p2 (both suppliers set the same price at retailer A) and that p1 = p2 (both suppliers
set the same price at retailer B). When wholesale prices are in effect, denote by p(w, w) this
common price set through A (so that p = p1 = p2) and denote by p(w, w) this common price
set through B (so that p = p1 = p2). In an abuse of notation, when revenue-sharing is in
effect I will also denote these prices by p and p, although their arguments are r and r.32
The key to understanding the effect of price-parity restrictions is understanding how they
influence retail price adjustments in response to a change in a single retailer’s terms of trade.
Hence, let p1(w, w) denote the change in the price p(w, w) (that both supplier 1 and 2
charge through retailer A), let p1(w, w) denote the change in the price p(w, w) (that both
suppliers set through retailer B), and let pM1 (w, w) denote the common change in all retail
prices when price parity is in effect, all in response to an increase in the wholesale price w
offered by A to both suppliers. In an abuse of notation, when revenue-sharing is in effect
I let p1(r, r), p1(r, r), and pM1 (r, r) denote the changes in these prices in response to a small
increase in r by retailer A.
32In particular, the function p under wholesale pricing is different from the function p under revenue-sharing, andthe same is true for p.
25
A sufficient condition for price-parity clauses to raise prices is that when a retailer offers
terms that are more advantageous to suppliers, suppliers do not change prices in a way that
diverts demand to the retailer’s rival. For the case of wholesale prices, this is equivalent to
a retailer losing demand if it raises its wholesale price, and this is most conveniently written
as p1 > p1: when retailer A raises its wholesale price, suppliers respond by raising the retail
price at A by more than the retail price is raised at retailer B. Under revenue-sharing,
an increase in r will tend to lower prices by lowering the perceived marginal costs cU/r of
suppliers. Thus, the relevant sufficient condition is that p1 < p1: a retailer who gives a
supplier a larger share of revenue causes a price decrease at its own store that dominates
any price decrease at the other retailer.
Proposition 8. Consider the model with public contracts in which each of two suppliers
sells through each of two retailers. Suppose that, when a retailer offers better terms to its
suppliers, the corresponding price effects do not divert demand to the retailer’s rival. That
is, suppose it is the case that p1 > p1 (under wholesale pricing) or that p1 < p1 (under
revenue-sharing). Then industry-wide retail price-parity restrictions increase retail prices.
The conditions to ensure that, for example, p1 > p1 can readily be stated in terms of stability
conditions for the underlying profit functions of suppliers. In particular, using the work in
the proofs of Propositions 7 and 8, at a symmetric equilibrium (w = w and all retail prices
equal), p1 and p1 are readily derived in terms of partial derivatives of a supplier’s profit
function. Roughly speaking, so long as a supplier’s marginal profits from increasing its price
at a given retailer are more strongly influenced by price changes at that retailer than at the
other retailer, and so long as an increase in all prices tends to lower the marginal gains from
a price increase at a given retailer, then p1 > p1 (under wholesale pricing).33
3.2. Antitrust cases. In this section I describe markets in which retail price-parity clauses,
used in conjunction with the agency model, were investigated by antitrust authorities either
in the EU or the US or both. In all cases, a key concern of authorities was that higher retail
prices either could or did result.
33In particular, let Π represent the profits of, say, supplier 1. Then
p1 =[Πp1p1 + Πp1p2 ]Q1 − [Πp1p1 + Πp1p2 ]Q2
[Πp1p1 + Πp1p2 ]2 − [Πp1p1 + Πp1p2 ]2, and p1 =
[Πp1p1 + Πp1p2 ]Q2 − [Πp1p1 + Πp1p2 ]Q1
[Πp1p1 + Πp1p2 ]2 − [Πp1p1 + Πp1p2 ]2.
Because Q1 < 0 < Q2 and Q1 is larger in magnitude than Q2, the following stability-type conditions ensure p1 > p1,evaluated at equal wholesale and retail prices. First, [Πp1p1 + Πp1p2 ]2 − [Πp1p1 + Πp1p2 ]2 > 0. Second, [Πp1p1 +Πp1p2 ] + [Πp1p1 + Πp1p2 ] < 0. To interpret these, note that Πp1 is the marginal profitability of supplier 1 increasingits price at retailer A. The first condition basically says that the magnitude of the effect on this marginal profitabilityfrom an increase in both suppliers’ prices at this retailer (given by p1 and p2) dominates the effect from an increasein both suppliers’ prices at the other retailer (given by p1 and p2). The second condition ensures that the overalleffect of increasing all of these prices is to reduce the marginal profitability of an increase in p1; in this sense, thesecond condition is similar to the assumption of concave profits in a monopoly pricing problem.
26
The Amazon Marketplace is an online sales platform implemented by Amazon which covers
a wide variety of products and accounts for over 40% of products sold by Amazon.34 Amazon
sets a revenue-sharing contract and suppliers set the retail price that consumers see when they
visit Amazon’s website, sharing with Amazon revenue from completed sales. Additionally,
Amazon has used price-parity clauses.
Both the Bundeskartellamt (Germany’s antitrust agency) and the UK’s Office of Fair Trade
(OFT) investigated Amazon’s use of such clauses. The concern was essentially as outlined
in my analysis above.35 Amazon agreed to stop using such clauses in the affected markets.
The OFT also investigated the use of MFN contracts in the private motor insurance market.
In that market, private motor insurers (PMIs) set the price of their products sold through
price-comparison websites (PCWs) and MFN clauses are used. In its Provisional Findings
Report (December 17, 2013), the OFT provides much the same argument that I have made
above, in particular that MFNs stifle competition in commissions by PCWs, ultimately
leading to higher retail prices. They also present direct evidence that such clauses limit
commission competition.36
Another investigated market is that for online hotel room booking. Leading online travel
agencies such as Expedia and Booking.com take a share of revenue from bookings of rooms
on their websites, where the prices are set by the owners of the hotel rooms. However, there is
some scope for websites to discount offerings, for example by bundling hotel rooms with other
travel-related products and offering an advantageous price on the bundle. Both Expedia and
Booking.com imposed a combination of MFN clauses and resale price maintenance (RPM)
agreements on upstream suppliers. Both the OFT and its French counterpart, The General
Directorate for Competition Policy, Consumer Affairs and Fraud Control, have expressed
concerns about the contractual provisions used by the booking firms (in the US, a private
lawsuit is currently underway).37 As part of an agreement reached in January, 2014, these
firms agreed to stop using such contracts in the UK. However, this agreement was later
overturned and the case was closed on the grounds of administrative priorities.38
The most closely watched market was that for e-books. In 2010, Apple entered the e-book
market as it introduced its tablet computer, the iPad. Prior to Apple’s entry, Amazon was
34See “OFT minded to drop investigation into Amazon pricing policies,” Financial Times, August 29, 2013.35In an August 29, 2013 announcement, the OFT noted the primary concern related to such clauses, saying, “Inparticular, such policies may raise online platform fees, curtail the entry of potential entrants, and directly affect theprices which sellers set on platforms (including their own websites), resulting in higher prices to consumers.”36The report reads, “There is direct evidence that wide MFNs harm competition between PCWs. Notably, one PCWhas tried to reduce its commission fees with motor insurance providers in exchange for lower premiums. However,motor insurance providers were unable to accept the offer due to the presence of wide MFNs in their contracts withother PCWs. A number of motor insurers also told us they had, as a result of wide MFNs, withdrawn from, or didnot consider, such offers from PCWs.”37See, for example, “Expedia, Starwood Ask Judge to Dismiss Price-Fixing Suit,” Bloomberg, December 17, 2013.38In France, specifically for hotel bookings, MFNs of all types (including both so-called narrow and wide MFNs) havebeen banned. Germany has banned all MFNs, not only those affecting hotel bookings.
27
the the only significant player in the market, selling e-books for its dedicated e-book reader,
the Kindle.
Publishers had been unhappy dealing exclusively with Amazon. One reason is that they
thought Amazon’s pricing was contributing to the decline of profits from other formats and
channels, such as hardcover and paperback books sold through brick and mortar retailers.
Apple convinced publishers to adopt the agency model as a condition of its entry into e-
book retailing, and publishers then pressured Amazon to do so as well. Apple also secured
an MFN contract for itself prior to entering. Following Apple’s entry, the price of many
e-books significantly increased.
In April, 2012, the US Department of Justice accused Apple and five major publishers of
conspiring to increase the price of e-books, and a similar investigation began in the EU.
Propositions 7 and 8 predict this increase in retail prices.39 Indeed, the price increase caused
by MFNs under the agency model may be sufficient to outweigh the price decrease that is
predicted to occur from the shift to the agency model from the wholesale model. That is, it
is MFNs not the agency model itself that may have caused the increase in prices.
4. Conclusion
I have presented an analysis of vertical relations in markets where there may be imperfect
competition at both layers of the supply chain. The model is general enough to allow for
changes in which layer sets retail prices and for either wholesale prices or instead revenue-
sharing to be used to intermediate trade. I also examined retail price-parity restrictions.
The broad message from my analysis is that revenue-sharing, the agency model, and retail
price-parity clauses are weapons that firms can use to either shift profits across the supply
chain or to raise retail prices. A move to revenue-sharing tends to raise the profits of the
firms setting terms of trade, while the profits of other firms tend to decrease. Consumers
are better off, at least in the absence of price-parity restrictions. This result is strengthened
when a switch from the wholesale model to the agency model is considered, so that retailer
profits increase but supplier profits fall. Price-parity restrictions, when imposed within the
agency model, raise prices and may erase the gains that otherwise would have accrued to
consumers from the agency model being in place instead of the wholesale model.
Although the analysis above indicates how MFNs may raise prices and harm consumers, it
is not hard to imagine how MFNs might also be pro-competitive. For example, MFNs might
39A potential criticism of this interpretation hinges on the question of whether revenue-sharing contracts raise pro-ducers’ perceived marginal costs in the e-book market, given that one might feel such marginal costs equal zero.However, suppliers have alternative formats and channels, such as paperback or hardcover books sold through brickand mortar retailers, in which they can sell their content. Such alternatives constitute a positive opportunity costof selling an e-book, and it is straightforward to show that this opportunity cost plays the same role as a positivemarginal cost in the existing analysis.
28
encourage investments. This force seems particularly relevant for markets where the retail
landscape is changing rapidly. In particular, many new online retailers have appeared in
recent years, and some of them have chosen to use price-parity restrictions. Some of these
retailers have faced substantial costs of developing their online platforms and it is possible
that price-parity restrictions have encouraged their entry and benefitted consumers. For
instance, Amazon and Apple played important roles in building the e-book market, investing
not only in online stores but in hardware devices that encouraged e-book adoption.40 It is
unclear whether Apple would have made these investments without the guarantees provided
by MFNs, which possibly would have left Amazon unchecked and consumers and publishers
with fewer options. In other words, it is possible that the entry-inducing effects of MFNs
played a role in the e-book market.
Closely related, price-parity restrictions may also have pro-competitive effects if they en-
courage efficient investments among retailers and suppliers. For example, suppose that there
were a single vertically integrated supplier and a single independent retailer. If the inde-
pendent retailer must make investments that benefit the supplier, the incentives for such
incentives may be blunted if the supplier undercuts the retailer through its own retail outlet.
A price-parity agreement may prevent such behavior and possibly benefit consumers.
One limitation of my analysis is that it assumes that all firms use the same business model.
For example, the agency model or instead the wholesale model is used throughout the indus-
try. But of course this may be an individual choice. Consider a retailer pondering enforcing
the agency model for transactions through its store. Fixing all prices at other stores, my
work suggests such a move should be profitable, and indeed likely to draw consumers from
other stores due to decreased prices.
A downside of such a unilateral move is that a given supplier may be tempted to abandon
the retailer if staying with the retailer would lower this supplier’s profits. That is, to the
extent that a supplier can abandon a retailer and not lose sales in the process, doing so may
become attractive if the retailer tries to seize too much of the surplus. On the other hand, it
is far from obvious that a supplier who abandons a retailer will not lose sales. For instance,
if there are multiple suppliers and differentiation amongst retailers is high, a supplier that
withdraws its products from one retailer may not induce many customers to switch retailers,
because those customers may instead simply purchase a product sold by the original retailer
but made by a different supplier. Another constraint on the ability of consumers to switch
relates to their awareness of products. For instance, if consumers visit retailers to learn about
which products they want, then it may be the case that a supplier who abandons a retailer
also gives up the ability to inform consumers (who visit that retailer) about its product.
40See Gaudin and White (2014a) for further analysis of the impact of such devices.
29
This may limit the number of consumers who will switch retailers to follow a supplier who
abandons a retailer, even if the costs of switching retailers are low.
Another limitation of my analysis is that the contracts are restrictive in that they specify only
a revenue-sharing term or a wholesale price. Allowing more general contracts would likely
lead to different outcomes. Nonetheless, the contracts that I have allowed are commonly
observed and so it is important to understand their effects.
5. Appendix: Omitted Proofs
Here are proofs that were omitted from the text.
Proof of Lemma 1: First recall that if firm 1 chooses price p when offered revenue-share
r, its profits equal (rp − c2)Q(p). Because a decrease in r lowers these profits pointwise in
p, it follows that a decrease in r lowers the optimized profits of firm 2.
Let pm denote the price that maximizes industry profits. Suppose pa < pm. Then firm 1
could instead set pa = pm, which would raise industry profits. But so raising pa also entails
lowering the value of r that is offered to firm 2. Hence, firm 2’s profits decline, and because
industry profits are up, firm 1 must have higher profits. Hence pa ≥ pm.
To show that pa is strictly greater than pm, suppose r is such that pa = pm and consider
a small decrease in r. This has a first-order negative effect on firm 2’s profits (given by
−pmQ(pm) < 0, using the envelope theorem). A small change in r causes a small change in
pa, and hence has zero first-order effect on industry profits (because pm maximizes industry
profits). Hence, by lowering r, industry profits have not changed to first-order but there has
been a first-order decrease in firm 2’s profits, implying that firm 1’s profits have increased. �
Proof of Lemma 2: The wholesale price is invariant to move order but the price under
revenue-sharing may depend on the move order. Suppose that the move order under revenue-
sharing is such that the higher of the two possible values of pa obtains. I will now show that
this price pa must be smaller than pw. Suppose for the sake of contradiction that pa ≥ pw.
Under the maintained assumption that λ(p)+λ(p)(1−λ′(p)) has a slope less than one, there
is a unique solution to Equation (2) and so it must be that
pa − c1 − c2 ≥ λ(pa) + λ(pa)(1− λ′(pa)). (18)
On the other hand, the revenue-sharing optimality condition from Equation (6) indicates
that it must be that
pa − c1 − c2 = λ(pa)
[1 + c2
pa(1− λ′(pa))(pa − λ(pa))2
].
30
Thus, a contradiction exists if
λ(pa) + λ(pa)(1− λ′(pa)) > λ(pa)
[1 + c2
pa(1− λ′(pa))(pa − λ(pa))2
].
This condition reduces to
c2pa < (pa − λ(pa))2. (19)
Because pa ≥ pw by assumption, and appealing to (a rearrangement of) Equation (18),
c2 ≤ pa − λ(pa)− λ(pa)(1− λ′(pa))− c1 < pa − λ(pa)− λ(pa)(1− λ′(pa)).
Using this inequality, a sufficient condition for Equation (19) to hold is that
pa[pa − λ(pa)− λ(pa)(1− λ′(pa))] ≤ (pa − λ(pa))2 ⇐⇒ paλ′(pa) ≤ λ(pa).
This establishes a contradiction, and so it must be that pa < pw. �
Proof of Proposition 1: Suppose firm 1 set r = c2/(c1 + c2). Firm 2’s profits would be
(rp− c2)Q(p) = r(p− c2
r
)Q(p) = r(p− c1 − c2)Q(p),
which is proportional to industry profits and hence would lead firm 2 to select the price that
maximizes industry profits. This expression also makes it clear that firm 2 would receive a
share r of maximized industry profits, from which it follows that firm 1 can ensure itself of
at least a share 1− r = c1/(c1 + c2) of maximized industry profits. However, from Lemma 1,
it is optimal for firm 1 to induce a strictly higher retail price (and hence a lower value of
r), and thereby earn strictly higher profits: in equilibrium, firm 1 earns strictly more than a
share c1/(c1 + c2) of maximized industry profits.
This means that for any particular firm i ∈ {D,U}, under revenue-sharing i must earn higher
profit if it moves first than if it moves second. For example, if U moves first it will claim
more than a share cU/(cD + cU) of maximized industry profits, and if instead D moves first
then U will secure less than a share 1− cD/(cD + cU) = cU/(cD + cU) of these profits. �
Proof of Proposition 2: First note that, because log-linear and constant-elasticity de-
mands are weakly log-convex, under wholesale pricing any given firm does better if it moves
second rather than first (this follows from the discussion preceding Proposition 1). Denote
this by WS2 ≥ WS1. Second, note that Proposition 3 implies that, for any given firm and
for a class of demand functions that includes log-linear and constant-elasticity, a firm prefers
to move first under wholesale pricing rather than moving second under revenue-sharing. De-
note this by WS1 > RS2. Combining these two observations implies that, for any given
firm, WS2 > RS2. That is, fixing move order and starting from a position of moving second
31
under wholesale pricing, a move to revenue-sharing harms the second-mover. �
Proof of Proposition 3: Consider U . Under the wholesale model, U moves first and it
could (suboptimally) induce pa, where in this proof by pa I mean the equilibrium price under
the agency model (that is, under revenue-sharing when D is the first-mover). Because D
would impose a markup of λ(pa) in this case, U would be left with a margin of pa− c1− c2−λ(pa). Using the optimality condition associated with the agency model, given in Equation
(6), and setting c2 = cU (because that equation is associated with the agency model), the
markup of U would equalcUp
aλ(pa)(1− λ′(pa))(pa − λ(pa))2
.
At this price Q(pa) units would sell, the as sell in equilibrium under the agency model. I
will now show that U ’s per-unit margin above exceeds what U receives in equilibrium when
the agency model is in effect. In that case, U , as the second-mover, receives an equilibrium
margin of rλ(pa). Using Equation (4) and the fact that c2 = cU in this equation (because U
is the second mover in the agency model), U ’s equilibrium margin equals
cUλ(pa)
pa − λ(pa).
Thus, U would earn a higher margin with wholesale pricing (under the proposed pricing) if
cUpaλ(pa)(1− λ′(pa))(pa − λ(pa))2
≥ cUλ(pa)
pa − λ(pa)⇐⇒ pa(1− λ′(pa)) > pa − λ(pa)⇐⇒ paλ′(pa) ≤ λ(pa).
From this, it follows that U is worse off under the agency model.
Now consider D. Suppose under the agency model that D suboptimally chose r to implement
the equilibrium price under the wholesale model. This would require setting r such that the
perceived marginal cost of firm U is
cUr
= cD + cU + λ(pw)(1− λ′(pw),
where the fact that this would induce pw follows from inspection of the relevant optimality
conditions given in Equations (1), (2), and (3). U would therefore receive a margin of
rλ(pw) =cUλ(pw)
cD + cU + λ(pw)(1− λ′(pw)).
I will now show that this margin is less than U would receive under the wholesale model,
from which it follows that D must receive a higher margin under the proposed pricing
under the agency model than it does in equilibrium under the wholesale model. Under
the wholesale model, U (as the first-mover) obtains a margin λ(pw)(1 − λ′(pw)). Thus the
32
following condition implies that U earns a higher margin under the wholesale model than
under the proposed agency pricing.
cUλ(pw)
cD + cU + λ(pw)(1− λ′(pw))< λ(pw)(1− λ′(pw))⇐⇒
cU < (1− λ′(pw))(cD + cU + λ(pw)(1− λ′(pw))).
For log-concave demand, λ′(pw) < 0 and for log-linear demand λ′(pw) = 0. For either case,
it is readily seen that this inequality holds. Now, with constant-elasticity demand, pw can
be explicitly solved using Equation (2), giving pw = (cD + cU)/(1− σ)2. Because λ′(p) = σ,
the right-hand side of the inequality above reduces to cD + cU and so the condition holds. �
The following Lemma is used in the proof of Proposition 4.
Lemma 5. Let p be defined implicitly by
p− cD − cU = max[θD, θU ]λ(p).
Suppose that the retail price under revenue-sharing exceeds p, regardless of which layer moves
first. Then under revenue-sharing each layer prefers to be the first-mover.
Proof. I require a preliminary result, which is that the profits of the second-mover are declin-
ing in the price p that the first-mover induces. Equivalently, the profits of the second-mover
are increasing in the revenue share r that it is offered. The profits of the second-mover are
(rp− c2)Q(p) =c2θ2λ(p)
p− θ2λ(p)Q(p),
where the expression for the second-mover’s margin (given r) from Equation (9) in the text
has been used. Ignoring the positive constant c2θ2, the derivative with respect to p is[pλ′(p)− λ(p)
(p− θ2λ(p))2
]Q(p) +
[λ(p)
p− θ2λ(p)
]Q′(p).
Dividing by Q(p) > 0, using the definition of λ(p), and multiplying by (p−θ2λ(p))2, the sign
of the above expression is equivalent to the sign of
pλ′(p)− λ(p)− p+ θ2λ(p) = −λ(p)(1− θ2)− p(1− λ′(p)).
This is negative, using the maintained condition that λ′(p) < 1. Hence, the profit of the
second-mover is decreasing in the price that the first layer induces.
With this preliminary result, I now present the main arguments. I will prove that going
first under revenue-sharing is preferable to going second from the standpoint of (layer) D
(with relabeling the result also holds for (layer) U). Let p1 denote the equilibrium retail
33
price when D moves first, and let p2 denote the equilibrium price when D moves second. By
assumption, p ≤ min[p1, p2].
The proof requires separate but very similar analysis of two distinct cases. First, suppose
that p1 ≥ p2. In this case, if D moves first then by quasi-concavity D’s profits are higher
than if it instead set the retail price equal to p2. Suppose nonetheless that D were setting
p = p2, and let r1 be the value of r associated with this price. Also, let r2 be the equilibrium
value of r when U moves first. Then, a sufficient condition for D to prefer moving first is
that
[(1− r1)p2 − cD]Q(p2) ≥ [r2p2 − cD]Q(p2)⇐⇒ 1− r1 ≥ r2.
I will now derive bounds on r1 and r2. Let r1 be the value of r that D would set to induce
U to choose price pU , where pU is such that
pU − cD − cU = θUλ(pU).
That is, r1 satisfies
pU −cUr1
= θUλ(pU) = pU − cD − cU ⇐⇒ r1 =cU
cD + cU.
Next I derive a bound on r2 (the equilibrium value of r when U moves first). Let r2 be the
value of r that leads D to set price pD, where pD satisfies
pD − cD − cU = θDλ(pD).
That is, r2 satisfies
pD −cDr2
= θDλ(pD) = pD − cD − cU ⇐⇒ r2 =cD
cD + cU.
To complete this part of the proof, first note that pU ≤ p ≤ p2 and so it must be that r1 > r1.
Similarly, pD ≤ p ≤ p2, and so it must be that r2 > r2. Therefore,
1− r1 ≥ 1− r1 =cD
cD + cU= r2 > r2.
Thus, 1− r1 > r2: D prefers to set the terms of trade, at least when p1 ≥ p2.
To complete the proof, I must show that this conclusion also holds when p1 ≤ p2. As I
showed at the beginning of this proof, D’s profits as a second-mover would be higher if U
instead set a lower price, namely if it setp = p1. To avoid new notation, I now redefine r1
and r2 as follows. Let r2 be the value of r that would lead D to choose a price of p = p1,
and let r1 be the equilibrium value of r when D moves first.
Hence, a sufficient condition for D to prefer moving first is that
[(1− r1)p1 − cD]Q(p1) ≥ [r2p1 − cD]Q(p1)⇐⇒ 1− r1 ≥ r2.
34
Notationally, this is the same condition as before, and indeed the rest of the proof is essen-
tially identical to the earlier case. In particular, to find a bound on r1, consider the value
r1 that would induce U to set price pU , and to find a bound on r2, consider the value r2
that would induce D to set price pD. This leads to the same bounds as before. Moreover,
as before, p1 ≥ p by assumption, and by construction p ≥ max[pD, pU ]. Thus, r1 ≥ r1 and
r2 ≥ r2. This leads to the conclusion that 1− r1 > r2, proving the result. �
Proof of Proposition 4: I begin with the case of θD = θU . To prove this result, I will
show that the assumed conditions allow the application of Lemma 5, using p as defined in
the statement of that lemma. Using the expression for the equilibrium price under revenue-
sharing given in Equation (12), a sufficient condition that the price under revenue-sharing
exceeds p is that 1− θλ′(p) > 0, which is true given the maintained condition that λ′(p) < 1.
I now turn to the case where demand is log-linear, and consider the result from the perspective
of layer D. The result follows from Propositions 5 and 6. Combined, those results show
that when demand is log-linear and D is moving second, moving from wholesale pricing to
revenue-sharing harms D, but moving from wholesale pricing to revenue-sharing while also
becoming the first-mover benefits D. It follows that, under revenue-sharing, becoming the
first rather than second-mover must be beneficial.
The final case is of constant-elasticity demand (Q(p) = p−1/σ, λ(p) = σp, for σ ∈ (0, 1)).
Direct computation using Equation (12) gives
pa =c1 + c2 − c1σθ2
(1− σθ1)(1− σθ2).
Using the fact that the equilibrium value of r is such that r = c2/(p − θ2λ(pa)), the profits
of a firm moving first or instead second can be computed (note that the value of pa and of r
depend on which firm moves first). Consider the result from the perspective of D. Directly
computing profits, the ratio of D’s profits when it is the first-mover to D’s profits when it is
the second-mover is given by
cU + cD(1− σθU)
cD(1− σθU)
[cD + cU(1− σθD)
cU + cD(1− σθU)
]1/σ
. (20)
This expression is increasing in θU and decreasing in θD. In particular, the derivative with
respect to θU is [cD + cU(1− σθD)
cU + cD(1− σθU)
]1/σσcU + cD(1− σθU)
cD(1− σθU)2> 0,
and the derivative with respect to θD is
−[cD + cU(1− σθD)
cU + cD(1− σθU)
](1−σ)/σcU
cD(1− σθU)< 0.
35
Thus, a sufficient condition for D to prefer moving first is that Equation (20) exceeds one,
after setting θU = 0 and θD = 1. This sufficient condition is[cD + cU(1− σ)
cD + cU
]1/σ
≥ cDcD + cU
. (21)
When σ = 1, this condition holds. I will now show that the left-hand side of Equation (21)
is decreasing for σ ∈ (0, 1), which will prove the result. The sign of the derivative of the
left-hand side of Equation (21) is the same as the sign of
− cUσ
cD + cU(1− σ)− log
(cD + cU(1− σ)
cD + cU
).
At σ = 0, this equals zero. But the derivative of this expression with respect to σ is
− c2Uσ
(cD + cU(1− σ))2< 0.
Thus, the left-hand side of Equation (21) is decreasing on for σ ∈ (0, 1). Because I already
showed that Equation (21) holds for σ = 1, this equation holds for all σ ∈ (0, 1) and the
result follows. �
Proof of Lemma 4: Note that this proof uses different rearrangements of the right-hand
side of Equation (11) depending on the circumstance. Now, the proposition is true even if
the move order differs depending on which sales model is in effect. The reason is that pw does
not depend on move order. Under either order, the inequality just below arises as a sufficient
condition to establish a contradiction to the supposition that pa ≥ pw. So assuming, and
using Equations (11) and (12), a contradiction arises if
c2θ2λ(pw)
[pw(1− θ1λ
′(pw)) + λ(pw)(θ1 − θ2)
(pw − θ2λ(pw))2
]< θ2λ(pw)(1− θ1λ
′(pw)).
Because pa ≥ pw by assumption, using (and slightly re-writing) Equation (11) implies that
c2 = pw − c1 − θ1λ(pw)− θ2λ(pw)(1− θ1λ′(pw)) < pw − θ1λ(pw)− θ2λ(pw)(1− θ1λ
′(pw)).
Thusly replacing c2 from the previous inequality, and rearranging, shows that a contradiction
arises if
[(pw − θ2λ(pw))− θ1λ(pw)(1− θ2λ′(pw))][(pw − θ1λ(pw)) + θ1λ(pw)− θ2p
wλ′(pw)]
≤ (1− θ1λ′(pw))(pw − θ2λ(pw))2.
Expanding this and collecting terms containing pw − θ2λ(pw), this reduces to the condition
θ1λ(pw)(1− θ2λ′(pw))(pwλ′(pw)− λ(pw)) ≤ 0.
36
Because θ1λ(pw)(1 − θ2λ′(pw)) is the margin of layer 1, it is positive, and so this condition
reduces to pwλ′(pw)− λ(pw) ≤ 0. �
Proof of Proposition 5: I begin with the case of log-linear demand, and also with layer 1.
Because log-linear demand satisfies the assumption of Lemma 4, pa < pw, and so it follows
that a sufficient condition for layer 1 to earn higher profits under revenue-sharing is that its
margin under revenue-sharing is weakly higher than under wholesale pricing.
By definition (Equation 10), layer 1’s equilibrium margin is given by θ1λm′1(p), where p = pa
or instead pw as appropriate. From Equation (8), m′1(p) = 1 − m′2(p), where the func-
tional form of m2(p) depends on whether revenue-sharing is in effect (as I further detail
below). Thus, if m′2(pa) ≤ m′2(pw) (where these functions are evaluated using the appropri-
ate functional forms depending on the sales model), then layer 1 earns a higher margin under
revenue-sharing and the result follows. I now compute these derivatives for the appropriate
functional forms.
As described following Equation (8) in the text, under wholesale pricing m2(p) = θ2λ, so
that m′2(pw) = 0 in this case. Under revenue-sharing
m2(p) =θ2c2λ
p− θ2λ,
so that
m′2(pa) = − θ2c2λ
(pa − θ2λ)2< 0.
This proves that, for log-linear demand, layer 1 earns higher profits and margins under
revenue-sharing (constant-elasticity demand is considered below).
I now show that layer 2 earns lower profits and margins under revenue-sharing (first for the
case of log-linear demand). Note that because pa < pw, showing that layer 2 earns lower
profits under revenue-sharing implies that it also earns a lower margin under revenue-sharing.
Because pa < pw, by the assumed quasiconcavity of layer 2’s profits in p, an underestimate
of layer 2’s profits under wholesale pricing is (pa− c2−w)Q(pa), where w is the equilibrium
value which is w = c1 + θ1λ(pw)(1− θ2λ′(pw)) = c1 + θ1λ. To prove the result it is therefore
sufficient to compare this unit margin (given by pa− c2−w = pa− c1− c2− θ1λ) to the unit
margin under revenue-sharing. The margin under revenue-sharing is θ2rλ, and as explained
in the text r = c2/(pa − θ2λ).
Therefore, a sufficient condition for the result to be true is that
pa − c1 − c2 − θ1λ >θ2c2λ
pa − θ2λ.
37
Using Equation (12) to rewrite the left-hand side, and then dividing through by θ2c2λ and
multiplying through by (pa − θ2λ)2, this condition reduces to
pa + λ(θ1 − θ2) > pa − θ2λ⇐⇒ θ1λ > 0
which is true.
I now consider the constant-elasticity case (Q(p) = p−1/σ, λ(p) = σp, for σ ∈ (0, 1)). Direct
computation using Equations (11) and (12) gives
pa =c1 + c2 − c1σθ2
(1− σθ1)(1− σθ2), and pw =
c1 + c2
(1− σθ1)(1− σθ2).
Consider the second-mover. Its profits under wholesale pricing are
θ2λ(pw)Q(pw) = θ2σ(pw)(σ−1)/σ,
given that Q(p) = p−1/σ. Under revenue-sharing, its profits are
rθ2λ(pa)Q(pa) =c2θ2λ(pa)
pa − θ2λ(pa)Q(pa) =
c2θ2σ
1− θ2σQ(pa) =
c2θ2σ
1− θ2σ(pa)−1/σ.
Using the values for pa and pw explicitly given above, the ratio of profits under wholesale
pricing to revenue-sharing isc1 + c2(1− σθ1)
c2(1− σθ1)> 1.
This proves the result for the second-mover. Now consider the first-mover, whose profits
under revenue-sharing are
[(1− r)pa − c1]Q(pa) =
[pa − c2p
a
pa − θ2λ(pa)− c1
]Q(pa) =
[pa − c2
1− σθ2
− c1
](pa)−1/σ.
Under wholesale pricing, its profits are
θ1λ(pw)(1− θ2λ′(pw))Q(pw) = θ1σ(1− θ2σ)(pw)(σ−1)/σ.
Using the values for pw and pa given above, taking the ratio of profits under revenue-sharing
to those under wholesale pricing gives(1
1− σθ2
)(c1 + c2
c1 + c2 − c1σθ2
) 1−σσ
> 1,
where the inequality follows because each of the two terms that are multiplied together on
the left-hand side is greater than one. This completes the proof. �
Proof of Proposition 6: Note that this proof uses facts in the text given following Equa-
tion (8) and that all demand functions considered here satisfy the properties of Lemma 4,
so that pa < pw.
38
I begin with D for the case of (weakly) log-concave demand. Because pa < pw more units are
sold under the agency model and so to show that a switch from the wholesale model to the
agency model benefits (layer) D it is enough to show that D’s per-unit margin is higher under
the agency model. D’s margin under the wholesale model is θDλ(pw) ≤ θDλ(pa), where the
inequality follows from the fact that demand is log-concave (so that λ(p) is non-increasing)
and pa < pw . Under the agency model, from Equation (10) the margin of D is
θDλ(pa)m′1(pa) = θDλ(pa)[1− m′2(pa)] = θDλ(pa)
[1− θUcU
(paλ′(pa)− λ(pa))
(pa − θUλ(pa))2
]. (22)
Because paλ′(pa)− λ(pa) < 0 for weakly log-concave demand, this margin exceeds θDλ(pa),
which as explained above is an upper bound on the margin of D under the wholesale model.
Because D sells more units under the agency model, the result follows.
Now consider D when demand is constant-elasticity (Q(p) = p−1/σ, λ(p) = σp, for σ ∈ (0, 1)).
Here it is necessary to directly compute the profits of D. Under the agency model, Equation
(22) above applies to D’s margin, but specializing to the case of constant-elasticity demand
gives D’s margin as θDλ(pa) = θDσpa, and hence total profits of θDσp
aQ(pa). Under the
wholesale model, D’s margin is θDλ(pw) = θDσpw, for total profits of θDσp
wQ(pw).
Direct computation using Equations (11) and (12) gives, for constant-elasticity demand,
pa =cD + cU − cDσθU
(1− σθD)(1− σθU), and pw =
cD + cU(1− σθD)(1− σθU)
.
From this it can be shown that profits are higher under the agency model so long as
cD(1− σθU) + cU < cD + cU ,
which is true.
I now show that U is worse off under the agency model. Because pa < pw, and because layer
1’s profits are assumed to be quasiconcave in p, under the wholesale model an underestimate
of layer U ’s profit is the profit that layer 1 would get if it set p = pa:
m1(pa)Q(pa) = [pa − cD − cU − m2(pa)]Q(pa) = [pa − cD − cU − θDλ(pa)]Q(pa)
= θUcUλ(pa)
[pa(1− θDλ′(pa)) + λ(pa)(θD − θU)
(pa − θUλ(pa))2
]Q(pa),
where the final equality follows from Equation (12), and keeping in mind that that equation
is for the price under the agency model in which U is firm 2. Now consider U ’s profits under
the agency model:
rθUλ(pa)Q(pa) =θUcUλ(pa)
pa − θUλ(pa)Q(pa),
39
so that a sufficient condition for wholesale profits to be higher is that
pa(1− θDλ′(pa)) + λ(pa)(θD − θU)
(pa − θUλ(pa))2≥ 1
pa − θUλ(pa)⇐⇒ θD(paλ′(pa)− λ(pa)) ≤ 0.
This condition is satisfied for all the demand functions considered in this proposition. Hence
U is worse off under the agency model. �
Proof of Proposition 7: I begin by confirming that Equation (14) indeed characterizes
the symmetric equilibrium for any given industry-wide value of w, with or without MFNs.
Without MFNs, the first-order conditions for p and p (from Equation (13)) are
Q(p, p, p, p) + (p− w − cU)Q1(p, p, p, p) + (p− w − cU)Q2(p, p, p, p) = 0, (23)
and
Q(p, p, p, p) + (p− w − cU)Q1(p, p, p, p) + (p− w − cU)Q2(p, p, p, p) = 0. (24)
At a symmetric industry solution with all retailers setting the same wholesale price w and
all suppliers are setting the same price p through all retailers, the margin of a supplier is
p− w − cU = − Q(p)
Q1(p) +Q2(p),
as claimed in the text.
Suppose now that price-parity restrictions are in place, and once again consider a given
supplier facing wholesale prices of w and w. Because this supplier is constrained to set its
retail prices so that p = p, its profits are
(p− w − cU)Q(p, p, p, p) + (p− w − cU)Q(p, p, p, p),
with a single corresponding first-order condition
2Q(p, p, p, p) + [(p− w − cU) + (p− w − cU)] [Q1(p, p, p, p) +Q2(p, p, p, p)] = 0. (25)
At a symmetric industry outcome in retail and wholesale prices, this gives
p− w − cU = − Q(p)
Q1(p) +Q2(p).
This is identical to Equation (14). Thus, facing a given industry-wide wholesale price w,
suppliers set the same markups whether retail price-parity restrictions are in effect or not.
To show that industry-wide price parity raises industry prices, I compare Equations (16) and
(17). These margins are positive, and so the denominators in both expressions are negative.
Hence, given that the right-hand side of these equations is decreasing in the industry-wide
wholesale price w being charged, the downstream margin is larger under retail price-parity
40
restrictions if
Q1(p)pM1 +Q2(p)pM1 > Q1(p)p1 +Q2(p)p1.
Because Q1 < 0 < Q2 and because pM1 > 0,41 a sufficient condition for this is that p1 > pM1 >
p1. I will show that this condition holds, so that indeed price-parity raises the industry-
wide equilibrium wholesale price w. Because p∗(w) is increasing, it will also follow that the
industry retail price is higher under price parity.
I will show that pM1 = (p1 + p1)/2 and also that p1 > p1, which implies the desired con-
dition. To this end, let Π(p, p) denote the profits of an upstream firm, which are given in
Equation (13). Absent price parity, the first-order conditions are Πp = 0 and Πp = 0, using
subscripts to denote the variable of differentiation. A small change in w must maintain these
first-order conditions, so that the following equations hold:
Πpw + Πppp1 + Πppp1 = 0, and Πpw + Πppp1 + Πppp1 = 0.
Solving for p1 and p1 at a symmetric solution, and using the fact that the symmetry of the
problem implies that Πpp = Πpp, gives
p1 =−ΠpwΠpp + ΠpwΠpp
Π2pp − Π2
pp
, and p1 =−ΠpwΠpp + ΠpwΠpp
Π2pp − Π2
pp
.
To find pM1 , note that profits in the case with retail price-parity are also given by Π(p, p), but
of course subject to the constraint that p = p. The relevant first-order condition emerges
from the fact that an increase in p must be accompanied by an equal increase in p, so that
Πp + Πp = 0. To maintain this condition following an increase in w requires that
Πpw + ΠpppM1 + Πppp
M1 + Πpw + Πppp
M1 + Πppp
M1 = 0.
This gives
pM1 =−Πpw − Πpw
2(Πpp + Πpp).
Direct computation gives
p1 + p1
2=
1
2
[(−Πpw − Πpw)(Πpp − Πpp)
Π2pp − Π2
pp
]=
1
2
[−Πpw − Πpw
Πpp + Πpp
]= pM1 .
Thus, to complete the proof it is enough to show that p1 > p1. Using the expressions above
for these functions, and using the standard sufficient condition for optimality Π2pp−Π2
pp > 0
41pM is increasing in w under the maintained regularity conditions that the optimal decisions of firms are uniquelycharacterized by their first-order conditions. For the present situation, this means that (manipulating Equation (25)),a supplier’s first-order condition under MFNs is
p− (w + w)
2− cU = − Q(p, p, p, p)
Q1(p, p, p, p) +Q2(p, p, p, p).
The right-hand side is assumed to intersect the left-hand side from above. Thus an increase in w raises p.
41
(taking advantage of the symmetry of the problem, so that Πpp = Πpp), p1 > p1 if
(Πpp + Πpp)(Πpw − Πpw) = (Πpp + Πpp)(−Q2(p) +Q1(p)) > 0,
where the equality follows from directly computing Πpw and Πpw from Equation (13). Be-
cause Q1(p) < 0 < Q2(p), it follows that −Q2(p) + Q1(p) < 0, so that the inequality holds
if Πpp + Πpp < 0. But this condition is equivalent to the marginal profitability of raising
both p and p being decreasing, which is simply part of the overall sufficient condition for
optimality.42 Hence, Πpp − Πpp < 0, and the conclusion follows. �
Proof of Proposition 8: First I show that the margin that suppliers set under revenue-
sharing is not affected by whether price-parity restrictions are in place, for a given set of
symmetric contracts. Under the case of wholesale pricing, consider, say, supplier 1’s pricing
decision. This supplier’s profits are
(p1 − w − cU)Q(p1, p1, p2, p2) + (p1 − w − cU)Q(p1, p1, p2, p2). (26)
Obtaining the first-order conditions and looking at a symmetric industry equilibrium (w =
w, p = p1 = p2 = p1 = p2), this leads to the analogue of Equation (14),
p− w − cU = − Q(p)
Q1(p) +Q2(p).
Moreover, as before, this is the correct margin whether or not price-parity holds, following
the same steps as for Proposition 7 (imposing p1 = p1). When wholesale prices are in effect,
let p∗(w) denote the price satisfying the above equation, so that p∗(w) gives the price that
would be set by both suppliers given that both suppliers face the wholesale price w, where
p∗(w) is increasing.
Under revenue-sharing, the only difference compared to wholesale pricing is that the per-unit
margins in Equation (26) are replaced by rp1− cU and rp1− cU , where r is the revenue share
offered by retailer A and r is the revenue-share offered by retailer B. But then taking the
same steps as before (that is, differentiating with respect to p1 and p1) leads to the conclusion
that, at a symmetric outcome in which both retailers set the same contract (r = r) and all
retail prices equal p, and with or without price-parity restrictions, the margin of suppliers is
rp− cU = −r Q(p)
Q1(p) +Q2(p). (27)
For arbitrary r, and in an abuse of notation, let p∗(r) denote the solution to Equation (27).
This gives the price that suppliers would charge if they both faced the same revenue-share
42The marginal profits of raising both p and p must be zero: Πp+Πp = 0. Also, these marginal profits must decreasewhen both p and p are further raised: Πpp + Πpp + Πpp + Πpp < 0. The symmetry of the problem means Πpp = Πpp,giving the condition Πpp + Πpp < 0.
42
term r. Note that p∗(r) is decreasing (unlike p∗(w), which is one reason this is an abuse of
notation) because an increase in r lowers the perceived marginal cost of suppliers.
To give a bit more detail and also to prove a fact that is useful below, note that under
price-parity (where it must be that p1 = p1 and p2 = p2) this supplier’s profits are
(rp1 − cU)Q(p1, p1, p2, p2) + (rp1 − cU)Q(p1, p1, p2, p2).
Differentiating with respect to p1, setting p = p1 = p2 and rearranging gives the condition
for upstream equilibrium for arbitrary r and r:
p− 2cUr + r
= − Q(p)
Q1(p) +Q2(p).
The maintained assumption that the right-hand side of the above expression intersects the
left-hand side from above implies that an increase in r offered by A lowers p, so that pM1 < 0.
The next step is showing that pM1 = (p1 + p1)/2, where the function pM1 describes how all
retail prices change following an increase in the wholesale price charged by retailer A (the
case of revenue-sharing is handled below). I first consider wholesale pricing. Given the
symmetry, it is enough to consider, say, supplier 1. That is, letting Πp1 = 0 and Πp1 = 0
denote the first-order conditions of supplier 1, maintenance of these conditions in response
to an increase in w requires that
Πp1w + (Πp1p1 + Πp1p2) p1 + (Πp1p1 + Πp1p2) p1 = 0, (28)
and
Πp1w + (Πp1p1 + Πp1p2) p1 + (Πp1p1 + Πp1p2) p1 = 0. (29)
These conditions are very similar to those derived in the proof of Proposition 7. The only
difference is that the strategic effects associated with this supplier’s rival (supplier 2) changing
its prices are also included. In the first equation above, for example, these terms are given
by Πp1p2p1 and Πp1p2 p1.
If MFNs are in place, then the relevant condition is given by the sum of Equations (28)
and (29), but with p1 and p1 replaced with pM1 . I am considering symmetric equilibria and
demand, so that Πp1p1 = Πp1p1 ,Πp1p2 = Πp1p2 ,Πp1p2 = Πp1p2 (and of course Πp1p1 = Πp1p1).
But then setting pM1 = (p1 + p1)/2 solves the required condition under price-parity.
Now consider revenue-sharing. The logic is identical to that just presented for wholesale
prices, with the following changes. First, the arguments of the functions describing price
responses are no longer given by the wholesale price w but instead by the revenue-sharing
term r that is offered by retailer A. Other than this, the only changes to Equations (28)
and (29) are that Πp1w and Πp1w are replaced with Πp1r and Πp1r. But symmetry ensures
that the various equalities involving second derivatives of Π continue to hold, and I thus
conclude, exactly as above, that pM1 = (p1 + p1)/2.
43
To show that this implies that competition between retailers in terms of trade is softened,
and that retail prices are therefore increased by price-parity clauses, only a few steps remain.
The first of these is the derivation of the downstream margin. The same steps are followed
as before. For example, with wholesale pricing, retailer A’s profits are
(w − cD)[Q(p1, p1, p2, p2) +Q(p2, p2, p1, p1)
].
The reason is that supplier 1 sells Q(p1, p1, p2, p2) units through A, while supplier 2 sells
Q(p2, p2, p1, p1) through A. Let p denote the common price set by suppliers at A and let
p denote the common price set by suppliers at B (that is, p = p1 = p2 and p = p1 = p2).
Profits are proportional to (w− cD)Q(p, p, p, p). Suppressing arguments for compactness and
considering the effect of A raising the wholesale price w it charges both suppliers yields the
first-order condition
Q+ (w − cD) [(Q1 +Q3)p1 + (Q2 +Q4)p1] = 0,
This differs from before to the extent that it accommodates how the prices of all goods
(through any retailer) change following the increase in w. Evaluating at a symmetric industry
solution, and letting for example Q3(p) denote the change in Q(p, p, p, p) from a small change
in the third argument, the margin of retailers is given by
w − cD = − Q(p∗(w))
[Q1(p∗(w)) +Q3(p∗(w))] p1(w,w) + [Q2(p∗(w)) +Q4(p∗(w))] p1(w,w). (30)
Note that I have written Equation (30) to be explicit about the arguments of all functions
so that it is clear exactly what the equilibrium condition involving the retailers is. In line
with earlier assumptions, the right-hand side of the above expression intersects the left-hand
side from above as the industry-wide wholesale price w changes. Hence the question will be
how the relevant expression changes when price-parity is in effect. When it is in effect, the
only change is that p1 and p1 are replaced with pM1 . Thus, if the right-hand side is larger
under price-parity, the equilibrium wholesale price is higher, meaning that (because p∗(w) is
increasing) the equilibrium retail price is higher. Because the denominator in the right-hand
side is negative (ensuring a non-negative margin for retailers), the right-hand side is larger
under price-parity if
(Q1 +Q3)p1 + (Q2 +Q4)p1 < (Q1 +Q3)pM1 + (Q2 +Q4)pM1 .
Because Q1 and Q3 are negative (because they measure the effect on retailer A’s sales when
both suppliers raise the price they set through retailer A), and because Q2 and Q4 are
positive (because they measure the effect on retailer A’s sales when both suppliers raise the
price they set through retailer B), a sufficient condition is that (i) p1 > pM1 > p1, and (ii)
44
pM1 > 0. As before, however, pM1 > 0, and so it is sufficient that p1 > p1.43 This is true by
assumption, and so the equilibrium wholesale price is indeed higher under price-parity.
Now I consider the effect of price-parity under revenue-sharing, in particular considering the
effect on retailers. Consider the profits of retailer A, given by
[(1− r)p− cD][Q(p1, p1, p2, p2) +Q(p2, p2, p1, p1)].
At an outcome where both suppliers charge p through retailer A and p through retailer B,
these profits are proportional to [(1 − r)p − cD]Q(p, p, p, p). Consider this retailer choosing
r, where I will write the first-order condition suppressing the arguments of Q for notational
compactness. Because I am interested in a symmetric outcome (r = r and all prices given
by p), the first-order condition at equilibrium can be written as
[−p+ (1− r)p1]Q+ [(1− r)p− cD] [(Q1 +Q3)p1 + (Q2 +Q4)p1] = 0.
Thus, at a symmetric industry solution where both retailers set revenue shares equal to r,
the margin (1 − r)p − cD of this retailer satisfies the following, where to be as explicit as
possible I write all of the arguments of the functions out fully.
(1− r)p∗(r)− cD =[p∗(r)− (1− r)p1(r, r)]Q(p∗(r))
[Q1(p∗(r)) +Q3(p∗(r))] p1(r, r) + [Q2(p∗(r)) +Q4(p∗(r))] p1(r, r). (31)
Under price-parity, the same expression emerges but with pM1 taking the place of p1 and
p1. For the equilibrium retail price to be higher under price-parity, it is enough that the
equilibrium revenue-sharing term r is smaller under price-parity. The analogue of earlier
assumptions is that the right-hand side of Equation (31) intersects the left-hand side once
and from above as 1 − r increases (or equivalently, as p increases, letting the industry-
level r equilibrate according to the function p∗(r)). Hence, and now suppressing unneeded
notational elements for simplicity, price-parity lowers the industry revenue-share term r if
p− (1− r)pM1(Q1 +Q3)pM1 + (Q2 +Q4)pM1
>p− (1− r)p1
(Q1 +Q3)p1 + (Q2 +Q4)p1
. (32)
Earlier in this proof I showed that (under revenue-sharing) pM1 < 0 and also that pM1 =
(p1 + p1)/2. Furthermore, by the conditions of the proposition, p1 < p1. Thus, it follows
that p1 < pM1 < p1 and also p1 < pM1 < 0. Because Q1 + Q3 < −(Q2 + Q4) < 0, these
observations imply that the denominators are positive. In turn, because the overall margins
must be positive, the numerators must be positive.
43To be precise, pM1 > 0 so long as the condition
p− (w + w)
2− cU = − Q(p)
Q1(p) +Q2(p)
is such that the right-hand side intersects the left-hand side from above. This just says industry profits are quasi-concave in the industry price p.
45
To prove that the inequality in (32) holds, I will first argue that
p
(Q1 +Q3)pM1 + (Q2 +Q4)pM1>
p
(Q1 +Q3)p1 + (Q2 +Q4)p1
.
This holds so long as
(Q1 +Q3)p1 + (Q2 +Q4)p1 > (Q1 +Q3)pM1 + (Q2 +Q4)pM1 .
Because (Q1 + Q3) < 0, and because p1 < pM1 < 0, it is sufficient that (Q2 + Q4)p1 >
(Q2 +Q4)pM1 . Because (Q2 +Q4) > 0, in turn it is sufficient that p1 > pM1 , which I already
showed. Hence, to prove that Equation (32) holds, it is enough to show that (after eliminating
the common term −(1− r) < 0)
pM1(Q1 +Q3)pM1 + (Q2 +Q4)pM1
<p1
(Q1 +Q3)p1 + (Q2 +Q4)p1
.
Multiplying through and simplifying, and again using the facts that (i) the denominators
are positive, (ii) Q2 + Q4 > 0, and (iii) pM1 < 0, this reduces to the condition that p1 < p1,
which is true by the conditions of the proposition. �
References
Adachi, T. and T. Ebina (2014). Double marginalization and cost pass-through: Weyl-
fabinger and cowan meet spengler and bresnahan-reiss. Economics Letters 122 (2), 170–
175.
Amir, R., I. Maret, and M. Troege (2004). On taxation pass-through for a monopoly firm.
Annales d’conomie et de Statistique (75/76), 155–172.
Anderson, S. P., A. De Palma, and B. Kreider (2001). The efficiency of indirect taxes under
imperfect competition. Journal of Public Economics 81 (2), 231–251.
Bagnoli, M. and T. Bergstrom (2005). Log-concave probability and its applications. Eco-
nomic Theory 26 (2), 445–469.
Baker, J. B. (1996). Vertical restraints with horizontal consequences: Competitive effects of
“most-favored-customer” clauses. Antitrust Law Journal 64, 517–34.
Bishop, R. L. (1968). The effects of specific and ad valorem taxes. Quarterly Journal of
Economics 82 (2), 198–218.
Boik, A. and K. S. Corts (2016). The effects of platform most-favored-nation clauses on
competition and entry. Journal of Law and Economics 59 (1), 105–134.
Bonanno, G. and J. Vickers (1988). Vertical separation. Journal of Industrial Econom-
ics 36 (3), 257–265.
Cachon, G. P. and M. A. Lariviere (2005). Supply chain coordination with revenue-sharing
contracts: Strengths and limitations. Management Science 51 (1), 30–44.
46
Dana, J. D. and K. E. Spier (2001). Revenue sharing and vertical control in the video rental
industry. Journal of Industrial Economics 49 (3), 223–245.
DeGraba, P. and A. Postlewaite (1992). Exclusivity clauses and best price policies in input
markets. Journal of Economics and Management Strategy 1 (3), 421–454.
Edelman, B. G. and J. Wright (2015). Price coherence and excessive intermediation. Quar-
terly Journal of Economics 130, 1283–1328.
Foros, Ø., H. J. Kind, and G. Shaffer (2015). Apple’s agency model and the role of resale
price maintenance. Working Paper .
Gans, J. (2012). Mobile application pricing. Information Economics and Policy 24, 52–59.
Gaudin, G. and A. White (2014a). On the antitrust economics of the electronic books
industry. Working Paper .
Gaudin, G. and A. White (2014b). Unit vs. ad valorem taxes under revenue maximization.
Working Paper .
Hagiu, A. and J. Wright (2015). Marketplace or reseller? Management Science 61 (1),
184–203.
Johansen, B. O. and T. Verge (2016, September). Platform price parity clauses with direct
sales. Working Paper .
Johnson, J. P. (2013). The agency and wholesale models in electronic content markets.
Working Paper .
Kourandi, F. and N. Vettas (2012). Endogenous spatial differentiation with vertical con-
tracting. Working Paper .
Krishnan, H., R. Kapuscinski, and D. A. Butz (2004). Coordinating contracts for decentral-
ized supply chains with retailer promotional effort. Management Science 50 (1), 48–63.
Lin, Y. J. (1990). The dampening-of-competition effect of exclusive dealing. Journal of
Industrial Economics 39 (2), 209–223.
Llobet, G. and J. Padilla (2014). The optimal scope of the royalty base in patent licensing.
Working Paper .
Loertscher, S. and A. Niedermayer (2015, August). Percentage fees in thin markets: An
optimal pricing perspective. Working Paper .
Mathewson, G. F. and R. A. Winter (1985). The economics of franchise contracts. Journal
of Law and Economics 28 (3), 503–526.
McAfee, R. P. and M. Schwartz (1994). Opportunism in multilateral vertical contracting:
Nondiscrimination, exclusivity, and uniformity. American Economic Review 84 (1), 210–
230.
Reisinger, M. and M. Schnitzer (2012). Successive oligopolies with differentiated firms and
endogeneous entry. Journal of Industrial Economics 60 (4), 537–577.
Rey, P. and J. Stiglitz (1995). The role of exclusive territories in producers’ competition.
RAND Journal of Economics 26 (3), 431–451.
47
Salinger, M. A. (1988). Vertical mergers and market foreclosure. Quarterly Journal of
Economics 103 (2), 345–356.
Skeath, S. E. and G. A. Trandel (1994). A pareto comparison of ad valorem and unit taxes.
Journal of Public Economics 53, 53–71.
Suits, D. B. and R. A. Musgrave (1953). Ad valorem and unit taxes compared. Quarterly
Journal of Economics 67 (4), 598–604.
Wang, Z. and J. Wright (2017, forthcoming). Ad-valorem platform fees, indirect taxes and
efficient price discrimination. RAND Journal of Economics .
Weyl, E. G. and M. Fabinger (2013). Pass-through as an economic tool: principles of
incidence under imperfect competition. Journal of Political Economy 121 (3), 528–583.