The Priest-Klein Hypotheses: Proofs, Generality, and

Preliminary Draft

1

The Priest-Klein Hypotheses: Proofs, Generality, and Extensions

Yoon-Ho Alex Lee and Daniel Klerman*

Draft: July 14, 2014

ABSTRACT

Priest and Klein’s 1984 article, “The Selection of Disputes for Litigation,” famously

hypothesized a “tendency toward 50 percent plaintiff victories” among litigated cases.

Despite the passage of thirty years and the article’s enduring influence, their hypothesis

has never been formally proven, and doubts remain about its validity and the

assumptions needed to sustain it. This article makes four contributions. First, it

distinguishes six hypotheses plausibly attributable to Priest and Klein. Second, it

rigorously proves five of these hypotheses, shows that two of them are valid with

greater generality than Priest and Klein assumed, and shows that one of them is false.

Third, it shows that some of the hypotheses remain valid even when an incentive-

compatible bargaining mechanism is employed. And finally, it shows that some of the

hypotheses remain valid when parties use Bayes’ rule and information about the

distribution of all disputes to refine their estimates of the plaintiff’s probability of

prevailing.

* Yoon-Ho Alex Lee is Assistant Professor of Law, USC Gould School of Law.

[email protected]. Daniel Klerman is Charles L. and Ramona I. Hilliard Professor of Law and History,

USC Gould School of Law. Charles L. and Ramona I Hilliard Professor of Law and History, USC Gould

School of Law. www.klerman.com. [email protected]. The authors thank Ken Alexander, Andrew

Daughety, Dan Erman, Jonah Gelbach, Bart Kosko, Jennifer Reinganum, as well as the participants at the

Twenty-Fourth Annual Meeting of the American Law and Economics Association for their helpful

comments and suggestions.

mailto:[email protected]

http://www.klerman.com/

mailto:[email protected]

Preliminary Draft

2

INTRODUCTION

Priest and Klein’s 1984 article about the selection of disputes for litigation famously

hypothesized there will be a “tendency toward 50 percent plaintiff victories” among litigated

cases (p.20). Their article has been one of the most influential law review articles of all time,

and its influence is growing as empirical work on law becomes more prominent. Compare

Shapiro & Pearse (2012) to Shapiro (1996). Even with the introduction of more rigorous models

of settlement—notably game-theoretic models involving asymmetric information—Priest and

Klein’s article continues to be cited by sophisticated empiricists and in the most respected peer-

reviewed journals. See, e.g., Hubbard (2013); Gelbach (2012); Atkinson (2009); Bernardo,

Talley & Welch (2000); Waldfogel (1995); Siegelman & Donohue (1995).

Despite the passage of nearly thirty years since the publication of Priest and Klein’s

original paper, their results have never been rigorously proved, and doubts remain about the

assumptions needed to sustain their conclusions. Because Priest and Klein supported their

argument only with simulations and an informal, graphical proof, the precise statement and

scope of their claims have not been entirely clear. Subsequently, Waldfogel (1995) formalized

Priest and Klein’s model, following carefully Priest and Klein’s original set-up and notation but

without formally proving any results. Shavell (1996 p. 499, nn. 19-20) provided a sketch of a

more rigorous proof. Part of the challenge is that the formalization involves a double integral

over a region of integration that is only implicitly defined. (Waldfogel (1995), p.237). Although

Hylton and Lin (2012) also formalize and prove some of Priest and Klein’s claims, they do so

using a model substantially different and in many way’s less plausible and less general than

Priest and Klein’s.1

In this paper, we provide the first rigorous set of proofs of various hypotheses that can be

attributed to Priest and Klein, while remaining faithful to Priest and Klein’s original set-up as

well as generalizing and extending it. Through our proofs, we establish both exact mathematical

statements of Priest and Klein’s hypothesis and the precise extent to which their statements are

true.

Specifically, we think it helpful to distinguish between six different hypotheses plausibly

attributable to the Priest and Klein (1984):

Trial Selection Hypothesis. This hypothesis states that “the disputes selected for

litigation (as opposed to settlement) will constitute neither a random nor a representative

sample of the set of all disputes” (p.4). This proposition is probably the most important

contribution of their article, and it is universally accepted. It is true even under more

recent asymmetric information models of settlement. See Klerman & Lee (2014).

Fifty-Percent Limit Hypothesis. This hypothesis states that “as the parties’ error

diminishes” there will be a “convergence towards 50 percent plaintiff victories” (pp.18).

This hypothesis is often called the Priest-Klein hypothesis. Priest and Klein acknowledge

1 Hylton and Lin’s model differs from Priest and Klein’s in that Priest and Klein assume that case

strength is measured by a real number, , where plaintiff prevails if , where

represents the decision standard or legal rule. In contrast, Hylton and Lin assume that case strength is a

probability. Although this difference might seem small, it has major modeling implications. In addition,

Hylton and Lin make use of an extrinsic “censoring function” -- the function determining which cases

will be litigated – to arrive at their result.

Preliminary Draft

3

that “the fifty percent success rate will actually be achieved only near the limit” (p.22).

In this paper we provide the first formal proof that the Fifty-Percent Limit Hypothesis is

valid, both under the assumptions in Priest and Klein’s1984 article and under a broader

set of conditions and models.

Asymmetric Stakes Hypothesis. This hypothesis states that, if defendant would lose

more from an adverse judgment than plaintiff would gain, then plaintiff will win less than

fifty percent of the litigated cases; conversely, if plaintiff has more to gain, then plaintiff

will win more than fifty-percent (see pp.24-26). This hypothesis is most plausibly a

statement about the limit percentage of plaintiff victories as the parties become

increasingly accurate in predicting trial outcomes. In this paper, we show that, under the

divergent expectations model used by Priest and Klein, the asymmetric stakes hypothesis

is generally valid. But we also find that even the hypothesis that plaintiffs win less often

when defendants have more to lose is not generally true under an incentive-compatible

mechanism.

Irrelevance of the Dispute Distribution Hypothesis. This hypothesis claims that as the

parties become increasingly accurate in predicting trial outcomes, the limit value of the

plaintiff trial win rate will be “unrelated to the position of the decision standard or to the

shape of the distribution of disputes” (pp. 19 & 22). This hypothesis is closely related to

the Fifty-Percent Limit Hypothesis, but more fundamental. It is also more general,

because it holds even when the stakes are unequal.

No Inferences Hypothesis. This hypothesis states that because selection effects are so

strong, no inferences can be made about the law or legal decisionmakers from the

percentage of plaintiff trial victories, because “the proportion of observed plaintiff

victories will tend to remain constant over time regardless of changes in the underlying

standards applied.” (p. 31). As a result, if one is looking at plaintiff trial win rates,

differences between legal standards or legal decisionmakers “will always be concealed by

the parties’ selection of disputes for litigation.” (p. 49). If the stakes are equal, plaintiff

will prevail fifty-percent of the time (or close to that percentage), regardless of the law,

legal institutions, or the biases of judges or juries. To the extent that there are deviations

from the fifty-percent prediction, they will be the result of asymmetric stakes or factors

other than the legal standard or characteristics of the judge or jury. In a companion article,

Klerman & Lee (2014), we show that the No Inferences Hypothesis is generally false.

Under the divergent expectations model used by Priest and Klein and under more recent

asymmetric information models, changes in the legal standard and differences in legal

decisionmakers may result in predictable changes in the percentage of plaintiff trial

victories.

Fifty-Percent Bias Hypothesis. This hypothesis states that regardless of the legal

standard, the plaintiff trial win rate will exhibit “a strong bias toward . . . fifty percent” as

compared to the percentage of cases plaintiff would have won if all cases went to trial (pp.

5 & 23). In this paper we show that the Fifty-Percent Bias Hypothesis is true less often

than the Fifty-Percent Limit Hypothesis. More specifically, unlike the Fifty-Percent

Limit Hypothesis, the validity of the Fifty-Percent Bias Hypothesis will depend on the

Preliminary Draft

4

symmetry and the log-concavity of the distribution of disputes. So the Fifty-Percent Bias

Hypothesis will be true for normal distributions, but not for many other distributions.

Of course, analytical proofs (and disproofs) of these hypotheses, by themselves, do not imply

their empirical validity. Nonetheless, analytical proofs can guide empirical research by

elucidating under what conditions the hypotheses should be valid.

In addition to proving these hypotheses using Priest and Klein’s original model, we

consider two possible extensions of Priest and Klein’s model. First, the model has been

criticized for lacking an incentive-compatible mechanism. Priest and Klein implicitly assumes

that as long as the plaintiff’s subjective expected net trial recovery is greater than the defendant’s

subjective expected net loss, the parties will settle. Nevertheless, much has been written about ex

post inefficiency arising in strategic bargaining. See Myerson and Satterthwaite (1985); but see

McAfee and Reny (1992). Our paper addresses the possibility of bargaining inefficiency by

coupling Priest and Klein’s model to an incentive-compatible mechanism. Our approach is

similar to Friedman and Wittman (2007) in that we employ the Chatterjee-Samuelson

mechanism, but different in that our model retains Priest and Klein’s original set-up. By

remaining faithful to Priest and Klein’s original model, we can identify the extent to which their

hypotheses are robust to an incentive-compatible mechanism. Under this set-up, we identify one

class of plausible Nash equilibria and show that several of Priest and Klein’s hypotheses remain

valid for this class, including the Trial Selection, Fifty-Percent Limit, and Irrelevance of Dispute

Distribution Hypotheses. Somewhat surprisingly, the Fifty-Percent Limit result appears to hold

for some cases with asymmetric stakes, which strengthens the Fifty-Percent Limit Hypothesis.

Second, the original version of Priest and Klein’s model assumes (implicitly) that parties

estimate the plaintiff’s probability of prevailing without taking into account information about

the underlying distribution of all disputes. More precisely, Priest and Klein assume that the

defendant’s degree of fault can be represented by a real number, , and plaintiff and defendant

estimate the defendant’s fault with error. So plaintiff’s point estimate is , where

has mean zero and standard deviation . Similarly, defendant’s point estimate is ,

where also has mean zero and standard deviation , and and are independent. Parties

then estimate the plaintiff’s probability of prevailing by calculating the likelihood that true is

greater than under the assumption that the true is distributed with mean or and

standard deviation . This means that the parties are not using Bayes’ rule and the information

they may have about the distribution of to estimate the likelihood of plaintiff victory. Later

scholars, including Waldfogel (1995), have similarly assumed that the parties are naïve and non-

Baysean in this regard. Priest and Klein’s model can, however, be modified to include the

assumption that parties are sophisticated Bayseans who take into account the underlying

distribution of disputes. This modification, while plausible, has its own weaknesses, and we

think it is potentially of value only for limiting behaviors. We show that several of Priest and

Klein’s hypotheses remain valid if the parties use Bayes’ rule, including the Trial Selection,

Fifty-Percent Limit, Asymmetric Stakes, and Irrelevance of Dispute Distribution Hypotheses.

The rest of the paper proceeds as follows. Part I explores a formalized version of Priest

and Klein’s model. Part I is faithful to the original model in that, although we generalize or relax

certain assumptions, the set-up is entirely consistent with Priest and Klein’s article. Parts II

explore the validity of some of the hypotheses when parties use the Chatterjee-Samuelson

mechanism. Like Part I, Part II assumes the parties do not use Bayes’ rule and information about

the distribution of disputes in calculating the plaintiff’s probability of prevailing. Finally, Part III

Preliminary Draft

5

explores how things change when parties do use Bayes’ rule in calculating the plaintiff’s

probability of prevailing. Part III does so both under the assumption that parties settle whenever

plaintiff’s subjective expected litigation payoff is lower than defendant’s subjective expected

litigation and under the Chatterjee-Samuelson mechanism.

I. FORMALIZATION OF PRIEST-KLEIN’S ORIGINAL MODEL

This Part assumes familiarity with Priest and Klein (1984) and follows Waldfogel’s

(1995) formalization. Though there have been other attempts to formalize Priest and Klein’s

model (see Wittman (1985) and Hylton and Lin (2012)), Waldfogel offers the formalization that

is most faithful to the model in Priest and Klein’s original paper. See Hylton & Lin (2012), n.5.

Priest and Klein (1984) represent the merits of a case by a real number , and the

decision standard as , where defendant prevails if , and plaintiff prevails if .

For example, in a negligence case, might be the efficient level of precaution expenditures

minus defendant’s actual precaution. Under these assumptions, Y* would be zero. Priest and

Klein, for simulation purposes, assume is distributed according to a standard normal

distribution. We do not impose that restriction but instead assume is distributed according to

a probability density function that is bounded above everywhere, and locally continuous

and nonzero at . Note that if all disputes were to be litigated, the overall plaintiff win rate

would be ∫

If cases go to trial, courts observe the true and give verdicts based on its location as

compared to . Plaintiff and defendant themselves make unbiased estimates of ,

and , respectively, where and are have mean 0, standard deviations and

, respectively, and are distributed according to the joint probability distribution ( )

Priest and Klein assume that and are independent and that ( ) is bivariate

normal with We do not impose these restrictions. Instead we assume that and

are distributed with mean zero, not necessarily independent, and distributed with standard

deviations and , respectively, according to a joint probability density function ( )

such that ( ). In other words,

( ) belongs to a family

of mean-zero probability density functions that vary parametrically with and and can be

standardized with proper scaling. The reason why we make this assumption is that our analysis

requires taking the limit as and approach zero, and therefore, there needs to be a well-

defined family of distributions as the standard deviation parameter varies. A bivariate normal

distribution with mean zero certainly satisfies this condition, but a host of other distributions also

satisfy this condition.2 We assume the support of

( ) is the entire , but our proof

can easily be generalized to cover distributions with finite support.

2 A partial list of probability distributions that satisfy this condition include bivariate distributions

composed of: uniform distributions, triangular distributions, and raised cosine distributions for finite

support; and generalized normal distributions, hyperbolic secant distributions, Laplace distributions,

logistic distributions (if we scale by

√ ), Pearson Type IV distributions. Given any univariate probability

density function with mean zero and standard deviation 1, one can always construct such a family of

bivariate density function by letting ( )

(

) (

)

.

Preliminary Draft

6

In order to estimate the probability with which the plaintiff will prevail, both the plaintiff

and the defendant need to take into account the fact that their estimates of case merit,

and , are not wholly accurate. Therefore, they must estimate both the mean and standard

deviation of their estimates of . Following Priest and Klein (1986) and Waldfogel (1995), in

this section we assume that plaintiff estimates the mean of sampling distribution of to be

and the standard deviation to be .3 As a result, plaintiff’s subjective estimate of the

probability it will prevail, ( | ), will simply be [

], where is

the univariate standardized cumulative distributions of . Similarly, defendant estimates the

mean of to be , and the standard deviation to be . So defendant estimates the

probability that the plaintiff prevails to be [

], where and is the univariate

standardized cumulative distributions of Priest and Klein assume that the parties litigate if , where

is the damages that defendant pays plaintiff if the case is litigated and plaintiff prevails,

and are litigation costs for plaintiff and defendant respectively, and and are settlement

costs for plaintiff and defendant respectively. This condition for litigation makes sense, because

settlement can only happen if both parties perceive the payoffs to settlement to be higher than the

payoffs to litigation. The settlement condition can be rewritten as , where

and . is known as the Landes-Posner-Gould

condition for litigation, after the three scholars who formulated it. Priest and Klein simulate their

results with

. We assume .

Priest and Klein are silent about how the parties bargain to arrive at a settlement.

Technically, the Landis-Posner-Gould condition is merely a sufficient condition for litigation,

but not a necessary one. Litigation might happen even if the condition is violated, because

parties might not be able to agree on the settlement amount, even if there is a range of settlement

amounts that would be in their perceived mutual interest. That is, as modern mechanism design

research as shown, bargaining is frequently inefficient. See Myerson and Satterthwaite (1983);

but see McAfee and Reny (1992). Nevertheless, Priest and Klein (1984) and others using the

divergent expectations model have assumed that the Landes-Posner-Gould condition is necessary

as well as sufficient for litigation, and, in this Part, we proceed with that assumption.4

Priest and Klein allow for the possibility that parties may have asymmetric stakes and

suggest there will be a deviation from fifty-percent in such cases. For example, the defendant

may be more concerned about its reputation or an adverse precedent, so it may lose more from an

adverse judgment than the plaintiff gains from prevailing. As Priest and Klein point out,

asymmetric stakes can be formalized by assuming that, the plaintiff would win if it prevailed

and the defendant would lose if the plaintiff won. Under these assumptions, parties litigate if

if . If we assume , the trial condition becomes

.5 If is greater than 1, then plaintiff faces a greater stake in the litigation than the

defendant, and vice versa. It is often easier to think of cases in which , as in cases in

3 See Part III for an alternative way of estimating the mean and standard deviation of

4 We relax that assumption in Part II.

5 As Waldfogel notes, this condition assumes that parties have symmetric trial and settlement

costs even as they face asymmetric stakes. See Waldfogel (1995), p.236.

Preliminary Draft

7

which defendants has a lot more to lose—such as reputation and future loss of profits—from

litigation than plaintiffs. Nevertheless, in this paper, we place no theoretical limitation on

other than that it is positive. From the trial condition, since it follows that if

plaintiff will not bring the case, because it has no credible threat to take the case to trial.

Without a credible threat to go to trial, defendant has no incentive to settle, so plaintiff derives no

benefit from suing. Thus, we assume throughout

In addition, since plaintiff and

defendant make their decision to go to trial based on their expectation of judgment amount, the

analysis with respect can readily translate to cases where plaintiff and defendant disagree

about the amount of damages.

A. TRIAL SELECTION HYPOTHESIS AND IRRELEVANCE OF THE DISPUTE DISTRIBUTION

HYPOTHESIS

Let be the probability that a case goes to trial when the decision

standard is , where parties predict their case merits with errors and that are distributed

with mean zero and standard deviations and . When , we will simply denote

this probability as .

can be written as the probability that

[

] [

]

6

The fraction of cases litigated is ∫

. This value clearly approaches

zero as and approach zero. Recall that if all disputes were litigated, the overall plaintiff

win rate would be ∫

The plaintiff trial win rate is

∫

∫

Since this is different from ∫

, we can readily see that the set of litigated cases is not

simply a random set of all disputes. The Trial Selection Hypothesis is therefore clearly correct.

Note that , as written, depends on many parameters, including the shape of

the distribution of disputes. In the limit as and approach zero, however, the plaintiff trial

win rate will not vary with the distribution of disputes. The following Proposition provides the

specific functional form for the limit of the plaintiff trial win rate and establishes the Irrelevance

of the Dispute Distribution Hypothesis. All proofs are in Appendix.

PROPOSITION 1 (IRRELEVANCE OF THE DISPUTE DISTRIBUTION HYPOTHESIS). The limit value of

the plaintiff trial win rate under the Priest-Klein model will not depend on the distribution of

disputes if and only if the plaintiff’s stake in not significantly larger than defendant’s stake.

More specifically, suppose for a fixed , is locally continuous and

nonzero around and bounded above everywhere, and and are distributed with mean

6 As discussed above, this formulation assumes that parties do not take the underlying distribution

of disputes into account.

Preliminary Draft

8

zero according to ( ) such that

( ) with full support

over . Then the plaintiff trial win rate is given by

∫

∫

where ∬

( )

and

{( ) | [

] [

] }

For

, the limit of this value as approach 0 reduces to an expression

independent of :

∫

∫

For

, the limit value

will always be 1 and thus also independent of When

, whether the limit value

will depend on and will depend on the shape of [ ] and [ ].

Before we go on, we make a few observations. First, although is well-

defined for all positive values, it is undefined at .7 This is because when , each

party knows whether it will win or lose with 100% certainty and no cases will go to trial.

Proposition 1 is therefore a statement about the limit value of a function at a point at which it is

not continuous (since undefined). Second, the result indicates that for

, the

limit value of the overall plaintiff win rate does not depend on the shape of the distribution of

disputes, , but only on the shape of the distribution of errors, , which in turn

depends on the shape of as well as the region of integration (which is affected by ).

Third, by contrast, when

, the limit value of plaintiff’s win rate will also not depend

on the distribution of disputes because it will be 1 regardless of or values and regardless of

. This is somewhat surprising and might be labeled as the “One Hundred Percent Limit

Hypothesis.” In this case, there is a threshold case estimate for the plaintiff, above

which the defendant will be unable to make an attractive settlement offer even for disputes the

defendant considers as sure-win cases (e.g., and even as the range of error becomes

really small. Thus, as goes to zero, all disputes that are intrinsically stronger cases for

plaintiffs than will always litigate, but no cases below the threshold value will get

litigated. Therefore, in the limit, plaintiffs will all cases litigated, and the limit value is 1.

The formal proof is included in the Appendix. We include only a portion of the proof

here. Since for a fixed as approaches zero, necessarily approaches

zero. We begin by normalizing the variables. For a given , let

. Then we have

7 This does not affect integrability over since the point of discontinuity has measure zero.

Preliminary Draft

9

( )

( )

(

)

Meanwhile, { | [ ] [ ]

} . Therefore, for each

∬ (

)

This normalization of the variables is useful, because before normalization, as a

function of , was a double integral of a fixed bivariate distribution over a region of integration

in the –plane that depended on both and and . After the change of variables,

, as a function of , is a double integral of a bivariate

distribution (whose center has been translated) over a region of integration in the –plane that

does not depend on depend on either or but only on . Figure 1 represents the region of

integration when . Note that in this case, the region of integration in the -plane,

call it , is bounded by a horizontal asymptote above and a vertical asymptote on the left

side. In other words, it will be contained in a translated fourth quadrant. This will generally be

true for

, although depending on the value the location of the asymptotes will vary.

The integral in the numerator of is evaluated by centering the bivariate distribution

of errors at each point along the line in the first quadrant and integrating it over the shaded

area. Two such center points are illustrated in Figure 1.

So, for all we can rewrite as Therefore,

∫

∫

∫

∫

Preliminary Draft

10

Figure 1. Litigation Set in the -Space when

For the main result when

, we are done if we can take the limits under

the integral by applying Lebesgue’s Dominated Convergence Theorem. Then

∫

∫

∫

∫

since is locally continuous at and since . Application of the Theorem,

however, requires existence of a Lebesgue-integrable function that dominates

. Lemma A1 from the Appendix shows when

such construction is possible by

taking the same integral but over a region of integration that properly contains . To

show this upper-bound function is Lesbesgue-integrable—in other words, that the integral of the

constructed function is indeed finite—we make use of bivariate Chebyshev’s Inequality. Note in

particular from Figure 1 that is calculated as follows: for each point , along

the line , the integral of centered at and integrated over .

Since is bounded above, we need only show that is dominated by a Lebesgue-

integrable function. But notice that after some threshold point in either direction,

will always be properly contained in the set which is the complement of a square box centered at

, whose length increases as increases. Therefore, , eventually, will always

be strictly smaller than the integral of centered at but integrated over the

complement of the square box with an increasing length. Because Chebyshev’s Inequality gives

us an inverse quadratic relation between the distance from the mean and the integration of any

probability distribution away from the mean by that distance, the integral of centered

Preliminary Draft

11

at over the complement of the square box must eventually decrease at least as fast as the

speed of , which must converge to a finite value. Hence, a Lebesgue-integrable dominating

function is constructed, and we can take the limits under the integral.

In contrast, when

, it is easy to see that is characterized as a region

to the left of a graph that is characterized by two vertical asymptotes (shown in Figure 2). In this

case, will properly contain a region defined by for some . To get the

limit, we employ Chebyshev’s Inequality to the complement of for the numerator, and

we can show that the numerator and the denominator must diverge to infinity for positive at the

same pace, and thus the limit will equal 1. Therefore, we have the One Hundred Percent Limit

Hypothesis in this case.

Figure 2. Litigation Set in the -Space when and

B. FIFTY-PERCENT LIMIT HYPOTHESIS

The fact that we are able to derive the functional form of the limit value is useful. This

means that the limit value will equal fifty percent only when ∫

∫

is equal to fifty

percent, and not otherwise. Therefore, the limit value of the overall plaintiff win rate will be

determined entirely by the shape of . For example, if is symmetric around

zero, the limit value will equal exactly fifty percent; and if is not symmetric around

zero, the limit value will generally not equal exactly fifty percent except by coincidence. The

rest of this section makes clear that the assumptions made by Priest and Klein satisfy this

corollary and explores some of the other parameters which would also lead to a fifty-percent win

rate at trial.

Preliminary Draft

12

LEMMA 1.1. If plaintiff’s and defendant’s stakes are symmetric and their prediction errors,

and , are distributed with a common standard deviation and according to a joint probability

density function that is symmetric around 0 and symmetric with respect to each other, then the

limit value of the overall plaintiff win rate will be fifty percent. More specifically, if

( ) ( ), which will be necessarily true when ( )is symmetric in ,

in , and with respect to and (that is, ( ) ( )), then is

symmetric around . That is,

for all .

We make a few observations. In particular, if ( ) is symmetric (as in Lemma

1.1), the parties’ stakes are symmetric and the parties are equally competent in their ability to

estimate the merit of their case, then the limit value will indeed equal fifty-percent.

COROLLARY 1.2 (FIFTY-PERCENT LIMIT HYPOTHESIS). If plaintiff’s and defendant’s prediction

errors, and , are distributed with a common standard deviation (i.e., and

according to a joint probability density function that is symmetric around 0 and symmetric with

respect to each other, and if the stakes are equal (i.e., , then the limit value of the plaintiff

trial win rate is fifty percent.

This Corollary formalizes Priest and Klein’s statement that “[t]here will always persist a

tendency toward 50 percent plaintiff victories regardless of the shape of the distribution or the

position of the standard,” (p.22), as long as we understand the phrase “tendency toward 50

percent” as a statement about the limit value. As stated, Corollary 1.2 is more general than Priest

and Klein’s original version, because Priest and Klein assume that ( ) is independent

bivariate normal and the distribution of disputes, is normal. These latter assumptions,

while reasonable, are not strictly necessary. Meanwhile, the specific functional form of the limit

value shows that under skewed distributions, the fifty percent limit hypothesis will be

systematically violated. If ( ) is skewed toward the first quadrant in the -space, the

limit value of the plaintiff’s overall win rate will be greater than fifty percent, and likewise, if it

is skewed to the third quadrant, the limit value of the plaintiff’s overall win rate will be less than

fifty percent.

C. ASYMMETRIC STAKES HYPOTHESIS AND ASYMMETRIC INFORMATION

In this Part, we discuss the case of . This can be interpreted as plaintiff and

defendant facing asymmetric stakes, or disagreeing about the amount of damages. The fact that,

in Theorem 1, there is an upper bound on the asymmetry of the stakes, is intuitive but also

significant. As gets high, plaintiff is more likely to litigate the case regardless of its merit. For

example, if then every single case will be litigated since the trial condition

will always be satisfied. In that case, the overall plaintiff trial win rate will simply be

∫

. In this extreme case, plaintiff’s win rate will depend on the shape of but

will be independent of . In other words, it will be independent of the joint probability

distribution of the errors and the region of integration. For an intermediate range of , the

overall plaintiff win rate will depend on all three factors.

Preliminary Draft

13

Nevertheless, it is somewhat striking that the upper bound on should be relatively

small—that if the plaintiff’s stake is greater than twice the defendant’s stake, the plaintiff trial

win rate depends on the shape of the distribution of disputes. As discussed in Klerman and Lee

(2014), it seems most plausible to assume

⁄ . Therefore, since Theorem 1’s requirement

is that

, this implies an upper-bound of around ⁄ for , before the shape of the

dispute distribution has an effect on the limit value.

Meanwhile, if is lower than the upper bound (but the stakes are still asymmetric), then

the region of integration will not be symmetric with respect to the line , and thus the

limit value will not generally equal to fifty percent (except by coincidence). More specifically,

within a given range of , we have the following corollary.

COROLLARY 1.3 (ASYMMETRIC STAKES HYPOTHESIS). If the plaintiff’s stake in not significantly

larger than defendant’s stake, then the overall plaintiff win rate will be increasing in the ratio of

the plaintiff’s stake to the defendant’s stake. Specifically, for

, the limit value

of plaintiff’s win rate is increases strictly in . In particular, if the limit value is exactly equal to

fifty percent when , then the limit value will be less than fifty percent if the defendant has a

greater stake than the plaintiff, and more than fifty percent if the plaintiff has a greater stake

than the defendant. When

, the limit value will always be 1.

The result is consistent with Priest and Klein’s observation that “where the stakes are

greater to defendants than to plaintiffs, relatively more defendant than plaintiff victories ought to

be observed in disputes that are litigated. The results are reversed where the stakes are greater for

the plaintiff.” (p.25). Nevertheless, when the plaintiff has more at stake than the defendant,

Priest and Klein’s statement must be qualified. The plaintiff will win more than fifty percent

when the stakes are only moderately asymmetric. If the plaintiff has much more at stake, that is,

if

, then the plaintiff win rate in the limit will be 1 and therefore, here again, the

plaintiff will indeed win more than fifty percent.

In addition, although the Priest and Klein did not discuss asymmetric information, it is

interesting to note that by assuming that the parties differ in their ability to estimate trial

outcomes, the Priest-Klein model can generate results that are consistent with modern

asymmetric information models. If plaintiff and defendant differ in their ability to predict the

merit of the case, then, even when ( ) is bivariate normal, the joint probability

function will not be symmetric in and and thus the limit value will not generally equal to

fifty percent (except by coincidence). Instead, the plaintiff’s win rate will vary predictably with

his comparative accuracy (over the defendant) of estimating the case’s merit.

COROLLARY 1.4 (ASYMMETRIC INFORMATION). The plaintiff trial win rate is (comparatively)

higher the more (comparatively) accurately plaintiff can estimate the true , and vice versa. In

other words, given

Furthermore, as approaches

infinity (i.e., plaintiff has full-knowledge), the overall plaintiff trial win rate rate will approach

one; and as approaches zero (i.e., defendant has full-knowledge), the overall plaintiff trial win

rate will approach zero.

Preliminary Draft

14

Note that this corollary is similar to results under the standard screening and signaling

models. The proportion of litigated cases won by the party with the informational advantage will

be larger than the proportion of cases it would have won if all cases were litigated. See Hylton

(1993), Klerman & Lee (2014). The prediction of Corollary 1.4, however, is more extreme, as a

result of Priest and Klein’s assumption that courts decide without error. Under the assumptions

usually made by asymmetric information models – that one knows its type, while the other

knows only the distribution of all types – Corollary 1.4 predicts that the informed party will win

100% of the time, whereas that will not generally be true under asymmetric information models.

This extreme prediction follows from the fact that, in Priest and Klein’s model, defendant type is

a real-number representation of the merits of the case, and the real line is divided into two halves,

a part where the plaintiff wins with certainty, and a part where the defendant wins with certainty.

Thus, if a party knows the defendant’s type, it can predict the outcome with certainty. In contrast,

in most asymmetric information models, party type is the probability that the plaintiff will

prevail, which ranges from zero to one and can take on all intermediate values. As a result,

under most asymmetric information models, if a party knows its type, it usually knows a

probably that is neither zero nor one.

D. FIFTY-PERCENT BIAS HYPOTHESIS

Thus far we have limited our discussions to hypotheses relating to the limit as parties

become increasingly accurate in predicting trial outcomes. Proposition 1, however, also offers

some insight into the Fifty-Percent Bias Hypothesis. That hypothesis says that the plaintiff trial

win rate will be closer to fifty percent than the percentage of cases that plaintiff would have won

if all cases had been litigated and none had settled.

Note, however, that this hypothesis is only plausible if the following two conditions are

met: first, the stakes are symmetric ( ) and both parties are equally well informed ( ;

and second, the plaintiff trial win rate if all cases were litigated isn’t itself 50%. If the first

condition is not satisfied, Proposition 1 tells us that the limit value of the plaintiff win rate will

not itself be fifty percent in general, and therefore, the Bias Hypothesis is likely to be false for

sufficiently small values of . On the other hand, if the first condition is satisfied, the plaintiff

trial win rate will converge to 50%, and thus it is reasonable to think that the plaintiff trial win

rate in the real world (where most cases settle) will be closer to its limit value than the plaintiff

trial win rate in a counterfactual world where no cases settle. The second condition is also

necessary (and plausible) because if the plaintiff win rate if no cases were settled happens to be

fifty percent, it is impossible for plaintiff trial win rates to be closer to fifty percent.

It turns out that, even if the two conditions above are satisfied, the bias hypothesis is not

generally true unless we make more restrictive assumptions about . We show that if

is symmetric (although not necessarily symmetric around ) and is logarithmically concave—

conditions which are satisfied for normal distributions and Laplace distributions—the Fifty-

Percent Bias Hypothesis is true. However, neither symmetry nor logarithmically concave

cumulative distribution by itself is sufficient. See Appendix.

PROPOSITION 2 (FIFTY-PERCENT BIAS HYPOTHESIS). If plaintiff’s and defendant’s prediction

errors, and , are distributed with a common standard deviation (i.e., and

according to a joint probability density function that is symmetric around 0 and symmetric with

respect to each other, and if the stakes are equal (i.e., , and is symmetric and

Preliminary Draft

15

logarithmically concave, then the overall plaintiff trial win rate will be closer to fifty percent

than the plaintiff win rate among all disputes. In other words, | |

|∫

| for all , and all > 0. In addition, the inequality is strict unless

∫

.

E. NO INFERENCES HYPOTHESIS

Priest and Klein asserted that “plaintiff victories will tend toward 50 percent whether the

legal standard is negligence or strict liability, whether judges or juries are hostile or sympathetic”

(p.5). One implication is that it will not generally be possible to make inferences about the legal

standard based on the observed plaintiff trial win rate. As long as parties are aware of the legal

standard and judicial preferences, their settlement behavior will take those factors into account,

and the plaintiff trial win rate will not vary. (p.36). If this hypothesis were about limit values,

then it would be true under the Priest-Klein model since the limit value under Proposition 1 is

independent of and . Nevertheless, the limiting result holds only as goes to zero. But

this result is not necessarily relevant to empirical work, because, goes to zero, the number of

litigated cases also goes to zero. Thus, whenever one is doing empirical work on litigated cases,

one is necessarily dealing with a situation in which is strictly positive. Therefore, we think the

No Inferences Hypothesis can only be understood in the context of a fixed and positive . In a

companion article, Klerman & Lee (2014), we show that despite strong selection effects,

inferences about the legal standard of liability will be possible under a fairly broad set of

assumptions and parameter values. For the sake of completeness, we include our result for

inferences under Priest and Klein’s model but refer the reader to our companion piece for

exposition as well as the proof.

PROPOSITION 3 (NO INFERENCES HYPOTHESIS). The No Inferences Hypothesis is generally false.

Instead, inferences from plaintiff trial win rates to the legal standard are possible. Specifically,

under the assumptions of the Priest-Klein model, where

and parties predict the case’s

true merit with errors distributed according to a bivariate distribution with positive standard

deviation that is the same under the two legal standards and where the distribution of disputes is

log-concave, the plaintiff trial win rate is strictly higher under the more pro-plaintiff legal

standard. In other words,

| ̅

In fact, the proof shows that a natural inference is possible as long as the region of

integration is not empty and is invariant with respect to . This will be true under the model’s

current set-up where the Landes-Posner-Gould condition does not take into consideration the

specific distribution of disputes. Therefore, as long as there is a positive probability that

[ ] [ ]

—a condition violated when

–an inferences

arepossible. ( ) is assumed to have full support, but we need not assume that

( )or that the errors are distributed according to mean zero.

II. BARGAINING UNDER THE PRIEST-KLEIN’S MODEL

Preliminary Draft

16

Priest and Klein’s model has been criticized because it does not include an incentive-

compatible bargaining mechanism. Instead, Priest and Klein assume that whenever settlement

would be in the parties’ perceived mutual interest, they will successfully bargain to a settlement.

That is, they assume that the Landes-Posner-Gould litigation condition, , is a

necessary as well as sufficient condition. Put another way, Priest and Klein implicitly assume

the existence of an ex post efficient8 bargaining mechanism through which the parties would

always be able to settle when doing so was in their perceived mutual best interest.

Nevertheless, modern research in bargaining and mechanism design has reached the

pessimistic conclusion that, when there is asymmetric information, such an efficient mechanism

may not exist. See Myerson and Satterthwaite (1983). On the other hand, Myerson and

Satterthwaite’s theorem does not apply directly to the Priest-Klein model, because the parties’

estimates are not independent and type spaces are infinite. Instead, McAfee and Reny (1992)

suggests that in such cases an incentive-compatible ex post efficient trading may indeed be

possible if we have an outside broker—a budget balancer. Nevertheless, settlement negotiations

seldom if ever employ an outside broker who may contribute his or her own money, nor has

anyone implemented any other kind of efficient mechanism for settlement.Consequently, it is

worth investigating whether the validity of the Priest-Klein hypotheses would be affected by

relaxing the assumption that the Landes-Posner-Gould litigation condition is both necessary and

sufficient, and instead assuming a more plausible (albeit inefficient) bargaining mechanism.

Following Friedman and Wittman (2007), we investigate the implications of the

Chatterjee-Samuelson mechanism. Under that mechanism, plaintiff and defendant each submit

secret offers to a neutral party or computer. If the plaintiff’s offer is greater than the defendant’s

offer, the case goes to trial. If the plaintiff’s offer is less than or equal to the defendant’s offer,

then the case settles for the average of the two offers.9 As explained by Friedman and Wittman

(2007), even though the Chatterjee-Samuelson mechanism is seldom used in actual litigation, it

can be seen as a “reduced form of a more complicated but unspecified haggling between the

plaintiff and defendant lawyers.” (p. 110). We diverge from Friedman and Wittman (2007) in

that we graft the Chaterjee-Samuelson mechanism onto the Priest-Klein model rather than

starting from a completely different model.

We begin by making a few simplifications. First, we assume settlement costs are zero,

.10

Second, we assume , in that the plaintiff and the defendant are equal

in their ability to estimate . Third, we assume and are distributed independently

according to joint probability distribution ( ) ( ) ( ) such that

and is the standard cumulative distribution.

8 The literature uses “ex post inefficiency” to refer to cases where there is a mutually beneficial

agreement but the parties fail to reach it. In the litigation-settlement model, there is always ex post

inefficiency, because the parties would have been better off settling for the judgment amount without

incurring litigation costs. Thus, the relevant inefficiency must be relative to the parties’ ex ante

evaluation of the situation. That is, there is inefficiency if a settlement would have made both parties

think they were better off, given their ex ante evaluations of the case. 9 An alternative mechanism is to consider a sequential bargaining. But such a set-up will present a

“model duality” problem. The outcome may depend on whether the opening offer is made by the plaintiff

or the defendant. See Daughety & Reinganum (1994). 10

Positive settlement costs raise a question as to how to distinguish between cases that are simply

dropped by the plaintiffs and cases for which the two parties agree to settle at zero dollars.

Preliminary Draft

17

The mechanism proceeds as follows. The plaintiff receives a signal and the

defendant receives a signal . Simultaneously, the plaintiff makes a settlement

demand, , and the defendant makes a settlement offer, . If , the parties settle at

; otherwise, parties litigate.

In this Part, we continue to assume that parties do not use Bayes’ rule to estimate the

mean and standard deviation of the true value of . That is, plaintiff estimates the mean of to

be and the standard deviation to be , and defendant estimates the mean of to

be and the standard deviation to be .11

The rest of the set-up follows Friedman and Wittman (2007) closely. We consider pure

strategies contingent on the realized signal. Thus, a plaintiff’s strategy is a measurable function

that assigns the demand ( ) [ when he observes signal .

Similarly, a defendant’s strategy is a measurable function that assigns the demand

when he observes signal . The objective of the plaintiff is to maximize

the expected net payments, conditioned on his realized signal and the defendant’s strategy

. The defendant’s object is to minimize expected net payments.

There are several important differences, however, from Friedman and Wittman (2006).

First, the support for the strategy functions is the entire real line. Therefore, we cannot work

with uniform distributions. Second, the signals and are not independent. Instead, the

plaintiff makes inferences about the distribution of based on and likewise for the defendant.

The plaintiff estimates using a compound distribution: he first figures out the conditional

distribution of given his , and then conditions the expected distribution of on his

expected conditional distribution of . The defendant does likewise.

The payoff function for the plaintiff is

( )

∫ (

)

{ | }

( | )

∫ ( ( | ) ){ | }

( | )

The first-term in the right-hand side is the expected value of settling and the second-term is the

expected value of litigating. Therefore, ( | ) and ( | ) represent the conditional

distribution of the other party’s signal based on the party’s own signal. Likewise, the payoff for

the defendant is

( ( ) )

∫ ( ( )

)

{ | }

( | )

∫ ( ( | ) ){ | ( ) }

( | )

11

See Part III for relaxation of that assumption.

Preliminary Draft

18

We define the Nash equilibrium of the litigation game as follows.

Definition. A Nash equilibrium (NE) of the litigation game is a strategy pair

( ( ) ) such that ( ) ( ) and

( ( ) ).

As Friedman and Wittman (2007) note, existence of such a Nash equilibrium is not guaranteed

because we are limiting our analysis to pure strategy Nash equilibria. Note, however, that there will

always be trivial equilibria such as ( ( ) ) in which every dispute

goes to trial and neither party has any incentive to deviate from its strategy. Because such

equilibria predict that parties never settle, which is inconsistent with the reality of litigation, we

consider only equilibria in which the parties sometimes settle.

In their analysis, Friedman and Wittman (2007) focuses on symmetric Nash equilibria.

Their definition of symmetry would translate to our set-up as follows.

Definition. The strategies ( ) and are symmetric around if there exists

some such that, for all , we have , or equivalently,

.

The equilibrium we identified in Proposition 4 is indeed symmetric around in the limit. It

turns out that all Nash equilibria that are symmetric around will satisfy the Fifty Percent Limit

hypothesis. This can be seen as follows. The region of integration is defined as follows:

{ | } This region will be

symmetric around the line if and only if we can show that whenever ,

we must also have . But if the strategies are symmetric around , we

must have and .

Therefore, we must have if and only if we

have . So we have proved the following

Lemma.

LEMMA 4.1 (SYMMETRIC NASH EQUILIBRIUM). For all families of Nash equilibria that are

symmetric in the limit, the Fifty-Percent Limit Hypothesis holds true and the Asymmetric Stakes

Hypothesis will be false.

The following proposition (proved in Appendix) shows that for all is locally

continuous and nonzero around and bounded above everywhere, there exists at least one

family of non-trivial Nash equilibria which converges to a symmetric Nash equilibrium.

Although this is a family of equilibria (as parameter and vary), the specification in fact

exhibits a single equilibrium for given and .

PROPOSITION 4 (BARGAINING UNDER THE PRIEST-KLEIN MODEL). Suppose is locally

continuous and nonzero around and bounded above everywhere, and and are

distributed with mean zero according to ( ) ( ) ( ) such that

with full support over and is continuously differentiable. Then there exists and

Preliminary Draft

19

some suitable value such that for each the following condition

holds: there exists such that for all , there will generally12

exist a pair of

continuous functions ( ) such that

and for each , the following ( ( ) ) is a Nash equilibrium:13

( ) {

{

Moreover, for all such , the limit value of the plaintiff’s trial win rate will be

fifty-percent. This means simultaneously that for this equilibrium, the Fifty-Percent Limit

Hypothesis will hold but also the Asymmetric Stakes Hypothesis will be false for such values of .

Figure 3. Litigation Set in the -Space under Bargaining

12

We say “generally” because the proof makes use of the Implicit Function Theorem and thus

will depend on a particular Jacobian not taking on the value of zero at the particular equilibrium value.

Because the particular Jacobian is not identically zero, this will generally be the case, although it may be

possible to construct an example in which the Jacobian can take on the value of zero at the particular

equilibrium point. 13

The stability of this Nash equilibrium was checked for normal distributions using Excel

simulations.

Preliminary Draft

20

Figure 3 depicts, the set of case estimates litigated in the -plane when the

parties use the Chatterjee-Samuelson mechanism as approaches zero. In the case depicted

above, is properly contained in , and the set can be

considered the region of ex post inefficiency. What is surprising about Proposition 4 is that

because a similar set can properly contain even when (imagine a very small

change in which changes the shapes of only slightly from ), the Fifty Percent

Limit Hypothesis will continue to hold even with asymmetric stakes.

We cannot, however, claim the results for nonsymmetric equilibria. We are currently in

the process of running Matlab simulations to look for other families of equilibria. For the

moment, we conclude only that under the family of equilibria we have identified and under any

symmetric equilibria, the 50% Limit Hypothesis will be true.

III. ALLOWING PARTIES TO MAKE INFERENCES BASED ON THE DISTRIBUTION OF

DISPUTES

We mentioned that the original version of Priest and Klein’s model assumes (implicitly)

that parties estimate the plaintiff’s probability of prevailing without using Bayes’ rule or

information about the underlying distribution of all disputes. Consequently, thus far, we solved

the model under the assumption that ( | ) [

] and

| [

]. This means that the parties are not taking into

consideration the actual distribution of in estimating the likelihood of the plaintiff’s victory.

It also means that they are assuming that and , the standard deviations of and , are also

the standard deviations of the parties’ estimates .

It is not hard to see why this might be a problematic assumption. For example, consider

the simple discrete case where can only take on an integer value from 1 to 10 with each

integer value equally likely. Suppose each party observes within an error that is zero, +1, or -

1, each with probability one third. In this case, if a party observes 5, he is correct to presume that

the true will be 4, 5, or 6, each with probability one third. As a result, it would be rational for

the party to assume that the mean of the sampling distribution of is 5 and that same error

distribution (0, +1, -1) that generated the party’s observations will also describe the sampling

distribution. Suppose, however, that, with the same distribution of observation errors, never

takes the values 5 or 6. That is, { }, with each value equally likely, and each

party observes within an error that is 0, +1, or -1, each with probability one third. In this

situation, if a party observes 5 and knew about the underlying distribution of , he would know

with certainty that the true is exactly 4. It would be irrational to assume that the mean of the

sampling distribution was his observation,5, and that the true value of was distributed around

that value with errors zero, +1, or -1 with equal probability. This example suggests that a

rational party would use information about the distribution of disputes both in calculating the

mean of its estimate of and in calculating its standard deviation.

There are two potential modifications to Priest and Klein’s model to include the

assumption that parties do take the underlying distribution of disputes into consideration.

Unfortunately, neither of them is entirely satisfactory.

The first method is to interpret and not as the parties’ own estimates of but

merely informative signals the parties receive about defendant’s degree of fault. The parties then

Preliminary Draft

21

use the signals to make their estimates of . This approach is in significant tension with Priest

and Klein’s original model because it means that the parties point estimates of are not and

. This also means that, contrary to Priest and Klein’s model, the parties’ estimates will not be

distributed according to the stipulated functions, and . Rather, it is the signals

that are distributed in this way.

Another interpretation of this modification may be that the parties make mistakes and for

this reason, they observe different and . Suppose, for example, that the evidence (or signal)

is a series of numbers, that inferring the true requires adding those numbers together, and that

parties occasionally type the wrong number in their calculators. Then, even if the parties see the

same evidence (the numbers to be added), their calculations differ. Rational parties would do

their calculations, but would not fully trust the results. Instead, they would combine their

calculations with their knowledge of the distribution of all disputes and their knowledge of the

probably of key stroke error to make a corrected/conditional estimate of given the error-prone

calculation they actually made.

One conceptual difficulty with this approach is that the parties’ estimates are no longer

unbiased. For example, suppose the distribution of disputes is standard normal and true case

merit is . Although the parties will receive signals that average 1.5 (because the signals

are unbiased), they will take into account the fact that the standard normal is centered at zero, so

their estimates will, on average, be somewhat less than 1.5. This assumption is at variance with

a basic assumption of the Priest-Klein model.

Nevertheless, it turns out that this modification does not make a significant difference

when it comes to results pertaining to limits. This is because the new region of integration under

the modified approach, , while no longer independent of or , will coincide the

original region of the integration in the limit. Intuitively, it makes sense that even as

we allow the parties to make inferences based on the distribution of disputes, as we take the limit

as approaches zero, only the distribution of disputes that are very close to will significantly

affect the probability, and therefore the shape of on the whole will matter little. We

summarize the results in the following proposition, proved in Appendix.

PROPOSITION 5 (EQUIVALENCE OF THE LIMIT RESULTS UNDER THE MODIFIED BAYESIAN MODEL).

The results of Proposition 1 (together with Corollaries 1.2, 1.3, and 1.4) and the limit results

within Proposition 4 go through as stated under the modified Priest-Klein model. In other words,

the following results hold true under the modified Bayesian model of the Priest-Klein model:

IRRELEVANCE OF THE DISPUTE DISTRIBUTION HYPOTHESIS. The limit value of the plaintiff

trial win rate under the Priest-Klein model will not depend on the distribution of disputes

if and only if the plaintiff’s stake in not significantly larger than defendant’s stake.

FIFTY-PERCENT LIMIT HYPOTHESIS. If plaintiff’s and defendant’s prediction errors, and

, are distributed with a common standard deviation (i.e., and according to a

joint probability density function that is symmetric around 0 and symmetric with respect

to each other, and if the stakes are equal (i.e., , then the limit value of the plaintiff

trial win rate is fifty percent.

Preliminary Draft

22

ASYMMETRIC STAKES HYPOTHESIS. If the plaintiff’s stake in not significantly larger than

defendant’s stake, then the plaintiff trial win rate will be increasing in the ratio of the

plaintiff’s stake to the defendant’s stake.

ASYMMETRIC INFORMATION. The plaintiff trial win rate is (comparatively) higher the more

(comparatively) accurately plaintiff can estimate the true , and vice versa.

BARGAINING. Suppose is locally continuous and nonzero around and bounded

above everywhere, and and are distributed with mean zero according

to ( ) ( ) ( ) such that with full support over and

is continuously differentiable. Then there exists and some suitable value

such that for each the following condition holds: there

exists such that for all , there will generally14

exist a pair of

continuous functions ( ) such that

and for each , the following ( ( ) ) is a Nash

equilibrium:

( ) {

{

Moreover, for all such , the limit value of the plaintiff’s trial win rate

will be fifty-percent. This means simultaneously that for this equilibrium, the Fifty-

Percent Limit Hypothesis will hold but also the Asymmetric Stakes Hypothesis will be

false for such values of

Meanwhile, the proofs of other results—specifically those that are not about limits--do

not readily go through and require other assumptions. These include the Fifty-Percent Bias

Hypothesis and the No Inferences Hypothesis. The reason these results may be different is that

the disputes most heavily litigated may not be distributed close to the decision standard, . For

example, in the case of normal distributions centered at 0, given a decision standard , the

disputes most heavily litigated will be centered around rather than . This is another

way in which this modification leads to results contrary to Priest and Klein’s model. A central

aspect of Priest and Klein’s model was that disputes lying most closely to the decision standard

would be most heavily litigated. Under the modification, however, litigated disputes will tend to

be farther away from the mean than the decision standard. For large values, the center of

litigated disputes will in fact have little to do with the decision standard. This means that when

the legal standard favors the plaintiff, , litigated cases may more often be those the

14

We say “generally” because the proof makes use of the Implicit Function Theorem and thus

will depend on a particular Jacobian not taking on the value of zero at the particular equilibrium value.

Because the particular Jacobian is not identically zero, this will generally be the case, although it may be

possible to construct an example in which the Jacobian can take on the value of zero at the particular

equilibrium point.

Preliminary Draft

23

defendant will win. Conversely, when the legal standard favors the defendant, , litigated

cases may more often be those the plaintiff will win. For this reason, the Fifty-Percent Bias

Hypothesis is valid and inferences from litigated cases will only be possible when is sufficiently

small.

PROPOSITION 6. (NON-LIMIT RESULTS UNDER MODIFIED BAYSEAN MODEL) When is

standard normal distribution and sufficiently small, the results of Propositions 2 and 3 go

through. In other words,

FIFTY-PERCENT BIAS HYPOTHESIS. If plaintiff’s and defendant’s prediction errors, and

, are distributed with a common standard deviation (i.e., and according to a

joint probability density function that is symmetric around 0 and symmetric with respect

to each other, and if the stakes are equal (i.e., , and is symmetric and

logarithmically concave, then the overall plaintiff trial win rate will be closer to fifty

percent than the plaintiff win rate among all disputes.

INFERENCES FROM LITIGATED CASES. Under the assumptions of the Priest-Klein model,

where

and parties predict the case’s true merit with errors distributed according

to a bivariate distribution with positive standard deviation that is the same under the two

legal standards and where the distribution of disputes is log-concave, the plaintiff trial

win rate is strictly higher under the more pro-plaintiff legal standard.

An alternative modification is to assume that parties receive signals and

but the signals are distributed in such a way that when parties take into

consideration, they recover estimates and . If such signal

distributions can be constructed, then all of the results would go through. Unfortunately, it is not

clear that such distributions can always be constructed.

IV. CONCLUSION

This paper provides an in-depth analysis of Priest and Klein’s hypotheses by revisiting

the mathematics behind their model. We conclude that the Fifty-Percent Limit Hypothesis is

mathematically well-founded and is true under the assumptions made by Priest and Klein and

under a wider array of assumptions. For example, under Priest and Klein’s original model the

Fifty-Percent Limit Hypothesis, Trial Selection Hypothesis, and Irrelevance of Dispute

Distribution Hypothesis are true (a) for any distribution of disputes that is bounded and both

positive and continuous near the decision standard, (b) even if the parties’ prediction errors are

not independent. In addition, these hypotheses remain true if the parties use the Chatterjee-

Samuelson bargaining mechanism and even if parties use Bayes’ rule to calculate the mean and

distribution of true case merit. The Asymmetric Information and Fifty-Percent Bias Hypotheses

are often true, but are less robust to parametric and modeling changes. Finally, the No

Inferences Hypothesis is generally false.

Preliminary Draft

24

APPENDIX

This Appendix contains all the proofs not already stated in the main text.

PROOF OF THEOREM 1. Based on the sketch of the proof included in the text, we begin by

showing Lemma A1.

LEMMA A1. ∫

is finite for all

and

∫

∫

15

Proof of Lemma A1 requires several steps. Our strategy in proving Lemma A1 is to construct a

Lebesgue-integrable function that dominates The function need not be

continuous. It need only integrate to a finite value. We construct by integrating (

) over an area that properly contains . Let us begin by considering Lemma A2, which

describes the shape of .

LEMMA A2.

is non-empty if and only if

.

For all

, is bounded left by a vertical asymptote at

.

For all

, is bounded above by a horizontal asymptote

.

When

, is not bounded above, and if

, it is characterized by a

region to the right of an increasing curve that has two vertical asymptotes, at

and

(

)

PROOF OF LEMMA A2. If

, is empty since [ ] [ ] for all . On the

other hand,

, for high enough and low enough , we can find some value such that [ ]

[ ]

Meanwhile, if [ ] [ ]

then [ ]

, and thus is bounded left by

. If

[ ] [ ]

Thus, is bounded above by

. Meanwhile, if

then the boundary curve [ ] [ ] must

continually increase as increases. Furthermore, if

, then the inequality must hold for all

value of (

) and all values of . Q.E.D.

Importantly, for

is properly contained by a translated fourth

quadrant in the –plane (with the origin at (

)

. This set is obviously contained

15

For

the value of depends on both the shape of [ ] and the value of .

Preliminary Draft

25

in all of . Moreover, for (

), the point (

) is at least

(

) away from

. For (

) (

) is at least (

) away from . Therefore, we have

the following two Corollaries.

COROLLARY A3. Let { (

) (

)} and

{ (

) (

)} Then for

is properly contained

in for every where

{ | |

{ || | | | |

| |

| } | |

COROLLARY A4. For

each ,

where

∬ (

)

We are now ready to prove Lemma A1.

PROOF OF LEMMA A1. For the first part, by Corollary A4, we need only show

∫

is finite.

∫

∫ ∫

∫

Meanwhile, for each

(| | | | |

| |

| ) (| | |

{ }|

|

| |

{ }| ). By bivariate Chebyshev’s Inequality,

(| | |

{ }| |

| |

{ }| )

√ ( )

(|

{ }| )

Therefore,

√ ( )

(|

{ }| )

which is quadratic in in the denominator and therefore integrates to a finite value over [ . The

integral over ] can likewise be shown to be finite. However, in this case we need not assume

since is always bounded by a vertical asymptote.

Meanwhile, for all

is defined by two vertical asymptotes. Therefore,

can be bounded below by which integrates (

) over the

complement of i.e., over a rectangle containing (

), which increases in . Therefore, again

Preliminary Draft

26

by bivariate Chebyshev’s Inequality, we can show that this lower bound integrand is bounded below by

( √ ( )

(|

{ }| )

), which clearly integrates to infinity over all . Q.E.D.

PROOF OF THEOREM 1 (CONTINUED). For

the rest of the proof is explained in

the text. For

, note that

∫

∫

∫

∫

∫

Here

∫

∫

as before, since is

bound by a left asymptote. Meanwhile, we cannot apply Lebesgue’s Dominated Convergence

Theorem to ∫

because contains all of the increasing large

rectangles. Instead, we can write

∫

∫ ( )

∫

∫

where ∬ (

)

where is the complement of

. Then we can apply Lebesgue’s Dominated Convergence Theorem to

∫

. Therefore,

∫

∫

∫

∫

∫

∫

∫

∫

∫

∫

∫

∫

Since

∫

, while all other terms are finite, the limit value

of the plaintiff win rate is 1. Q.E.D.

PROOF OF LEMMA 1.1. We show that for all ,

Notice

∬ ( )

where the region of integration is

Preliminary Draft

27

{( )| [

] [

]

}

{( )| ( [ ( )

]) ( [

])

}

{( )| [

] [

]

}

Since ( )is symmetric with respect to and , ( ) ( ). Thus we can

swap and without changing the value of the integral. The result follows Q.E.D.

PROOF OF COROLLARY 1.2. By Lemma 2, is symmetric around , which implies

is symmetric around zero. By the Theorem, the limit value must be . Q.E.D.

PROOF OF COROLLARY 1.3. The result is immediate since for

,

properly belongs to . Q.E.D.

PROOF OF COROLLARY 1.4. For

is properly contained by a translated fourth

quadrant in the –plane (with the origin at (

)

. affects the slope of the graph

along which to take the integral over . As increases, plaintiff is comparatively more accurate

in assessing the merit of the case than the defendant. But as increases, the slope of the line decreases

and the line becomes closer to in the first quadrant and farther from in the fourth

quadrant. Thus, plaintiff’s win rate increases and defendant’s win rate decreases. Furthermore, as

approaches infinity, the line will approximate the -axis and will eventually be properly

contained in for sufficiently large , and hence ∫

will be infinite. Meanwhile,

the line the line will lie wholly outside (and increasingly so) for negative . Thus,

plaintiff’s win rate will approach 1. Similarly, as approaches 0, plaintiff’s win rate will approach zero.

Q.E.D.

PROOF OF PROPOSITION 2. Since and are both symmetric. Without loss of

generality, assume is centered around 0. The inequality is clearly satisfied when is at . Since

is log-concave, (i) it is also single-peaked (including the possibility of a flat-top) and (ii) its

cumulative distribution, , is also log-concave. Without loss of generality, suppose Then we

need show | | ∫

First notice

This can be seen as

follows. over all of is greater than

over a symmetric image of |

around . The latter graph lies completely under because is symmetric and single-peaked.

And clearly, over a symmetric image of | around is ½. Thus, we need only

show ∫

This is equivalent to showing

∫

∫

∫

∫

∫

This inequality rearranges to

∫

∫

∫

∫

By changing the variable to for the right-side integral and for the left-side

integral and recognizing that is symmetric around , we have

Preliminary Draft

28

∫ (

∫

)

∫ (

∫

)

∫ (

∫

)

Since is strictly decreasing in , we need only show (

∫

) as a probability

density function defined over [ ∞ first-order stochastically dominates (

∫

). Equivalently,

we need to show ∫

∫

∫

∫

, or

( ) ( )

since for all . Since , it now suffices to show ( )

is

decreasing in . Under the Quotient Rule, this is true if

( ) ( )

which holds if

is decreasing in , or put differently, if ( )

. But this last

inequality holds since is log-concave (which must be true since is log-concave).

Meanwhile, for a general log-concave function or a single-peaked function that is not symmetric,

the inequality generally does not hold at the mean for all since does not always equal ½.

For a symmetric counter example, consider , which equals zero everywhere but takes on the value

of 1 on [

] and [

], where

This function is clearly symmetric around

. Then ∫

But in this case, plaintiff’s trial win rate is

∫

∫

∫

By Chebyshev’s Inequality, the numerator can get arbitrarily small as X increases, while the denominator

maintains a certain minimum value. Hence plaintiff’s win rate among litigated cases can get arbitrarily

small, even though the win rate over the entire sample would have been ½. Since this example has a

discontinuity at Y*, the limit hypothesis, too, will not hold. Instead, as prediction errors go to zero, the

plaintiff will lose zero percent of litigated cases. Nevertheless, it is possible to create a continuous

version of this counter-example which closely approximates it, under which the limit hypothesis is true,

but the bias hypothesis is false. The continuous version is the same as discussed above, except

takes on the value 1 on [

] and [

], where is much smaller than x-½. In

this continuous version, as the parties’ prediction errors go to zero, the proportion of cases won by the

plaintiff converges to fifty percent. Nevertheless, if prediction errors, are large compared to but

small as compared to x-½, then the proportion of plaintiff trial victories will be substantially smaller than

fifty percent, even though the proportion of plaintiff victories, if all cases went to trial, would be very

close to fifty-percent. Thus, the bias hypothesis fails, even though the limit hypothesis would hold.

Q.E.D.

Preliminary Draft

29

PROOF OF PROPOSITION 3. A version of this proof comes from Klerman & Lee (2014). Let be

the decision standard and that , where (

). Rewrite the plaintiff trial win rate is

∫

∫

where

∫ ( ) ( )

∫

is the ratio of plaintiff’s wins over defendant’s wins. It thus

suffices to show

Notice that since , we must also have

So if we make a change of variable ′ , then we can rewrite

as . So

∫

∫

By the Quotient Rule and Leibniz’s Rule,

if and only if

(∫

)(∫

)

(∫

)(∫

)

Since is log-concave, we have ( ) This in turn implies

( )

is

constantly (weakly) decreasing in . Therefore, we must have for all

and for all This indicates that the left-hand side is bounded

above by (∫

) (∫

), while the right-hand side is

bounded below by it. Moreover, the inequality must be strict since cannot be a constant function

when is a probability density function.

Finally, to show that we can apply Leibniz’s Rule over the infinite interval, we need to show

|

| | | is bounded by an integrable function. Indeed, |

| | | | | is constantly (weakly) decreasing in . If

is always positive, then | | , which must integrate to a finite value

since , as a log-concave probability density function must approach zero in as goes to infinity. If

is negative at any point, then the integral of | | will be the difference of two

integrals of (over complementary regions), each of which must integrate to a finite value.

Hence, | | is an integrable function, and the result follows. Q.E.D.

PROOF OF PROPOSITION 4. We begin by making the usual change of variables. Rewrite the

plaintiff’s and the defendant’s strategies as follows: ( ) and . Our strategy

is to look for Nash equilibrium strategies in which both parties employ a step function strategy at same

values but at different cutoff points. When the case estimate is very low ( and are both highly

negative), then both parties settle at 0. When the case estimate is very high ( and are both highly

positive), then both parties settle at some . But the plaintiff and the defendant differ in terms of the

cutoff signals, which we write as for the plaintiff and for the defendant. For this

pair of strategies to be an equilibrium for a fixed , the plaintiff must be indifferent between

demanding 0 and demanding at , and the defendant must be indifferent between offering 0

and offering at . It is easy to check that under this set-up, these two indifference conditions

will be the necessary and sufficient condition for the stated pair of strategies to be a Nash equilibrium.

Preliminary Draft

30

For a given , the defendant is indifferent at if and only if

( | ) (

) ( | )

( | )

( | ) ( | )

Rewrite this as

( | ) (

)

( | ) ( ( | ) )

and call this Condition . For the plaintiff, we have

( | ) ( | ) (

)

( | ) ( ( | ) )

( | )

Rewrite this as

( | ) (

)

( | ) ( ( | ) )

and call it Condition .

Now take the limits of Condition and Condition as goes to zero. In the limit, we must

have the following

| (

) | |

| (

) | ( | )

Now, it is clear that | | and | | (assuming we have no probability masses). Furthermore, in the limit we will also have

| | . This can be seen as follows:

( | )

∫

∫

∫

∫

∫

∫

We can take the limits inside the integral since | |

which must integrate to over all . Similarly, we have

( | )

∫ ( )

∫

∫

∫

∫

∫

∫

∫

∫

∫

Preliminary Draft

31

Therefore, in the limit we have | | . This

means that if we have ( ∫

∫

) (∫

∫

) in the limit,

then the two conditions collapse into one. This will be true if ( )

where ∫

∫

. At this point, we show a suitable value exists that the two

conditions (now one) are satisfied:

| (

) | ( | )

Suppose so that . At , this equation is greater than (

) (

) (

) ,

which is positive for reasonable values of and . As approaches infinity, the first term approaches

0 and the second term approaches , which is strictly negative. Therefore, by the Intermediate Value

Theorem, there must be at least one value of for which the equation is true. By continuity, for close

to 1, the same argument shows that such must also exist generally.

Now we show that there exist a pair of continuous functions ( ) such that

and that ( ( ) ) is a Bayesian Nash equilibrium for

each . Again, by continuity, we need only show for . We rewrite Conditions and as follows:

(∫

) (

)

(∫

)( ∫

∫

)

(∫

) (

)

(∫

)( ∫

∫

)

Then ( ) is a continuously differentiable function from to

such that and the function is even definable for negative values of . Then by the

Implicit Function Theorem,16

as long as the Jacobian matrix is invertible at , there is a small

neighborhood around 17 for which we can find a unique ( ) for each such that

( ) and . Thus, we need only check that

the Jabobian matrix is invertible at . Looking at the equation, we see that

and the determinant cannot be identically zero since (

) (

)

(

) (

). The left-hand side has terms involving mostly ’s while the right-hand

side has terms involving ’s and ’s. Hence, the proposition is proved generally.

16

We thank Ken Alexander for suggesting the use of the Implicit Function Theorem to complete

this argument. 17

Although in this model, , as standard deviation, is necessarily positive, both and

simply as mathematical functions in three variables , and , are continuous in

around and well-defined for as well.

Preliminary Draft

32

The similar logic can be applied to show that when but close to it, for a suitable value,

there exists a similar equilibrium in which they settle at 0 for and far less than 0, settle at (

) for and far greater than 0, and seek to litigate when the signals are

sufficiently closer to 0. Notice this equilibrium settlement values equal the first equilibrium when .

Because under the set-up, the two strategies do converge to a pair of points symmetric around 0, in this

case even with asymmetric stakes, the Fifty-Percent Limit hypothesis will go through. Meanwhile, since

the region of integration in the limit is { | } symmetric around the line

, even for , the limit value will be 50% even for such , and the Asymmetric Stakes

Hypothesis will be false. Q.E.D.

PROOF OF PROPOSITION 5. For a given , we employ the same change of variables by letting

. Then we have

( )

( ) (

)

Meanwhile, { |

}

where

∫

∫

. Therefore, for each

∬ (

)

Now the region of integration in the –plane will continue to depend on and . But still, we have

∫

∫

∫

∫

Notice that the same logic as in the proof of Proposition 1 (and other limit results) will go through

as long as, there is an absolute lower limit ̅ and an absolute upper limit ̅ such that, for each

(where can be assumed to be sufficiently small) and for each , is bounded

above by ̅ and bounded on the left side by ̅ In this case,

∬ (

)

{ | ̅ ̅}, which will eventually decrease at least as fast as in | | . In

this case, we can take the limit inside the integral. Then

∫

∫

∫

∫

since is locally continuous and . Meanwhile

∬ (

)

∬ (

)

Since

{ |

∫

∫

∫

∫

}

Preliminary Draft

33

and since each integrand is bounded above by

, which is clearly Lebesgue-integrable, we can

take the limits inside the integral once again and factor out . And therefore,

{ |

∫

∫

∫

∫

}

{ | [ ] [ ]

}

It thus suffices to show that there is an absolute upper limit ̅ and an absolute lower limit ̅ for

, where is sufficiently small. We need show that, for sufficiently small , as

approaches infinity, the boundary of does not go to infinity, and as approaches

negative infinity, the boundary does not go to negative infinity. But this is obvious since

∫

∫

and ∫

∫

, purely as functions defined in terms of variable ,

are continuous in at . This establishes that the results of Proposition 1 and Corollaries 1.2, 1.3,

and 1.4 will go through.

To see that the result of Proposition 4 will go through, we note that, following the logic behind

the proof of Proposition 4, the two indifference Conditions will collapse into one under the limit and in

fact, this condition in the limit will be identical to the single condition from the proof of Proposition 4.

For the existence result to go through for , we apply the Implicit Function Theorem in the

following two functions in the similar manner:

(∫

∫

)(

)

(∫

∫

)( ∫

∫

)

(∫

∫

)(

)

(∫

∫

)( ∫

∫

)

Since the Jacobian is again not identically zero, the result will hold generally. Q.E.D.

PROOF OF PROPOSITION 6. We begin by assuming . Note that ( ′ | ′ )

∫ ( ( ′ ))

∫ ( ( ′ ))

. By making the change of variable ( ′ ) , rewrite this as follows:

∫ ( ( ′ ))

∫ ( ( ′ ))

∫ (( ′ ) ) ′

∫ (( ′ ) )

Note that

Preliminary Draft

34

∫ (( ′ ) ) ′

(

)∫

(

(( ′ ) )

)

′

Meanwhile,

(( ′ ) )

(

) [ (( ′ ) )

]

(

(

√ )

)

[

( ( ′ )

)

′

]

where ′ does not depend on . Factors not dependent on appropriately cancel when we

divide by ∫ (( ′ ) )

. So we are left with a CDF of normal distribution with

standard deviation

√ centered at

( ) evaluated at . The new litigation

condition is that a dispute ′gets litigated if and only if

[( )

( )

(

√ )

] [

(

√ )

]

[

√ ] [

√ ]

Notice that since , we must have So if we make a change of variable and rewrite as .

For the Fifty-Percent Bias Hypothesis, as with Proposition 2, we need to show

∫

∫

∫

∫

∫

∫

∫

Notice that at , the inequality to be shown translates to

∫

∫

∫

∫

which was shown in Proposition 2. Since the expressions are continuous in , for sufficiently small

, the inequality must continue to hold. For the result equivalent to Proposition 3, we again let

∫ ( ) ( )

∫

and we show

for sufficiently small . So

Preliminary Draft

35

∫

∫

By the Quotient Rule, Chain Rule, and Leibniz’s Rule,

if and only if

(∫

) ( ∫

)

is less than

(∫

) ( ∫

).

Again, at , the inequality translates to

∫

∫

∫

∫

which is equivalent to what we showed in the proof of Proposition 3. By the same logic as above, for

small values of , the inequality will continue to hold. Q.E.D.

References

Atkinson, Scott. 2009. “The Economics of a Centralized Judiciary: Uniformity, Forum

Shopping, and the Federal Circuit” 52 Journal of Law and Economics 411.

Bernardo, Antonio, Eric Talley, and Ivo Welch. 2000. “A Theory of Legal Presumptions.”

15 J. Law, Economics & Organization 1-48.

Friedman, Daniel and Donald Wittman. 2007. Litigation with Symmetric Bargaining and

Two-Sided Incomplete Information. The Journal of Law, Economics, & Organization 23:98-126.

Gelbach, Jonah. 2012. “Locking the Doors to Discovery? Assessing the Effects of

Twombly and Iqbal on Access to Discovery.” 121 Yale L. J. 2270.

Hubbard, William. 2013. “Testing for Change in Procedural Standards, with Application

to Bell Atlantic v. Twombly.” 42 J. Legal. Stud. 35.

Hylton, Keith. 1993. “Asymmetric Information and the Selection of Disputes for

Litigation.” 22 Journal of Legal Studies 187-210

Hylton, Keith N. and Haizhen Lin, 2012. “A Formalization and Extension of the Priest-

Klein Hypothesis.” Unpublished. Available at SSRN: http://ssrn.com/abstract=1864728 or

http://dx.doi.org/10.2139/ssrn.1864728.

Klerman, Daniel and Yoon-Ho Alex Lee. 2014. “Inferences from Litigated Cases.”

Unpublished.

McAfee, R. Preston and Philip J. Reny. 1992. “Correlated Information and Mechanism

Design.” 60 Econometrica 395-421.

Myerson, Roger B. and Satterthwaite, Mark A. 1983. “Efficient Mechanisms for Bilateral

Trading.” 29 Journal of Economic Theory 265-81.

Preliminary Draft

36

Priest, George and Benjamin Klein. 1984. “The Selection of Disputes for Litigation,” 13

Journal of Legal Studies 1-55.

Daughety, Andrew, and Jennifer Reinganum. 1994. “Settlement Negotiations with Two-

Sided Asymmetric Information: Model Duality, Information Distribution, and Efficiency.” 14

Int’l Rev. of Law and Econ. 283-298.

Shavell, Steven. 1996. “Any Frequency of Plaintiff Victory at Trial is Possible,” 26

Journal of Legal Studies 493-501.

Shapiro, Fred and Michelle Pearse. 2012. “The Most-Cited Law Review Articles of All

Time,” 110 Mich. L. Rev. 1483 (2012)

Siegelman, Peter and John J. Donohue, III. 1995. “The Selection of Employment

Discrimination Suits for Litigation: Using Business Cycle Effects to Test the Priest-Klein

Hypothesis.” 24 J. Legal Stud. 427

Shapiro, Fred. 1996. “The Most-Cited Law Review Articles Revised.” 71 Chicago-Kent

Law Review 751.

Waldfogel, Joel. 1995. “The Selection Hypothesis and the Relationship between Trial and

Plaintiff Victory.” 103 Journal of Political Economy 229-260.

Waldfogel, Joel. 1998. “Reconciling Asymmetric Information and Divergent

Expectations Theories of Litigation.” 41 Journal of Law & Economics 451-476.

Wittman, Donald. 1985. “Is the Selection of Cases for Trial Biased?” 14 Journal of Legal

Studies 185-214.

Documents

The Priest-Klein Hypotheses: Proofs, Generality, and