1905702

Embed Size (px)

Citation preview

  • 8/10/2019 1905702

    1/33

    Multiple Regression for Systems of EquationsAuthor(s): Gerhard TintnerSource: Econometrica, Vol. 14, No. 1 (Jan., 1946), pp. 5-36Published by: The Econometric SocietyStable URL: http://www.jstor.org/stable/1905702.

    Accessed: 03/11/2014 12:53

    Your use of the JSTOR archive indicates your acceptance of the Terms & Conditions of Use, available at.http://www.jstor.org/page/info/about/policies/terms.jsp

    .JSTOR is a not-for-profit service that helps scholars, researchers, and students discover, use, and build upon a wide range of

    content in a trusted digital archive. We use information technology and tools to increase productivity and facilitate new forms

    of scholarship. For more information about JSTOR, please contact [email protected].

    .

    The Econometric Societyis collaborating with JSTOR to digitize, preserve and extend access toEconometrica.

    http://www.jstor.org

    This content downloaded from 132.248.147.90 on Mon, 3 Nov 2014 12:53:35 PMAll use subject to JSTOR Terms and Conditions

    http://www.jstor.org/action/showPublisher?publisherCode=econosochttp://www.jstor.org/stable/1905702?origin=JSTOR-pdfhttp://www.jstor.org/page/info/about/policies/terms.jsphttp://www.jstor.org/page/info/about/policies/terms.jsphttp://www.jstor.org/page/info/about/policies/terms.jsphttp://www.jstor.org/page/info/about/policies/terms.jsphttp://www.jstor.org/page/info/about/policies/terms.jsphttp://www.jstor.org/stable/1905702?origin=JSTOR-pdfhttp://www.jstor.org/action/showPublisher?publisherCode=econosoc
  • 8/10/2019 1905702

    2/33

    MULTIPLE

    REGRESSION FOR

    SYSTEMS

    OF

    EQUATIONS*t

    By

    GERHARD TINTNER

    1. THE PROBLEM

    It has

    been

    apparent that the classical

    method of

    least-squares

    fitting of

    single equations in a

    system of

    economic equations leads fre-

    quently to

    unsatisfactory results.

    (But it is

    perfectly adequate for pre-

    diction.) The reason

    for this is

    that the

    actually observed economic

    variables are really

    jointly

    determined by all the equations in the

    sys-

    tem

    and

    not, in

    general, by a single one of

    them. For

    instance, the

    prices and

    quantities actually

    established on

    a market

    are determined

    by the intersection of the demand and supply curves. This was already

    recognized early in

    connection

    with empirical work

    dealing

    with de-

    mand

    function, for

    instance, by Frisch' and E.

    Working.2

    A

    systematic

    treatment

    of

    the

    problem in terms of modern statistical ideas

    is

    due to

    Haavelmo.3 Klein,4

    Koopmans,

    Marschak,6 Wald,7 and Smith8

    made

    important

    contributions.

    *

    The author is much obliged to 0. H.

    Brownlee

    (Ames), W. G. Cochran

    (Ames), H. Hotelling (Columbia), L.

    Hurwicz (Chicago), L. R. Klein (Chicago),

    J. Marschak (Chicago), T. Koopmans (Chicago), A. Wald (Columbia), and F. V.

    Waugh (Washington) for advice and criticism.

    t

    Journal

    Paper

    No. J-1344

    of

    the

    Iowa

    Agricultural

    Experiment

    Station,

    Ames, Iowa, Project No. 40-62-730.

    1

    R. Frisch, Pitfalls in the Statistical

    Constructionof Demand and Supply Curves,

    Veroffentlichungen der Frankfurter Gesellschaft fur Konjunkturforschung. Neue

    Folge, Heft 5, Leipzig, 1933, 39 pp.

    2

    E. J. Working, What Do Statistical

    Demand Curves Show, Quarterly

    Journal of Economics, Vol. 41, February,

    1927, pp.

    212

    ff.

    I

    T. Haavelmo, The Statistical

    Implications of a System of Simultaneous

    Equations,

    ECONOMETRICA,

    Vol. 11, January, 1943, pp. 1-12; The Probability

    Approach in Economics, ibid., Vol. 12,

    Supplement, July, 1944, 118 pp.

    4L.

    R. Klein, Pitfalls in the Statistical

    Determination of the Investment

    Schedule, ECONOMETRICA, Vol. 11, July-October, 1943, pp. 246-258.

    5

    T. Koopmans, Statistical Estimation of Simultaneous Economic Relations,

    Journal of the American Statistical

    Association,

    Vol.

    40, December, 1945, pp. 448-

    466.

    6

    J. Marschak, Economic Interdependence

    and Statistical

    Analysis,

    in

    Stud-

    ies in Mathematical Economics and

    Econometrics, In Memory of Henry Schultz,

    Chicago, 1942, pp. 135-150; (with W.

    H.

    Andrews)

    Random

    Simultaneous

    Equations and the Theory of Production,

    ECONOMETRICA,

    Vol. 12, July-Octo-

    ber, 1944, pp. 143-205.

    7H.

    B. Mann and A. Wald, On the Statistical

    Treatment of Linear Stochastic

    Difference Equations, ECONOMETRICA, Vol. 11, July-October, 1943, pp.

    173-

    220.

    8

    J. H. Smith, Weighted Regressions

    in

    the

    Analysis

    of

    Economic

    Series,

    in

    Studies

    in

    Mathematical

    Economics

    and

    Econometrics,

    n

    Memory of Henry Schultz,

    pp. 151-164.

    ID

    This content downloaded from 132.248.147.90 on Mon, 3 Nov 2014 12:53:35 PMAll use subject to JSTOR Terms and Conditions

    http://www.jstor.org/page/info/about/policies/terms.jsphttp://www.jstor.org/page/info/about/policies/terms.jsphttp://www.jstor.org/page/info/about/policies/terms.jsp
  • 8/10/2019 1905702

    3/33

    6 GGERHARD TINTNER

    Many of the authors following

    Haavelmo suggested the introduction

    of stochastic or error terms

    on the right-hand side

    of the

    theoretical

    equations of

    the

    contemplated system.

    We

    propose

    here a somewhat

    different approach and will also show that both lines of attack are really

    special cases of a more

    general method of solution

    which has not yet

    yielded to statistical analysis.

    In the following we propose to deal

    with systems of economic

    equa-

    tions. But it is not unlikely

    that our methods may be

    applicable in other

    fields, especially in the case

    of certain biological problems.

    We

    will first give an economic

    theoretical justification of our

    method,

    then sketch the statistical procedures implied. After

    this we will deal

    with the troublesome question of meaningful economic relationships

    (identification) and then

    indicate tests of significance.

    Finally we will

    give

    an

    application to agricultural

    data.

    2. THEORETICAL

    JUSTIFICATION

    In

    order to clarify our ideas we will first assume

    the existence of

    a

    complete static Walrasian system:

    (1) G(zi(Z Z2,

    ..

    IZQ)

    =0,

    i

    =1,

    2,

    ,q

    q variables

    zj.

    We can imagine that

    these equations determine

    completely the q economic variables

    Zl, Z2, * *

    Zq.

    These can be

    interpreted

    as all the

    prices,

    al the

    in-

    terest rates, all quantities

    of commodities exchanged and

    produced,

    wages, etc.

    Apart from these q

    variables we have also

    Q-q parameters

    ZQ+i,

    - -

    *, ZQ.

    These

    parameters

    exert

    an

    influence

    on the

    system

    but

    are

    themselves independent of

    the variables

    zi,

    Z2,

    - -

    *,

    Z,.

    They are,

    for

    instance:

    technological

    coefficients, institutional

    data like

    income

    distribution or property

    of factors of production, periods of payment,

    noneconomic variables like the weather, etc. It will

    frequently depend

    upon

    the

    exact

    nature

    of the system considered which

    variables

    will

    fall

    into the first and which

    into the second category. One important

    consideration is the

    distinction

    between short-term

    and

    long-term sys-

    tems. Fixed capital for instance will belong to the

    second category

    in

    the

    short run

    but to

    the first in the long run.

    The equations (1) are derived by some assumption of rational be-

    havior, i.e., the striving for maximum profit or utility.

    We will disregard

    here the problem of the

    existence of economically significant

    solutions

    for

    our system which has

    been discussed by von Neumann9

    and Wald.10

    9

    J. von Neumann, Ueber

    ein 6konomisches

    Gleichgewichtssystem

    und

    eine

    Verallgemeinerung

    des

    Brouwerschen

    Fixpunktsatzes, Ergebnisse

    eines

    mathe-

    matischen

    Kolloquiums,

    Heft

    8, 1935-36, pp.

    73

    ff.

    This content downloaded from 132.248.147.90 on Mon, 3 Nov 2014 12:53:35 PMAll use subject to JSTOR Terms and Conditions

    http://www.jstor.org/page/info/about/policies/terms.jsphttp://www.jstor.org/page/info/about/policies/terms.jsphttp://www.jstor.org/page/info/about/policies/terms.jsp
  • 8/10/2019 1905702

    4/33

    MULTIPLE

    REGRESSION

    FOR SYSTEMS OF EQUATIONS

    7

    Stability

    conditions will also

    be taken

    as fulfilled. It should be

    men-

    tioned

    that Leontief

    has recently attempted to

    verify

    such static

    sys-

    tems

    in

    a

    somewhat

    simplified form.

    We can also consider system (1) as representing a nonstatic Wal-

    rasian

    system of

    general equilibrium.

    Anticipations

    have then to be

    introduced.

    Such

    ideas have been treated

    extensively

    by Hicks12and

    recently by Lange.'3

    The present

    author

    has shown that the assumption

    of

    specific

    routines

    of anticipations leads

    to dynamic

    terms in the

    equations

    (1).14 This is similar

    to procedures

    adopted earlier

    by Evans'5

    and

    Roos.'6

    That

    is to say, some

    of the

    zi

    in (1)

    have now to

    be inter-

    preted

    as referring to different

    points in

    time, as

    derivatives with re-

    spect to time, integrals, etc.

    It

    seems

    to us that

    such an

    approach is similar

    to the one

    implied

    in

    certain

    mathematical

    business-cycle theories,

    especially

    those

    of

    Tin-

    bergen,'7

    Kalecki,'8 Davis,

    and the author.20

    Empirical verifications

    are available

    in the extensive

    work of

    Tinbergen2l and also

    on a much

    smaller

    scale

    in

    a

    publication

    by the author of this

    article.22

    It has already been

    pointed

    out by Pareto2' and

    emphasized

    by

    later

    writers24

    hat the labor of determining

    empirically

    the system

    (1) would

    10

    A. Wald, Uebereinige Gleichungssysteme dermathematischen Oekonomie,

    Zeitschrift ir

    Nationalokonomie,

    Vol. 7, 1936, pp.

    637-670.

    11

    W.

    Leontief,

    The Structure

    of American

    Economy, 1919-1929,

    Cambridge,

    Mass., 1941,

    181 pp.

    12 J R. Hicks,

    Value and

    Capital, Oxford,

    1939, 331

    pp.

    13

    0. Lange,

    Price Flexibility and Employment,

    Bloomington, Ind., 1944,

    114

    pp.

    1

    G. Tintner,

    A

    Contribution

    to the Nonstatic

    Theory of Production,

    in

    Studies

    in

    MathematicalEconomics

    and Econometrics, n

    Memory

    of Henry

    Schultz,

    pp.

    92-109, esp.

    pp. 106

    ff.;

    A Contribution

    to the Non-Static

    Theory

    of

    Choice, Quarterly

    Journal of Economics, Vol. 56, February, 1942, pp.

    274-306,

    esp. pp. 302 ff.

    1

    G. C. Evans,

    Mathematical

    Introduction

    to Economics,

    New York,

    1930,

    pp.

    36

    ff.

    18

    C. F. Roos,

    Dynamic

    Economics,

    Bloomington, Ind.,

    1934,

    pp.

    14

    ff.

    17

    J.

    Tinbergen,

    Annual

    Survey: Suggestions

    on Quantitative

    Business

    Cycle

    Theory,

    ECONOMETRICA,

    Vol. 3, July,

    1935, pp.

    241-308.

    18

    M. Kalecki, Essays

    in the

    Theory of Economic

    Fluctuations,

    London,

    1935;

    Studies

    in

    Economic

    Dynamics,

    New York, 1944.

    19

    H. T. Davis,

    The Theory

    of Econometrics,

    Bloomington,

    Indiana,

    1941,

    pp.

    408 ff.

    20

    G.

    Tintner,

    A

    'Simple'

    Theory of

    Business

    Fluctuations,

    ECONOMETRICA,

    Vol.

    10,

    July-October,

    1942, pp.

    317-320.

    21

    J.

    Tinbergen,

    Business

    Cycles

    in the United

    States,

    1919-32, Geneva,

    1939.

    22

    G. Tintner,

    The 'Simple'

    Theory

    of Business

    Fluctuations:

    A

    Tentative

    Verification,

    Review

    of

    Economic Statistics,

    Vol. 26,

    August, 1944, pp.

    148-157.

    23

    V. Pareto, Cours

    d'gconomiepolitique,

    Vol. 2, Lausanne,

    1897, pp.

    364 ff.

    24

    F. A. von Hayek,

    The

    Present State

    of the

    Debate

    in Collectivist

    Economic

    Planning,

    London,

    1935,

    pp. 201-243,

    esp. pp.

    207

    ff.

    This content downloaded from 132.248.147.90 on Mon, 3 Nov 2014 12:53:35 PMAll use subject to JSTOR Terms and Conditions

    http://www.jstor.org/page/info/about/policies/terms.jsphttp://www.jstor.org/page/info/about/policies/terms.jsphttp://www.jstor.org/page/info/about/policies/terms.jsp
  • 8/10/2019 1905702

    5/33

    8

    GERHARD

    TINTNER

    be

    prohibitive because of the great number of

    variables actually enter-

    ing into such a system of general equilibrium. This is true

    even if

    ob-

    servations for all the z; were available which is

    certainly

    not

    the

    case.

    Hence we are forced to investigate the possibilities of simplification.

    In economic theory, such simplified models have

    very often been

    con-

    structed. Fisher's famous equation of exchange25 can

    be considered

    as

    a

    very simplified model of the complete Walrasian

    system.

    This

    as-

    sumes of course that this equation or the

    corresponding Cambridge

    equation is not a mere tautology. The equations in Keynes's

    earlier

    Treatise26

    are of a similar nature. Many other

    models

    have

    been con-

    structed, some of which are represented in

    Lundberg's book.27 Keynes's

    General Theory28can be thought of as a simplified model of the Wal-

    rasian system. It is mathematically formulated

    in the articles by

    Hicks,2 Lange,30 and Modigliani,3' to mention

    only a few. We want to

    proceed along similar lines.

    We replace a certain subset of the variables and parameters

    Z1,

    Z2,

    .

    .

    *,

    ZQ

    by one new variable

    Mh

    (h=l1,

    2,

    * * *, p). p is now

    a

    much smaller number than Q. At the same time

    we reduce the

    num-

    ber

    of equations in the system.

    The substitution of

    Mh

    for a subset of our

    original variables and

    parameters amounts to the following: We replace

    for instance

    the vari-

    ous wheat prices by one representative wheat

    price or by the average

    of all

    wheat prices. We replace the quantities of all

    the producers' goods

    in

    the system (like iron, coal, copper, etc.) by an index of the quantity

    of

    producers' goods. We replace the various short-term interest rates

    by

    one

    representative short-term rate or by the average of all

    short-

    term rates, etc. Time will probably also enter

    explicitly into our equa-

    tions.

    After

    replacing subsets of the original variables

    and parameters

    in

    (1) by the new set of variables M1, M2, * * * ,

    Mp we may still have

    variables that are not represented. Write w for a new variable which

    stands

    for those variables which lack

    representation.

    25

    I. Fisher, The

    Purchasing Power of

    Money,

    New York,

    1911.

    26

    J. M. Keynes,

    A Treatise on

    Money, New

    York,

    1930.

    27

    E.

    Lundberg,

    Studies

    in the Theory

    of

    Economic

    Expansion,

    London,

    1937.

    28

    J.

    M. Keynes, The General Theory of Employment, Interest and Money,

    London,

    1936.

    29

    J.

    R. Hicks,

    Mr.

    Keynes and the

    'Classics'; A

    Suggested

    Interpretation,

    ECONOMETRICA, Vol. 5,

    April,

    1937, pp.

    147-159.

    30

    0.

    Lange,

    The

    Rate of

    Interest

    and

    the

    Optimum

    Propensity

    to

    Consume,

    Economica, Vol. 5,

    New

    Series,

    February,

    1938, pp. 12-32.

    31

    F.

    Modigliani,

    Liquidity

    Preference and the

    Theory

    of Money,

    ECONO-

    METRICA, Vol. 12,

    January, 1944,

    pp.

    45-88. See

    also D.

    M. Fort, A

    Theory

    of

    General

    Short-Run

    Equilibrium,

    ECONOMETRICA,

    Vol. 13, October,

    1945,

    pp.

    293-310.

    This content downloaded from 132.248.147.90 on Mon, 3 Nov 2014 12:53:35 PMAll use subject to JSTOR Terms and Conditions

    http://www.jstor.org/page/info/about/policies/terms.jsphttp://www.jstor.org/page/info/about/policies/terms.jsphttp://www.jstor.org/page/info/about/policies/terms.jsp
  • 8/10/2019 1905702

    6/33

    MULTIPLE

    REGRESSION FOR SYSTEMS OF EQUATIONS

    9

    Assume that

    we have

    now R equations in our simplified

    system

    (R

  • 8/10/2019 1905702

    7/33

    10

    GERHARD

    TINTNER

    The

    economic meaning of

    the

    assumption that the

    Mi

    are

    the mathe-

    matical

    expectatiQns

    of

    the

    observed variables

    Xi

    is

    the

    following:

    We

    assume that in the

    long run

    the averages(arithmetic

    means) of

    the dis-

    turbances or deviations mentioned above tend to be zero. The conse-

    quences

    are

    not

    very serious

    if this

    assumption

    should

    not be

    strictly

    fulfilled. The

    resulting

    bias will

    only influence

    the constant

    terms

    if

    we

    consider linear systems, as

    we propose

    to do later.

    The

    more

    impor-

    tant

    regression

    coefficients (which

    represent,

    for

    instance, elasticities),

    are not affected.

    Since we are

    primarily interested in

    the estimation of

    these

    coefficients we may

    assume

    that the biases will not

    impair

    the

    significance of our

    results

    considerably,

    even

    if

    they should

    exist.

    Write Xit for the actual observation of the theoretical variables Mi

    at the

    point

    in

    time

    t. Our observations extend over the

    period

    t=l,

    2,**, N.

    We

    have;

    (3)

    Xit

    -Mit +

    Yit)

    i

    =

    1,

    2, ..,

    p;

    t-=1, 2, ..,

    N.

    We

    have

    by

    definition

    Mt

    =

    EXi.

    Mit

    is the

    mathematical

    expecta-

    tion

    or systematic

    part and

    yit

    the

    random term or the

    disturbance

    (Frisch). These disturbances

    result from

    lack of

    representation, fric-

    tions,

    and

    errors

    of measurement as

    indicated

    above. The last term

    may be absent with

    some of

    the variables, especially

    the one

    represent-

    ing

    time. The

    mathematical expectations

    are not

    random variables. We

    will

    also assume that the term

    wt (t= 1, 2, *

    * * , N)

    which stands for the

    variables

    not

    represented

    in

    the

    system

    is

    random, with

    Ewt

    =0.

    It

    is

    also

    possible

    that

    sometimes some kinds

    of frictions

    may give

    rise

    to

    this

    type

    of

    stochastic

    terms.

    Now we

    finally

    assume the system (2)

    linear in

    first approximation.

    p

    (4)

    k,o

    +

    Z

    k,jM2t

    =

    w,t,

    v

    =

    1,

    2,

    .,

    R;

    t=1, 2,

    ,

    N.

    j=1

    We would

    like

    to mention

    here that our purpose

    is not

    prediction

    but

    estimation of

    the structural

    coefficients

    k,j

    themselves

    (Wald).36

    Their

    importance

    for economic policy has

    been

    stressed by

    Haavelmo.37

    Our

    system is

    theoretically a

    representation of the complete

    Walrasian

    system

    but the individual

    equations are

    not yet meaningful in an eco-

    nomic

    sense. That is to say,

    we cannot in

    general

    interpret the individ-

    ual

    (normalized

    and

    orthogonalized)

    relationships (4)

    as economically

    meaningful

    functions, e.g.,

    demand functions,

    supply

    functions, etc.

    36

    A.

    Wald, The

    Fitting of

    Straight

    Lines if

    Both

    Variables

    are

    Subject

    to

    Error,

    Annals

    of

    Mathematical

    Statistics, Vol.

    11,

    September,

    1940,

    pp. 284-300.

    37

    The

    Statistical

    Implications of a

    System of

    Simultaneous

    Equations,

    cited in note

    3.

    This content downloaded from 132.248.147.90 on Mon, 3 Nov 2014 12:53:35 PMAll use subject to JSTOR Terms and Conditions

    http://www.jstor.org/page/info/about/policies/terms.jsphttp://www.jstor.org/page/info/about/policies/terms.jsphttp://www.jstor.org/page/info/about/policies/terms.jsp
  • 8/10/2019 1905702

    8/33

    MULTIPLE REGRESSION

    FOR

    SYSTEMS

    OF

    EQUATIONS

    11

    This problem, the so-called

    question of identification,

    will be taken

    up later.

    A general theory would require the treatment of the

    system (4) under

    specific assumptions regarding all the various terms entering into it,

    i.e., especially the

    wit

    in

    addition to the

    Mit

    and

    yit.

    Actually, the sta-

    tistical problem of estimation under these very general

    conditions

    is

    as yet not solved. There

    are, however, two different

    approaches avail-

    able:

    A.

    Assume the

    yit

    (disturbances)

    negligible compared with the

    wit.

    Then we get the stochastic systems studied by

    Haavelmo and his

    school.

    B. Assume the wit negligible. That is to say, assume that the chosen

    variables

    Xit

    represent the

    total economic system

    well enough so that

    the

    disturbances

    yit

    are

    actually responsible for all

    or nearly all the

    deviations. This is the approach which will be presented

    in this paper.

    We are now forced to make some additional assumptions,

    some of

    which are

    necessary

    in any case and others of which are

    made only for the

    sake

    of convenience.

    A

    complete

    mathematical

    treatment

    of the prob-

    lem of estimation under

    somewhat less stringent

    conditions is given

    elsewhere.38

    First we neglect the

    term

    w,t

    in (4) and will only try to

    estimate the

    k,

    in the system:

    p

    (5)

    ka

    +

    k,ktMit

    =

    O, v

    =

    1,2,

    ..

    *

    ,R;

    t

    =

    ly,2,

    *

    N.

    This assumption is permissible only

    if

    we feel reasonably

    certain that

    the

    most important

    variables

    pertinent

    to

    our

    system

    have

    been

    repre-

    sented and that the influence of variables and parameters z; which are

    not represented by the

    Ma

    is

    negligible.

    In

    other

    words,

    the

    equations

    (5)

    would

    give

    a

    perfect

    fit without

    any

    deviations

    were

    it

    not for

    the

    disturbances discussed

    above.

    The linear form of the equations,

    which has been assumed

    as

    a

    first

    approximation, may

    also create

    difficulties. Sometimes

    we

    may

    assume

    our

    system

    as linear

    in

    the

    logarithms, especially

    in

    the case of

    produc-

    tion

    functions (Douglas).3

    Certainly,

    the

    assumption

    of

    linearity

    will

    be justified in a small region around the equilibrium. More compli-

    cated

    systems, would

    include,

    for

    instance, squares, cubes, etc.,

    or

    or-

    thogonal polynomials

    as variables

    in

    (5).

    Some of the

    assumptions given

    38

    G.

    Tintner, A

    Note

    on Rank,

    Multicollinearity

    and

    Multiple

    Regression,

    Annals of Mathematical

    Statistics,

    Vol.

    16, September,

    1945,

    pp.

    304-308.

    39

    P. H. Douglas,

    Theory

    of Wages, New York,

    1934, pp.

    113

    ff.,

    and subsequent

    publications.

    This content downloaded from 132.248.147.90 on Mon, 3 Nov 2014 12:53:35 PMAll use subject to JSTOR Terms and Conditions

    http://www.jstor.org/page/info/about/policies/terms.jsphttp://www.jstor.org/page/info/about/policies/terms.jsphttp://www.jstor.org/page/info/about/policies/terms.jsp
  • 8/10/2019 1905702

    9/33

    12

    GERHARD

    TINTNER

    below would

    have

    to be

    slightly

    modified

    in

    consequence,

    especially the

    one referring to

    independence of the

    disturbances.

    It is clear that the

    yit

    (disturbances)

    are

    statistically

    independent of

    the Mit (systematic parts) and we assume that they are normally dis-

    tributed.

    The second assumption is

    made chiefly for the

    sake of

    con-

    venience. If, however, the

    disturbances discussed above are small

    and

    very

    numerous,

    and some additional conditions regarding their

    higher

    moments are

    fulfilled, then normality

    in large samples follows from

    the

    Laplace-Liapounoff theorem.40

    Investigations into price

    dispersions, for

    instance,

    indicate that actually the deviations from normality

    may not

    be

    very great

    if we replace, e.g., a set of prices by a

    representative

    or

    average price

    (Mills).41

    This may not be true with other data, for in-

    stance

    incomes where skewness is very

    extensive.42 It may be necessary

    in

    such cases to include specifically a

    measure of

    skewness,

    e.g.,

    Pareto's

    a

    as one of

    the variables. As far as deviations caused by errors

    of meas-

    urement are

    concerned, there is no

    reason to assume that they are not

    normally

    distributed. The same seems

    to be true of those disturbances

    which go back to frictional causes.

    We

    assume

    further, that the

    yit

    have constant variance43 over the

    period

    considered. The empirical estimate of this variance is designated

    by

    Vi.

    This

    assumption is probably

    justified

    if

    we

    consider periods

    of

    time that are not too long. The

    mathematical analysis would

    be very

    much more

    complicated if the variances were changing over

    time. We

    assume also

    that the

    yit

    are statistically independent of each

    other.

    This

    assumption, which is here made

    only for the sake of

    convenience,

    is probably not strictly justified. The

    complete theory of

    estimation has

    been

    given

    elsewhere without this

    restriction.44

    A

    very important assumption is the

    following: The

    systematic parts

    of

    our

    variables

    (Mit)

    are smooth

    functions of time.45 We have not

    to make more

    specific assumptions

    about the nature of these functions.

    This excludes stochastic

    business-cycle

    theories,

    like, for

    instance,

    Frisch's.46 It

    is,

    on the other hand, compatible with the

    theories men-

    40

    J.

    V.

    Uspensky, Introduction

    to

    Mathematical

    Probability,

    New

    York, 1937,

    pp. 291

    fl.

    41

    F.

    C.

    Mills,

    The

    Behavior

    of

    Prices,

    New

    York, 1927, pp. 251 ff.

    42

    H. T. Davis, The

    Theoryof

    Econometrics,pp. 26

    ff.

    43

    G. Tintner,

    The

    Variate Difference

    Method,

    Bloomington, Indiana, 1940,

    pp. 165

    ff.

    44

    G. Tintner, A

    Note on Rank,

    Multicollinearity and

    Multiply

    Regression,

    cited

    in

    footnote 38.

    45

    G.

    Tintner,

    The

    Variate

    Difference

    Method, pp. 31 ff. See

    also the

    review

    by

    Tjalling

    Koopmans, Review of

    Economic

    Statistics, Vol. 26,

    May,

    1944, pp.

    105-107.

    4

    R.

    Frisch,

    Propagation Problems and

    Impulse

    Problems in

    Dynamic Eco-

    nomics,

    Economic Essays

    in Honor

    of Gustav Cassel,

    London,

    1933, pp. 171-205.

    This content downloaded from 132.248.147.90 on Mon, 3 Nov 2014 12:53:35 PMAll use subject to JSTOR Terms and Conditions

    http://www.jstor.org/page/info/about/policies/terms.jsphttp://www.jstor.org/page/info/about/policies/terms.jsphttp://www.jstor.org/page/info/about/policies/terms.jsp
  • 8/10/2019 1905702

    10/33

    MULTIPLE REGRESSION

    FOR

    SYSTEMS

    OF

    EQUATIONS 13

    tioned above and also with the nonmathematical

    theories of the

    cycle.

    There is one feature in nonstatic systems that may occasionally bring

    about abrupt and sudden changes which may appear as discontinuities.

    This is the kink in the (real or imagined) demand curve of the firm

    under oligopoly. It implies a discontinuity

    in the

    marginal-revenue

    curve. This was recently pointed out by Lange47

    on

    the basis of the

    earlier

    work

    of P.

    Sweezy.48 It

    does not

    seem, however,

    that

    these dis-

    continuities are actually very large and that they are widespread in the

    economic

    system. Continuity

    and

    momentum seem to us

    outstanding

    characteristics of our economy.

    The recent book by von Neumann and Morgenstern49also stresses

    discontinuities and indeterminacies arising in our economy from oligop-

    olistic and

    similar situations.

    These are not unlike

    phenomena ob-

    served in games of strategy. But the actual

    determination

    of prices,

    etc., under such conditions of indeterminacy may depend upon non-

    economic factors like bargaining power, politics,

    etc.

    as in

    the much

    discussed

    problem

    of bilateral monopoly.50 Hence it does not seem to

    us that

    under normal conditions important

    discontinuities must

    neces-

    sarily arise, as long as the total social situation

    is

    reasonably stable.

    It

    is important to note that the stochastic

    features of our

    system

    are

    introduced only through the disturbances

    yit.

    Economically

    this is

    equivalent to the following assertions:

    If we knew

    all

    our

    data

    perfectly

    and if

    the system (1) was frictionless

    and

    represented perfectly

    ra-

    tional behavior then the

    zi

    would appear as smooth (and predictable)

    functions

    of time. This

    is not

    unlike

    the

    famous assertion of

    Laplace

    in classical mechanics.

    Stochastic

    processes

    would

    appear only

    if

    assumption

    A

    above

    is

    adopted or if we attempted to determine the complete system (4).

    This

    would

    in

    some measure correspond to

    Frisch's

    business-cycle theory,52

    the

    problem considered by Slutsky,53

    the

    analysis

    of

    Hurwicz,54

    etc.

    40

    O.

    Lange, Price Flexibility

    and Employment,

    pp. 40 ff.

    48 P.

    M. Sweezy, Demand

    under Conditions of Oligopoly,

    Journal

    of

    Politi-

    cal Economy, Vol. 47, August,

    1939, pp.

    568-573.

    49

    J. von Neumann and 0. Morgenstern, Theory of

    Games

    and Economic Be-

    havior, Princeton,

    1944, pp.

    43 if.

    50

    See, e.g., G. Tintner,

    Note on

    the Problem of

    Bilateral

    Monopoly,

    Journal of Political Economy,Vol. 47, April, 1939, pp. 263-270, and the literature

    quoted there,

    esp. p. 270.

    51

    P.

    S.

    de Laplace,

    Essai

    philosophique

    sur

    les probabilit6s,

    4th

    ed.,

    Paris,

    1819,

    p.

    4.

    52

    R.

    Frisch, Propagation

    Problems

    and

    Impulse

    Problems ...

    ,

    cited in

    note 46.

    53

    E. Slutzky,

    The Summation

    of Random Causes

    as the Source

    of Cyclical

    Processes,

    ECONOMETRICA,

    Vol. 5, April,

    1937, pp. 105-146.

    54

    L.

    Hurwicz, Stochastic

    Models of Economic

    Fluctuations,

    ECONOMETRICA,

    Vol. 12, April, 1944, pp.

    114-124.

    This content downloaded from 132.248.147.90 on Mon, 3 Nov 2014 12:53:35 PMAll use subject to JSTOR Terms and Conditions

    http://www.jstor.org/page/info/about/policies/terms.jsphttp://www.jstor.org/page/info/about/policies/terms.jsphttp://www.jstor.org/page/info/about/policies/terms.jsp
  • 8/10/2019 1905702

    11/33

    14

    GERHARD

    TINTNER

    3. THE

    STATISTICAL

    METHOD

    A.

    Estimation

    We

    give

    now

    a

    short

    description of our method. It

    deals

    essentially

    with a case discussed by Frisch55and is in a sense an extension of the

    work of

    Koopmans,56

    who considered

    the

    existence of only one

    relation-

    ship

    existing between the

    Mit.

    This

    corresponds

    to the case R =1.

    The

    methods presented

    here are

    also similar

    to the ones given

    by the

    author

    in

    an earlier

    article.57

    They are

    definitely

    connected with modern

    work

    in

    multivariate

    analysis.58

    Under the

    assumptions

    stated above we

    endeavor to

    estimate

    the

    coefficients

    k,

    in

    equation

    (5). In order

    to do this we have first

    to

    estimate R itself, i.e., the number of independent linear relationships

    actually

    existing

    among our p variables

    Mit

    in

    the population.

    The

    possibility of

    estimating the

    true

    dimensionality

    of our prob-

    lem in

    statistical terms (a

    problem

    already

    envisaged by Frisch59

    and

    contemplated by Fisher60 and

    others

    in

    discriminant

    analysis )

    dis-

    tinguishes our approach

    from the

    one of Haavelmo

    and

    his

    followers.

    These

    simply assume

    R, the number of

    independent linear

    relationships

    between the

    variables

    in

    the

    population

    as

    given.

    Hence

    it

    seems

    pos-

    sible that they may endeavor to accomplish too much, i.e., to deter-

    mine

    a

    greater number of

    equations

    than

    actually

    exist

    in

    the data.

    This

    would

    by necessity lead to

    nonsensical

    results.

    The reason

    for

    this

    distinction between their

    method

    and

    our

    procedures

    lies

    in

    the different

    role of economic

    theory and

    the

    reliance put into a priori assumed

    eco-

    nomic

    relationships,

    which seem to be

    greater

    with

    Haavelmo

    and

    his

    school

    than with

    us.

    a.

    Estimation

    of

    the

    Variances

    of

    the Disturbances. In

    order

    to

    esti-

    mate R we have first to estimate the variances of the disturbances yit.

    Since

    we

    have assumed above that the

    Mit

    are

    smooth

    functions of

    time

    we

    can

    use

    the variate difference method for

    this

    purpose.

    This

    seems

    to

    us

    preferable because we do not

    have

    to assume

    that the

    sys-

    tematic

    parts

    of

    our

    variables

    are

    specific functions

    of

    time, e.g., poly-

    55

    R.

    Frisch, Statistical Confluence

    Analysis.

    56

    T. Koopmans, Linear RegressionAnalysis in Economic Time Series, Haarlem,

    1937.

    57G. Tintner, An Application of the Variate Difference Method to Multiple

    Regression, ECONOMETRICA,

    Vol.

    12,

    April, 1944, pp.

    97-113.

    58

    H. Hotelling, Relations between

    Two Sets of Variates, Biometrika, Vol.

    28,

    December, 1936, pp. 321-377.

    69

    Statistical Confluence Analysis

    .

    . .,

    cited

    in

    note

    34. See also,

    R.

    Stone.

    T'he

    Analysis of Market Demand, London, National Institute of Economic and

    Social

    Research, 1945.

    60

    R. A.

    Fisher,

    The

    Statistical

    Utilization of Multiple Measurements,

    An-

    nals of Eugenics, Vol. 8, 1938, pp. 376

    ff.

    This content downloaded from 132.248.147.90 on Mon, 3 Nov 2014 12:53:35 PMAll use subject to JSTOR Terms and Conditions

    http://www.jstor.org/page/info/about/policies/terms.jsphttp://www.jstor.org/page/info/about/policies/terms.jsphttp://www.jstor.org/page/info/about/policies/terms.jsp
  • 8/10/2019 1905702

    12/33

    MULTIPLE REGRESSION FOR SYSTEMS

    OF EQUATIONS 15

    nomials. It will be

    shown below that our procedures do not necessarily

    depend upon the use

    of this method.

    It has been shown

    elsewhere6' that we can proceed as follows:

    We

    form the series of first, second differences, etc. of our original data Xit.

    Then we compute

    the variances of these difference series. The

    Mit

    are

    by assumption smooth functions of time.

    It follows that the

    portion

    of the variance of the difference series of

    Xit

    which is due to this sys-

    tematic part will become smaller and smaller

    the higher the

    order of

    the difference. Eventually we shall be left substantially with

    the vari-

    ance of the random terms

    yit

    alone.

    If

    this

    is true in the

    koth

    difference

    series,

    it follows

    that, apart from sampling

    fluctuations,

    the variances

    of the (ko+l)th difference series, of the (ko+2)th difference series, etc.

    must

    be the same as the variance of the

    koth

    difference series

    if

    they

    are all

    correctly

    computed. Test functions have been given elsewhere62

    which serve to

    form an

    idea

    about

    the order of the difference series

    in

    which this is the case, i.e., in which we probably

    are left with

    the vari-

    ance of

    the

    random element alone. Then

    we

    may take the

    variance of

    the

    koth

    difference series of

    Xit

    as an estimate of the population

    vari-

    ance of the corresponding random element.

    It will be denoted

    by

    Vi.

    But our procedures are by no means dependent upon the use of the

    variate difference

    method. There are other

    ways of getting an estimate

    of

    the variances of the random elements

    in our variables

    Xit.

    For in-

    stance

    we

    may

    compute orthogonal polynomials, Fourier series,

    etc. to

    approximate

    the course of the systematic element

    Mit

    over time.

    Then

    we take the residual variance computed from

    the deviations from

    these

    trends

    as

    estimates of the variances of the

    random elements

    or dis-

    turbances

    in

    the

    population.

    Sometimes we may even be able to assume the Vi a priori. This will

    especially

    be the

    case

    if we

    can interpret the disturbances

    yit

    substan-

    tially as the results

    of errors of measurement or sampling.

    If modern

    sampling

    methods

    are more

    frequently

    applied

    in

    economic

    data63 we

    may

    often

    get

    an

    idea of the

    magnitude

    of

    the

    sampling

    error.

    But

    it

    is not impossible to imagine that we know

    in some cases

    a

    priori

    the

    relative accuracy

    of our data. It is for instance

    a well-known

    fact

    that

    price data are frequently

    much more reliable than

    data

    on quantities

    consumed, national income, etc.

    f,.

    Estimation

    of

    the

    Number of

    Relationships. Having

    found

    in one

    way

    or another estimates of the variances

    of the disturbances

    yit

    in

    the

    population

    we

    can now utilize

    a

    very ingenious

    method

    due

    to

    61

    G. Tintner, The Variate

    Difference

    Method.

    12

    Ibid.,

    pp. 67

    ff.,

    73

    ff.

    63

    See, e.g.,

    A.

    J.

    King

    and R.

    J.

    Jessen,

    The Master

    Sample

    of

    Agriculture,

    Journal of

    the

    American Statistical

    Association,

    Vol.

    40,

    March, 1945, pp.

    38-56.

    This content downloaded from 132.248.147.90 on Mon, 3 Nov 2014 12:53:35 PMAll use subject to JSTOR Terms and Conditions

    http://www.jstor.org/page/info/about/policies/terms.jsphttp://www.jstor.org/page/info/about/policies/terms.jsphttp://www.jstor.org/page/info/about/policies/terms.jsp
  • 8/10/2019 1905702

    13/33

    16

    GERHARD

    TINTNER

    P. L. Hsu64

    n order to estimate R.

    This is

    equivalent

    to

    forming

    an

    idea

    about

    the rank of the

    variance-covariance

    matrix

    of the

    Mit

    in the

    population. For this

    and the

    following

    method

    of estimation of the

    k,1

    we have

    to assume

    that we

    are

    dealing

    with

    large samples.

    Then

    our

    estimates

    Vi

    are accurate

    enough

    to

    enable

    us to treat

    them as con-

    stants.

    They

    are assumed to be

    equal

    to the

    corresponding

    population

    values. This

    assumption

    is

    necessary

    for

    our

    method

    and

    simplifies

    the

    procedures greatly.

    It

    will, however,

    only rarely be strictly

    fulfilled

    in

    practice

    and

    certain

    errors

    may appear

    in

    our

    estimates because of this

    assumption. But

    these errors are

    essentially

    errors of weighting

    and

    not likely

    to

    be always

    very important

    as

    Koopmans has shown.65

    Hsu's method leads to the following procedure:

    Consider the determinantal

    equation:

    al

    -

    XV1

    a12

    alp

    (6)

    al2

    a22

    -XV2

    a2p

    .

    .

    .

    .

    . . . .

    . .

    . . .

    .

    . .

    . =

    O,

    alp

    a2

    * *

    app

    -

    XVp

    where ai is the sample covariance of Xit and

    Xit.

    That is to say,

    ai1=(Z:'=xitxit)/(N-1)

    where

    xit

    is

    the

    deviation of

    Xit

    from its

    arithmetic

    mean

    Xi,

    etc. The

    Vi

    are

    our

    estimates

    of the

    random vari-

    ances.

    They

    may

    of course be

    absent

    in

    the

    case of certain

    variables

    that have

    no random term,

    especially the

    variable representing time

    itself.

    In

    the population

    equation

    corresponding to

    (6) there should be r

    roots

    X

    =

    1,

    if there are

    actually r

    independent linear

    relationships be-

    tween the Mit in the population.

    Let X1be the

    smallest root of

    equation (6), X2be

    the next

    smallest,

    etc.

    These

    roots are

    best

    computed

    by Hotelling's66

    methods. We form

    the test

    function:

    (7)

    Ar

    =

    (N- 1)(Xl

    +

    X2

    +

    +

    Xr)

    i.e.,

    the

    sum

    of

    the

    r smallest

    roots of

    (6)

    multiplified by (N

    -

    1). This

    quantity

    (7)

    is distributed like x2

    with

    (N-I-p

    +r)

    r

    degrees of free-

    dom. It may be used to estimate R, i.e., the true number of independent

    relationships of the

    type (5) which actually exist

    in the

    population.

    The

    determinantal

    equation (6) is fundamental

    for our

    procedures.

    64

    P.

    L. Hsu,

    On the

    Problem

    of

    Rank and

    the

    Limiting

    Distribution of

    Fisher's Test

    Function, Annals

    of

    Eugenics, Vol. 11,

    1941, pp.

    39 fif.

    66

    T.

    Koopmans, Linear

    Regression

    Analysis in

    Economic

    Time

    Series,

    pp.

    87 ff.

    6B

    H.

    Hotelling,

    Simplified

    Calculation of

    Principal

    Components,

    Psycho-

    metrika,

    Vol.

    1,

    March, 1936,

    pp. 27-35.

    This content downloaded from 132.248.147.90 on Mon, 3 Nov 2014 12:53:35 PMAll use subject to JSTOR Terms and Conditions

    http://www.jstor.org/page/info/about/policies/terms.jsphttp://www.jstor.org/page/info/about/policies/terms.jsphttp://www.jstor.org/page/info/about/policies/terms.jsp
  • 8/10/2019 1905702

    14/33

    MULTIPLE REGRESSION FOR SYSTEMS

    OF EQUATIONS

    17

    The number

    of degrees of freedom in the distribution of

    A,.

    will be

    inter-

    preted later.

    The following

    hypothesis will be tested:

    There are exactly

    r inde-

    pendent

    linear relations in the population between

    the

    Mit.

    Given a

    level of significance, say 1

    per cent, we will

    reject the hypothesis

    for a

    A,

    which is

    larger than the

    one permitted for the given

    level of sig-

    nificance taken

    from the tables

    of

    X2

    with the appropriate number of

    degrees of freedom. Hence

    we will compute

    the quantity

    (7)

    for

    r=

    1,

    2, 3,

    *

    *

    -

    until we find a

    A,.

    which

    is larger than the

    one permis-

    sible under

    the conditions

    given above. The r corresponding

    to this

    A,

    is to be taken

    as an estimate

    of the true number of

    relationships

    existing in the population between the systematic parts of our variables.

    This estimate

    will be denoted

    by R.

    ay.

    Estimation

    of the Weighted Regression

    Coefficients.

    The method

    of

    maximum

    likelihood for the

    estimation of

    the

    kIq

    eads under our as-

    sumptions

    to the method of least squares.

    Our estimates

    can

    then

    be

    regarded as

    the best linear estimates in large

    samples and

    are unbiased

    under the conditions stated.

    This follows

    from the above assumption

    that

    the

    Vi

    can be treated

    as constants. Hence the

    Markoff

    theorem

    applies.67The sum of squares to be minimized is:

    N

    p

    (8)

    Q =

    E E

    (xt

    -M,t)2/V,

    t=1

    i=1

    where

    mit

    is the deviation

    of

    Mit

    from its

    arithmetic mean. We

    have

    to

    minimize

    Q

    under the R conditions (5).

    These

    conditions

    involve actually Rp+R

    coefficients

    k,,

    (v=1,

    2,

    * *

    ,

    R;

    j=O, 1, 2, * *

    ,

    N) which have to be estimated.

    The

    con-

    stants k,o are evidently given by the condition that the best fit has to

    pass through

    the means of

    all the variables :68

    p

    (9) kao

    +

    k,ikXi

    =

    0,

    v

    =

    ly

    2,

    ..

    I

    R.

    This still leaves

    Rp

    coefficients

    k,,1

    v= 1, 2, * * * R; j=

    1, 2, * * *

    ,

    p).

    But it is

    evident that R2 of these coefficients

    are not really

    necessary.

    Suppose

    we

    suppress in (5)

    the constants

    k,,o

    and substitute

    the devia-

    tions from the means

    mit

    for the variables Mit. Then we can, for in-

    stance, express

    the first p-R

    variables

    mit,

    m2t,

    .

    .

    .

    X

    mp-R

    tin

    terms

    of the

    last

    R variables

    mpR+l,

    t,

    mpR+2,

    t,

    .

    .

    .,

    mpt.

    Hence

    we

    need

    only (p-R)R

    independent

    coefficients

    k,,.

    67

    F. N. David and

    J. Neyman, Extension of the Markoff Theorem

    on

    Least

    Squares, Statistical

    Research Memoirs,

    Vol. 2, 1938, pp.

    105

    ff.

    68

    G.

    Tintner,

    An Application

    of

    the Variate Difference

    Method

    to

    Multiple

    Regression, op. cit.,

    p. 105, formula

    (20).

    This content downloaded from 132.248.147.90 on Mon, 3 Nov 2014 12:53:35 PMAll use subject to JSTOR Terms and Conditions

    http://www.jstor.org/page/info/about/policies/terms.jsphttp://www.jstor.org/page/info/about/policies/terms.jsphttp://www.jstor.org/page/info/about/policies/terms.jsp
  • 8/10/2019 1905702

    15/33

    18

    GERHARD

    TINTNER

    The

    f2

    conditions

    imposed on the pR

    coefficients

    kq1

    v=

    1, 2, *.**,

    R;

    j=1, 2,

    ,

    p) are

    as follows:

    p

    (10) k ikwiVi =

    vw,,

    v, w = 1, 2, *

    *,

    R

    i=l1

    where

    &vw

    s

    the Kronecker

    delta.

    It is one

    for

    v=

    w

    and zero for v

    w.

    (11)

    E3 E

    ktjx1)

    (

    E

    kwix1t)

    = O,

    t=1

    i==l

    j=1

    v

    p

    w;

    v, w

    =

    1, 2,

    *

    ,

    f.

    These conditions (10) and (11) normalize and orthogonalize the co-

    efficients klcv v-1,

    2,

    .

    * *,

    R; j= 1,

    2,

    . .

    *, p).

    The procedure is

    then

    essentially the

    same as

    weighted regression

    with

    just

    one

    equation.69

    We determine

    first the

    most

    advantageous

    values

    of the

    mit

    (i=

    1,

    2, *

    * *, p;

    t= 1, 2, *

    ,

    N). It can

    be demon-

    strated that

    under

    the given conditions

    this

    leads to a sum

    of squares

    in

    the

    form:

    N

    R

    P

    (12)

    9=

    E

    kvj

    tx)2.

    t=l v=l

    j=l

    This

    expression is very

    similar

    to the

    one appearing

    in the

    earlier analy-

    sis

    with

    just

    one relation.

    The complete

    solution

    in mathematical

    form is given elsewhere.70

    It

    can

    be shown that

    the

    minimization

    of the expression

    (12)

    under

    con-

    ditions (10)

    and (11)

    leads again to

    the determinantal

    equation

    (6).

    The system

    of

    linear homogeneous

    equations

    that yields

    the

    solu-

    tions is as follows:

    (all

    -

    XvV,)kv,

    +

    al2kv2

    +

    +

    aipkvp

    =

    0,

    (13)

    al2kv1

    +

    (a22

    -

    XvV2)kv2

    +

    *

    +

    a2,kvp

    =

    0,

    . .

    .

    .

    .

    . .

    .

    .

    .

    .

    . . .

    .

    . . . . . . .

    .

    .

    aJpkvc a2pkv2

    *

    +

    (app

    -

    XvVp)kvp

    =

    0,

    v

    l=

    21,

    ,

    R.

    This system of equations can evidently only have a nontrivial solu-

    tion if its determinant is

    zero. [A

    trivial solution

    is excluded

    by condi-

    tion

    (10).]

    This is

    achieved by inserting

    for Xvone of

    the roots

    of the

    determinantal equations

    (6).

    Then

    the computation

    of the

    k,j

    is possi-

    ble,

    if

    one additional

    condition

    is imposed,

    e.g., condition

    (10) for

    v

    =w,

    69

    T. Koopmans, Linear

    Regression

    Analysis.

    .

    .

    , cited in note

    56.

    70

    G.

    Tintner,

    A Note on Rank, Multicollinearity

    and Multiple

    Regression.

    This content downloaded from 132.248.147.90 on Mon, 3 Nov 2014 12:53:35 PMAll use subject to JSTOR Terms and Conditions

    http://www.jstor.org/page/info/about/policies/terms.jsphttp://www.jstor.org/page/info/about/policies/terms.jsphttp://www.jstor.org/page/info/about/policies/terms.jsp
  • 8/10/2019 1905702

    16/33

    MULTIPLE

    REGRESSION FOR

    SYSTEMS OF EQUATIONS 19

    which normalizes

    the coefficients

    k,j

    by making their weighted

    sum of

    squares equal to

    one.

    By carrying out

    this process for v= 1, 2,

    ,

    R we get

    a complete

    set of estimates for the coefficients

    k,i

    in equations (5). The minimized

    sum of squares

    in

    (8)

    and

    (12) becomes

    AR(N- 1).

    We can now

    interpret the number of degrees of freedom

    for the dis-

    tribution of the sum of squares

    AR.

    For

    each

    v

    we have to

    minimize N

    sums of

    squaresDf=

    1(xit-mit)2/Vi, which appear in formula

    (8). They

    correspond to

    the deviations of the

    dependent variables from the

    fitted

    regression

    equation in ordinary regression theory. The

    total num-

    ber of

    sums

    is RN since v=

    1, 2,

    * *

    *,

    R. Each sum corresponds to

    one

    of the roots of equation (6), X1,

    X2, * *

    *, XR.

    There are p-R+1 inde-

    pendent constants

    k,j

    (j=0,

    1,

    2,

    * * *, p) to be determined

    for

    each

    v.

    Their total number

    for all v is

    R(p-R)+R.

    The difference

    between

    the number of sums of squares

    and the number of

    constants

    is

    (N-1-p+R)R as given above, the

    number of degrees

    of freedom

    for

    AR.

    The

    procedure

    is

    analogous

    to the one ordinarily given

    in mul-

    tiple regression.

    In fact, for

    R

    =

    1

    the expression becomes

    N

    -

    p,

    the

    correct number

    of degrees of freedom

    in ordinary regression

    analysis

    with p variables, i.e., one dependent and p -1 independent variables.

    The difference

    between our new approach

    and the classical method

    of

    fitting weighted

    regression equations given by Koopmans

    is

    the fact

    that

    we determine

    here all the

    R

    smallest

    roots of the determinantal

    equation (6)

    instead of only the smallest. This enables us to

    estimate

    R

    sets of

    coefficients

    of

    the type (5) instead

    of just one.

    B.

    Identification

    We have now estimated the coefficients kq in (5) for v= 1, 2, * *

    *,

    R;

    j=0,

    1, 2,

    . . .

    ,

    p

    on

    the

    basis

    of our estimate of

    R.

    This is

    equivalent

    to

    establishing

    R weighted regression equations

    whose

    coefficients

    are

    orthogonalized

    and

    normalized

    by the

    conditions (10)

    and

    (11).

    These

    results represent

    the system (1), i.e.,

    the

    Walrasian

    system of general

    equilibrium

    with

    the modifications

    mentioned above.

    But these equations are not yet economically

    meaningful,

    if

    consid-

    ered each

    in isolation. We are for instance

    not able to say

    which ones

    are demand equations, which one is the investment function, supply

    functions,

    etc.7

    In

    order to achieve

    a meaningful system

    of

    equations

    we

    have

    to make additional assumptions.

    Following

    a suggestion of Klein72we put these assumptions

    into the

    71

    T. Haavelmo, The Probability

    Approach in Econometrics, pp.

    91 ff.,

    pp.

    99 ff.

    72

    L. R. Klein, Pitfalls in the

    Statistical Determination of the Investment

    Schedule.

    This content downloaded from 132.248.147.90 on Mon, 3 Nov 2014 12:53:35 PMAll use subject to JSTOR Terms and Conditions

    http://www.jstor.org/page/info/about/policies/terms.jsphttp://www.jstor.org/page/info/about/policies/terms.jsphttp://www.jstor.org/page/info/about/policies/terms.jsp
  • 8/10/2019 1905702

    17/33

    20

    GERHARD

    TlNTNER

    following

    form: Certain

    of the coefficients

    kvj

    (v

    =l,

    2,

    R;

    j =1,

    2,

    .

    *,

    p)

    will

    be

    assumed

    a

    priori

    to be

    equal

    to

    zero and

    hence

    missing

    from

    some of

    tlhe equations

    (5).

    A

    perfect

    estimation

    of

    eco-

    nomically meaningful relationships will be possible if there is in each

    equation of the

    system (5)

    at least

    one variable

    Mit

    that

    enters

    into

    this equation

    alone.

    Hence all the

    coefficients

    k,j

    belonging

    to

    this

    variable

    in all other

    equations

    of the

    system (5)

    are

    equal

    to

    zero.

    We

    could

    say

    that

    the individual

    economically

    meaningful equations

    (e.g.,

    demand

    functions,

    supply

    functions,

    etc.)

    are

    generated

    by

    the

    varia-

    bles

    MIit

    that

    enter

    into them alone and

    into

    none of the

    other

    equa-

    tions.

    To give an example: We will for instance assume that certain specific

    demand factors

    like

    income

    appear

    in

    the

    demand

    functions

    alone and

    that cost factors

    like the

    prices

    of

    factors of

    production

    appear

    solely

    in the

    supply

    equations,

    etc.

    It

    is

    apparent

    that

    there

    is a

    certain

    amount

    of

    liberty

    possible

    in the

    choice of

    the

    specific variables

    that

    should enter into

    given

    economically

    meaningful

    equations,

    like

    for

    in-

    stance demand

    functions. Statistical criteria

    for

    this

    choice

    will

    be

    given

    below,

    but

    they

    must not

    be

    taken as decisive.

    We

    may very

    well

    decide

    in favor of a system that is economically meaningful and gives a

    rather

    poor

    fit

    against

    one that

    gives

    a

    more

    perfect

    statistical

    fit

    but is

    difficult to

    interpret

    economically.

    A

    certain

    amount of

    arbitrariness enters also

    into

    the

    selection

    of

    the

    specific variables

    that should be included

    in

    a

    given

    economically mean-

    ingful equation.

    If

    it is a demand

    function,

    we

    may

    for

    instance

    take

    average

    income,

    the

    distribution

    of

    incomes,

    taste

    factors,

    advertising

    cost,

    etc.

    as

    variables

    that will

    enter

    into

    this

    specific

    equation

    apart

    from the quantities bought and prices paid for this good (and possibly

    also prices of

    related

    goods).

    The demand

    functions

    will

    differ

    according

    to which

    particular

    variables

    we

    include into our

    equation. This

    is

    a

    consequence

    of the

    fact that

    we

    make a

    somewhat

    arbitrary

    selection

    of

    variables and

    parameters

    out of the

    comprehensive Walrasian

    sys-

    tem

    (1).

    An

    error

    will

    be

    committed

    by

    neglecting

    certain

    variables.

    We

    can

    perhaps

    say

    that

    we

    actually replace

    the

    general

    Walrasian

    system

    by

    a more restricted

    system

    of

    a

    kind

    of

    partial

    equilibrium.

    Statistical criteria

    will

    be

    given

    below

    that should

    enable

    us

    to

    ap-

    preciate

    the

    statistical

    goodness

    of fit

    resulting

    from

    the

    inclusion

    of

    certain

    variables

    in a

    specific

    equation.

    A

    comparison

    between

    different

    possible

    selections

    is

    also

    feasible.

    But,

    as indicated

    above,

    such

    purely

    statistical

    considerations

    are

    by

    no

    means decisive.

    Some

    of

    the

    initial

    advantages

    of

    our

    approach

    may

    be lost

    or

    partially

    lost

    in

    this

    fashion.

    The problem

    of

    identification

    is

    important

    especially

    for

    economic

    policy.

    Assume for

    instance a

    policy

    of

    government investments.

    Then

    This content downloaded from 132.248.147.90 on Mon, 3 Nov 2014 12:53:35 PMAll use subject to JSTOR Terms and Conditions

    http://www.jstor.org/page/info/about/policies/terms.jsphttp://www.jstor.org/page/info/about/policies/terms.jsphttp://www.jstor.org/page/info/about/policies/terms.jsp
  • 8/10/2019 1905702

    18/33

    MULTIPLE

    REGRESSION

    FOR

    SYSTEMS

    OF

    EQUATIONS

    one of the

    equations

    in

    (5)

    which

    represents

    the

    investment

    function

    will

    be

    replaced

    by.the

    new

    conditions of

    government

    investments,

    while all the

    other

    equations

    remain

    the

    same.

    But

    without

    having

    solved the

    problem

    of identification

    we cannot

    say

    which one of the R

    normalized

    and

    orthogonalized

    equations

    (5) (or

    which

    of the

    infinitely

    many

    combinations

    of them that

    are

    equivalent)

    is

    actually

    the

    in-

    vestment

    function. The

    system

    of

    equations

    given

    under the

    previous

    conditions is

    useless for

    purposes

    of

    economic

    policy.

    Hence,

    in

    order to

    get

    meaningful

    equations

    that are

    useful

    for eco-

    nomic

    policy

    (and

    also

    more

    enlightening

    for

    theory)

    we have to

    impose

    the

    conditions

    mentioned above

    and

    carry

    out

    our

    estimation under

    the

    new conditions

    apart

    from the ones

    imposed

    by

    (10)

    and

    (11).

    We will

    later

    give

    statistical

    methods

    to

    estimate

    the loss

    in

    accuracy

    suffered

    in

    the

    process

    of

    obtaining

    meaningful equations.

    The

    estimation of

    the

    new

    coefficients

    kv/

    determined under the new

    conditions

    that

    make the

    equations

    meaningful

    is

    actually

    simpler

    than

    the

    estimation

    of

    the

    system given

    before.

    A number of

    new conditions

    is

    added

    to

    the

    orthogonalizing

    and

    normalizing

    conditions

    (10)

    and

    (11).

    They

    express

    which of the coefficients

    kji

    are

    assumed

    to be zero

    in any specific equation in (5).

    Assume

    that

    s,

    conditions

    are

    imposed

    for

    equation

    v.

    Then

    p/=

    p--sv

    is the

    number of

    coefficients

    kv,/

    (j=1,

    2,

    * *

    *,

    pv')

    to be

    determined.

    The

    method of

    estimation

    is

    essentially

    the same as before. We

    put

    equal

    to

    zero

    in

    (13)

    the

    sv

    coefficients

    ckv

    hat

    are assumed

    to

    be

    equal

    to

    zero

    for the

    purpose

    of

    identification

    and

    get

    the new

    system

    (14).

    Here

    the

    numbering

    of

    the

    variables

    has been

    changed

    and does not

    necessarily correspond to the one previously used, e.g., in (13).

    (an

    -

    Xv'Vi)kvl'

    +

    al2kv2 +

    +-

    aklpvkp,

    v

    =

    0,

    (14)

    al2kv'

    +

    (a22

    -

    Xv'V2)kv2'

    +

    +.

    -

    a2p',kvpv,'

    =

    0,

    (14)

    alpv'kul'

    +

    a2pv,k2

    +

    +'.

    +

    (apvpv

    -

    Xv')kvpv,'

    =

    0,

    v-1, 2, ,

    R.

    This is

    again

    a

    system

    of

    linear

    homogeneous

    equations

    in

    the varia-

    bles

    kv'.

    A

    nontrivial

    solution

    is

    possible only

    if the

    determinant

    is

    equal

    to

    zero. In

    order to

    accomplish

    this

    we have

    to

    modify

    the de-

    terminantal

    equation

    (6)

    in such

    a

    fashion,

    that the

    rows and columns

    corresponding

    to

    the

    sv

    variables that

    do not

    now

    appear

    in the

    system

    (14)

    are left out.

    Denote the smallest

    latent

    root of the

    modified

    equa-

    tion

    (6) by

    Xv'.

    Then

    we

    insert

    this

    solution

    Xv'

    nto

    (14)

    and assume

    21

    This content downloaded from 132.248.147.90 on Mon, 3 Nov 2014 12:53:35 PMAll use subject to JSTOR Terms and Conditions

    http://www.jstor.org/page/info/about/policies/terms.jsphttp://www.jstor.org/page/info/about/policies/terms.jsphttp://www.jstor.org/page/info/about/policies/terms.jsp
  • 8/10/2019 1905702

    19/33

    22

    GERHARD

    TINTNER

    again

    a

    rule of

    normalization (for instance

    making

    the

    sum of

    the

    squares

    of the

    k,,'

    equal to one). We get estimates for

    the

    k,,'

    which

    are valid

    under our new

    conditions. We repeat this process for

    v= 1, 2, * - *, R with the appropriate conditions satisfied in each case.

    Then

    we

    achieve now the solution of the

    problem of estimation in such

    a

    fashion that each

    individual equation is

    identified, i.e., has

    a

    specific

    economic meaning. Our

    complete system

    contains now all the original

    variables

    and

    all the

    identifying variables. This economic

    advantage

    may

    stand against a

    statistical loss because the sum of squares is

    now

    in general larger than

    before.

    The test

    indicated above

    under A,

    a

    should be used for each set

    of

    variables utilized for a given identified equation in order to make sure

    that

    there is

    exactly

    one

    relationship between these

    variables.

    C.

    Tests

    of

    Significance

    We will

    now give tests of

    significance that should enable us to

    get

    an

    idea about the goodness

    of the fit achieved

    and to estimate the

    loss

    of

    statistical

    accuracy suffered by imposing new

    conditions for

    the

    pur-

    pose

    of

    identification.

    Let

    81+82+ -

    *

    -

    +SR=

    S'

    be the total number of conditions imposed

    in

    the

    process of identification. Define a

    quantity analogous to (7):

    (15)

    AR'

    =

    (N

    -

    1) (X1' + X2 +

    - -

    * + XR

    )

    which is now

    the sum of the R smallest roots

    of the modified determi-

    nantal

    equations (6) multiplified by N-1.

    Each one of the roots

    Xv'

    is

    computed by leaving out

    in (6) the

    appropriate

    s,

    rows and columns.

    The

    quantity (15) is

    distributed like

    x2

    with

    (N-1

    -p+R)R+S'

    de-

    grees of freedom.

    The

    following test of significance is

    analogous to a procedure given

    by

    Wilks.73

    We form

    F

    =

    (N-1-P

    +R)R

    (AR'-AR)/S'AR.

    This is

    ap-

    proximately distributed like

    Snedecor's F with

    (N-1

    -p+R)R

    and

    S'

    degrees

    of

    freedom. We

    fix

    a level

    of

    significance and test the

    following

    hypothesis:

    That the

    difference between ARand

    AR'

    is zero, i.e.,

    that

    the

    introduction of

    the

    S'

    new

    conditions

    necessary

    for identification

    has not

    materially deteriorated our fit.

    There is no equivalent test for the individual equations. The reason

    for

    this is that in the original

    system (5) the

    equations are

    not identified

    and

    we

    hence do

    not

    know

    which one corresponds to a given meaningful

    equation

    of

    our

    new

    set-up.

    Hence

    we

    must

    always

    determine a com-

    plete

    system before we can make a test of

    significance. There is,

    however,

    a

    method

    by which

    we

    can compare

    identified systems that involve

    different sets of

    conditions.

    73S. S. Wilks,

    Mathematical

    Statistics, Princeton, 1943, pp. 166 ff.

    This content downloaded from 132.248.147.90 on Mon, 3 Nov 2014 12:53:35 PMAll use subject to JSTOR Terms and Conditions

    http://www.jstor.org/page/info/about/policies/terms.jsphttp://www.jstor.org/page/info/about/policies/terms.jsphttp://www.jstor.org/page/info/about/policies/terms.jsp
  • 8/10/2019 1905702

    20/33

    MULTIPLE

    REGRESSION

    FOR

    SYSTEMS

    OF

    EQUATIONS

    Assume the

    number of

    conditions

    again

    S'

    in

    the first

    system

    and

    S

    in

    the

    second. Then

    we

    form

    a

    quantity

    AR

    corresponding

    to

    (15).

    The

    quan-

    tity

    F=[(N-1-p+R)R+S]AR'/[(N-1-p+R)R+S']AR

    is

    again

    approximately distributed like Snedecor's F with (N-1-p+R)IR+S'

    and

    (N-1-p+R)R+S

    degrees

    of freedom. The

    hypothesis

    to be

    tested

    is

    again:

    That

    there

    is

    no essential difference

    in

    the

    sum

    of

    squares

    of

    residuals

    divided

    by

    the

    appropriate

    number

    of

    degrees

    of

    freedom

    resulting

    from

    the

    two

    sets of

    conditions

    (or

    in

    the

    two

    sys-

    tems).

    It

    is

    remarkable and

    entirely

    in the

    spirit

    of

    our

    approach

    to the

    problem

    that it

    is

    not

    possible

    to test

    individually

    the

    difference

    be-

    tween the k,j and ki'. It shows again that it is really not individual

    equations

    that are

    determined in

    our case

    but that the entire

    system

    is

    estimated

    as a

    whole.

    Tests of

    significance

    for

    the

    individual

    kv'

    can be

    given

    that

    are

    approximations

    and

    based

    upon

    an idea

    indicated

    by Koopmans.74

    In

    order

    to do

    this

    we

    have to

    assume

    a

    different normalization rule

    from

    the

    one

    given

    above.

    This new

    normalization

    rule is

    also

    more

    appropriate

    and

    useful

    in

    economics. It consists in making one of the variables Mit, say Mlt,

    the

    dependent

    variable.

    E.g.,

    in

    the

    theory

    of demand

    we

    frequently

    consider

    the

    quantity

    demanded

    as

    the

    dependent

    variable

    and the

    price,

    income,

    etc.

    as

    independent

    variables.

    It

    should be

    stressed,

    however,

    that

    now this distinction is

    made

    only

    for

    the

    purpose

    of

    normalization

    and

    that

    it

    has

    definitely

    nothing

    to

    do

    with

    the

    distinction

    between

    dependent

    and

    independent

    variables

    made

    in

    classical

    regression analysis.

    There the

    assumption

    is

    that

    the

    dependent

    variable alone

    is

    subject

    to disturbances

    while

    the

    fit

    is car-

    red

    through

    for

    purposes

    of

    prediction

    assuming

    fixed values

    for

    the

    so-called

    independent

    variables.75

    Here

    our

    purpose

    is the

    estimation

    of

    the

    structural

    coefficients

    themselves

    and

    not

    prediction.

    Dis-

    turbances

    enter

    into all

    our

    variables,

    not

    only

    into

    Xit

    which now

    as-

    sumes

    the

    role

    of

    dependent

    variable for

    purpose

    of

    normalization

    only.

    Let

    our

    new

    normalized

    solutions

    be

    denoted

    by kc,i

    v

    =

    1,

    2,

    * *

    , R;

    j=2,

    3,

    *

    ,

    pv').

    The

    system

    of

    equations

    becomes

    now:

    (a22

    Xv'V2)kv2 t+a23kv3 +

    .

    +a2pv p

    =

    a12,

    (16)

    a23k,2 +-

    (a33-X'V3)kC3 -+

    +a3pk ,p.'t

    =a13,

    (16)

    a2pvkv2/

    a3pkv3+

    *

    *

    +(apvp'v

    -Xv'Fpv)kpv,

    =alpv.

    74

    T.

    Koopmans,

    Linear

    Regression Analysis

    in Economic

    Time

    Series,

    pp.

    72 ff.

    75

    H.

    Hotelling,

    The

    Selection of

    Variates

    for Use in Prediction with Some

    23

    This content downloaded from 132.248.147.90 on Mon, 3 Nov 2014 12:53:35 PMAll use subject to JSTOR Terms and Conditions

    http://www.jstor.org/page/info/about/policies/terms.jsphttp://www.jstor.org/page/info/about/policies/terms.jsphttp://www.jstor.org/page/info/about/policies/terms.jsp
  • 8/10/2019 1905702

    21/33

    24

    GERHARD

    TINTNER

    This

    system

    of linear

    homogeneous equations will

    again

    have

    a non-

    trivial

    solution

    if we

    substitute for

    Xv' a solution for

    the

    modified de-

    terminantal

    equation (6). The

    modification consists in

    leaving

    out

    the

    s,

    rows and

    columns

    corresponding

    to the

    variables that

    by

    assumption

    do not enter

    into the

    specific

    vth

    equation.

    Using an

    approximation

    given

    by Koopmans76

    we can

    now compute

    the

    variances

    and

    covariances of

    the

    kq '

    under

    the

    assumption that

    the

    variances of the

    systematic parts are

    much

    larger

    than the ones

    of

    the

    disturbances.

    The

    covariance of

    ku i

    and

    kj'

    is then

    given

    by the

    formula:

    (17)

    Ekcvi kvi

    Xv/Cij(

    E

    kh Vh

    +

    V1)

    (N

    -

    pv').

    Here

    c

    is the

    element

    corresponding to a

    i

    (i,

    j-2, 3,

    ,

    Pv') in

    the

    matrix inverse to the

    one

    used in (16) to

    compute

    the

    k,j'.

    The method

    of

    computation

    of this matrix

    is

    the same

    as the one given

    by

    Fisher77

    in ordinary

    classical

    regression

    analysis;

    pv'

    is the

    number of

    variables

    actually

    entering

    into the

    particular

    equation

    v.

    We obtain

    the

    variance of

    ki'

    by putting

    in formula

    (17) i=j

    and

    the standard error by taking the square root. We can then use the

    t-test78

    to

    establish

    approximately

    the

    significance of

    the

    individual

    kv,

    '. The

    quantity t'

    =

    kv'

    /VEk,

    il2

    will

    follow the

    t-distribution with

    N

    -

    Pv'

    degrees of

    freedom for

    large samples.

    It

    should

    be mentioned

    that

    Anderson

    and Girshick

    established

    re-

    cently

    the

    distributions of

    the

    variances and

    covariances

    for some spe-

    cial

    cases.79

    These tests

    of

    significance

    should help

    somewhat

    in

    choosing

    out of

    the many possible ones a specific identifiable system that is economi-

    cally

    sound and

    that also

    has statistical

    significance.

    It

    cannot

    be

    denied, however,

    that the

    solutions

    reached are in

    some degree arbi-

    trary

    and rest rather

    heavily upon

    economic

    assumptions, the validity

    of

    which

    cannot be expressed

    in

    terms of

    probability. There does

    not

    seem

    to be any way

    in

    which this

    fundamental

    difficulty can be

    over-

    Comments

    on

    the General

    Problem of

    Nuisance

    Parameters,

    Annals of

    Mathe-

    matical

    Statistics, Vol. 11,

    September,

    1940, pp.

    271-283.

    7 T.

    Koopmans,

    Linear

    Regression Analysis in

    Economic

    Time Series, p.

    80,

    formula

    (47).

    77

    R. A.

    Fisher,

    Statistical Methods for

    Research

    Workers,8th ed. New York,,

    1941,

    pp. 150

    ff.

    78

    T.

    Koopmans, Linear

    Regression

    Analysis in Economic

    Time

    Series, p.

    95,

    formula

    (4).

    79

    T.

    W.

    Anderson and M. A.

    Girshick,

    Some Extensions

    of the

    Wishart

    Dis-

    tribution,

    Annals of

    Mathematical

    Statistics, Vol. 15,

    December,

    1944, pp. 345-

    357.

    This content downloaded from 132.248.147.90 on Mon, 3 Nov 2014 12:53:35 PMAll use subject to JSTOR Terms and Conditions

    http://www.jstor.org/page/info/about/policies/terms.jsphttp://www.jstor.org/page/info/about/policies/terms.jsphttp://www.jstor.org/page/info/about/policies/terms.jsp
  • 8/10/2019 1905702

    22/33

    MULTIPLE REGRESSION

    FOR SYSTEMS OF

    EQUATIONS

    25

    come except by the

    accumulation

    of econometric studies which combine

    sound economic theory with statistical

    verification

    and

    may eventually

    give

    economics the same firm theoretical and empirically

    verified basis

    that, in physics, is the envy of the social scientist. It is to be hoped

    that the

    above methods may make a

    modest contribution

    towards

    the

    achieving

    of

    this goal, or at least

    point

    the

    way

    to some

    more ade-

    quate approach.

    4.

    AN

    APPLICATION TO AGRICULTURAL

    DATA

    A. The

    Data

    We

    will

    now apply our method

    to

    an

    endeavor to fit

    a demand and a

    supply curve for agricultural products in the United States. Our data

    cover the

    period 1920-1943 (N

    =

    24). The data are

    annual

    figures.

    They are

    the following:80 X1,

    Prices received by farmers, 1910-14

    =

    100;

    X2, National income (billion $);

    X3, agricultural

    production,

    1935-39

    =

    100;

    X4, time, origin between

    1931 and 1932;

    X5,

    prices paid by

    farmers, 1910-14=100. The means of the variables are

    presented

    in

    Table 1.

    TABLE

    1

    ARITHMETIC MEANS (X,)

    Symbol

    Variable

    Mean

    Xi

    Prices

    received by

    farmers

    127.417

    X2

    National

    income

    72.3975

    X3

    Production

    100.625

    X4

    Time

    0.000

    X5

    Prices

    paid

    by

    farmers 136.460

    The variance-covariance matrix of the data is given in Table 2. Since

    all

    matrices

    are

    symmetrical,

    only

    the elements above the

    diagonal

    will

    be

    presented.

    TABLE 2

    VARIANCE-COVARIANCE MATRIX

    (aii)

    Xi X2

    X3 X4

    X5

    X1

    1212.593

    536.182

    87.594

    -72.696 532.000

    X2

    557.636

    207.147 75.022

    220.043

    X3

    106.418

    54.543 30.261

    X4

    50.000 -38.543

    X5

    240.652

    B. Estimation

    of

    the Variances

    of the Disturbances

    We use

    the

    variate difference method in order

    to

    get

    estimates

    of

    80

    The author is

    greatly obliged

    to

    Dr.

    James

    Cavin (Bureau

    of

    Agricultural

    Economics, Washington,

    D.

    C.)

    for

    supplying

    the data and

    suggesting

    the

    prob-

    lem.

    This content downloaded from 132.248.147.90 on Mon, 3 Nov 2014 12:53:35 PMAll use subject to JSTOR Terms and Conditions

    http://www.jstor.org/page/info/about/policies/terms.jsphttp://www.jstor.org/page/info/about/policies/terms.jsphttp://www.jstor.org/page/info/about/policies/terms.jsp
  • 8/10/2019 1905702

    23/33

    26

    GERHARD TINTNER

    the error variances.

    We present

    in Table 3 the appropriately

    computed

    variances of

    several difference

    series for our data:

    TABLE

    3

    VARIANCES

    OF DIFFERENCE

    SERIES

    Difference

    Xi

    X2 X3

    X5

    1

    337.885

    72.346

    J4.156

    62.863

    2

    130.649 19.068 10.698

    25.221

    3 47.901

    9.319

    7.961 12.472

    4

    28.580

    7.349 7.160

    9.812

    5 24.349

    7.056 6.827

    8.987

    Tests indicate

    that the variances probably

    become stable

    for the fifth

    difference series.

    Hence we

    will take the items in the last

    line of Table 3

    as

    the estimates

    of the error variances

    Vi

    of our variables.

    The variable time (X4) is

    evidently not

    subject to error and hence we

    have V4 =0.

    C. Estimation of

    the

    Number of Independent

    Linear Relationships

    In order to estimate the number of independent relationships among

    our variables

    we form

    the determinantal equation corresponding

    to

    formula

    (6):

    1212.593-24.349X 536.182

    87.594

    -72.696

    532.000

    557.636 -7.056X 207.147

    75.022 220.043

    (18)

    106.418 -6.827X

    54.543

    30.261

    =0.

    50.000

    -38.543

    240.652 -8.987X

    Since the variable

    X4

    (time) is not subject to error, we have

    V4

    =0 and

    the term

    with

    X

    is missing from the fourth

    line.

    The

    following table summarizes

    the latent roots

    (Xr)

    and the corre-

    sponding

    sums of squares

    (Ar) resulting

    from the determinantal e