29964572

Embed Size (px)

Citation preview

  • 8/13/2019 29964572

    1/7

    EBSCO Research Starters Copyright 2008 EBSCO Publishing Inc. All Rights Reserved

    RESEARCH STARTERSACADEMIC TOPIC OVERVIEWS

    Time SeriesStatistics > Time Series

    Abstract

    Time series data are used in business forecasting to examine pat-

    terns, trends, and cycles from the past in order to predict patterns,

    trends, and cycles in the future. In general, there are three objec-

    tives for the use of time series data: Understand the mechanism

    underlying the observed data, extrapolate the data in order to

    predict their effect on future behavior or data values, and control

    a process or system so that its outcome is more favorable to the

    business. Time series data can take several distinctive forms andcan be inuenced both by deterministic and stochastic variables.

    Although there are several approaches to modeling time series

    data, they often yield different results. Model building and fore-

    casting using time series data is a complicated process that is part

    art and part science. Human beings will always have to deter-

    mine which variables to include, where the line of best t lays,

    what time periods to consider, and how to interpret the results.

    This can be both strength and a weakness.

    Overview

    Managers and other business decision makers frequently need to

    determine the best course of action and develop strategic plans to

    help the organization reach its goals and objectives. To optimizethe worth of such decisions, good business strategy needs to be

    based on the rigorous analysis of empirical data. Sometimes the

    data are readily obtainable through activities such as an analysis

    of the organizations resources and abilities. Sometimes the data

    require research and analysis such as data concerning competito

    capabilities and offerings. Sometimes the data require creative

    guesswork such as determining market needs and trends in the

    future. Fortunately, there are a wide range of descriptive and

    inferential statistical tools available for analyzing and interpret

    ing the data that managers rely on to make decisions. One of the

    tools that is particularly helpful for the latter category of analy-

    sis where one needs to forecast trends as the basis for decisionmaking is time series analysis.

    Uses of Time Series Data

    Time series data are data gathered on one or more specic char

    acteristics of interest over a period of time at intervals of regula

    length. These data series are used in business forecasting to exam

    ine patterns, trends, and cycles from the past in order to predict

    patterns, trends, and cycles in the future. Time series analysi

    typically involves observing and analyzing the patterns in his

    torical data. These patterns are extrapolated to forecast future

    behavior. Most statistical analysis of time series data involves

    model building, the development of a concise mathematica

    description of past events. These mathematical models are used

    to forecast how the pattern will continue into the future. Time

    series are analyzed through several techniques including nave

    methods, averaging, smoothing, regression analysis, and decom

    position. These analysis techniques assume that the sequence

    of observations is a set of jointly distributed random variables

    Through the analysis of time series data, one can study the struc-

    ture of the correlation (i.e., the degree to which two events or

    variables are consistently related) over time to determine the

    appropriateness of the model.

    Abstract

    Keywords

    Overview

    Applications

    Terms & Concepts

    Bibliography

    Suggested Reading

    Table of Contents

  • 8/13/2019 29964572

    2/7

    Time Series Essay by Ruth A. Wienclaw, Ph.D.

    EBSCO Research Starters Copyright 2008 EBSCO Publishing Inc. All Rights Reserved Page 2

    Objectives of Time Series Data

    In general, there are several objectives for the use of time series

    data.

    First, time series data are often analyzed in order to under-

    stand the mechanism underlying the observed data and to

    build a model that describes the mechanism and its inuence

    on the variables of interest.

    Second, while understanding the mechanisms that inuence

    events and trends is of interest, time series data are fre-

    quently analyzed not only to understand these mechanisms,

    but more importantly to extrapolate them in order to predict

    their affect on future behavior or data values.

    Third, although sometimes knowing this information is

    sufcient for making decisions and developing strategies,

    there are also situations in which the results of the analysis

    of time series data can also give organizations information

    that will enable them to control a process or system so that

    its outcome is more favorable to the business. For example,

    one might nd that changes in industry technology are pro-

    gressively making a current widget design obsolete and that

    sales are dropping of. If acquired in time, this information

    can point out areas where the organization needs to change

    (e.g., a new widget design that incorporates more technol-

    ogy might be appropriate) in order to change the trend in the

    future.

    Forms of Time Series Data

    As shown in Figure 1, real world time series data can take sev

    eral distinctive forms.Stationarity

    Figure 1a shows the viscosity data on a chemical product over

    time. These data remain fairly constant over time and are said to

    be constant about the mean (an arithmetically derived measure

    of central tendency in which the sum of the values of all the data

    points is divided by the number of data points). This character

    istic of the data is referred to as stationarity. Stationarity exists

    when the probability distribution of a time series does not change

    over time. Stationarity is of interest to analysts because it allows

    one to mathematically model the process with an equation with

    xed coefcients that estimate future values from past history. If

    the process is assumed to be stationary, the probability of a givenuctuation in the process is assumed to be the same at any given

    point in time. The time series data in Figure 1b, however, do no

    exhibit the same degree of constancy or stationarity. It is difcul

    to mathematically model a non-stationary process using a simple

    algebraic equation. However, it can be possible to use a simple

    mathematical procedure to transform non-stationary processes

    into ones that are approximately stationary for purposes of anal

    ysis. This allows the development of models to help the analys

    or decision maker better understand the underlying mechanisms

    in the data series.

    Figure 1: Three Examples of Time Series Data

    (From Mastrangelo, Simpson, & Montgomery, p. 828)

    COMPDOC

    Deterministic Variables

    A third general form that can be taken by time series data is illus

    trated in Figure 1c. Time series data that can be inuenced by

    various deterministic variables are those for which there are spe

    cic causes or determiners. This type of variable includes trends

    business cycles, and seasonal uctuations.

    Trends are persistent, underlying directions in which a factor

    or characteristic is moving in either the short, intermediate,

    or long term. Trends tend to be linear rather than cyclic, and

    grow or shrink steadily over a period of years. For example,

    a trend might be an increasing tendency for business to out-

    source and offshore technical support and customer service

    in many high tech companies. Trends are not necessarily lin-

    Autocorrelation

    Autoregression

    Autoregressive Integrated Moving Average

    Business Cycle

    Data

    Decomposition

    Deterministic Variables

    Forecasting

    Moving Average

    Seasonal Fluctuation

    Stationarity

    Stochastic

    Time Series Data

    Trend

    Variable

    Keywords

  • 8/13/2019 29964572

    3/7

    Time Series Essay by Ruth A. Wienclaw, Ph.D.

    EBSCO Research Starters Copyright 2008 EBSCO Publishing Inc. All Rights Reserved Page 3

    ear, however. For example, trends in new industries tend to

    be curvilinear as the demand for the new product or service

    grows after its introduction then declines after the product or

    service becomes integrated into the economy.

    Another type of deterministic factor is business cycles.

    These factors are continually recurring variation in total

    economic activity. Business cycles tend to occur across

    most sectors of the economy at the same time. For example,

    several years of a boom economy with expansion of eco-

    nomic activity (e.g., more jobs, higher sales) are frequently

    followed by slower growth or even contraction of economic

    activity.

    Another category of deterministic factor is seasonal uctua-

    tions. These are changes in activity that occur in a fairly reg-

    ular annual pattern and which are related to seasons of the

    year, the calendar, or holidays. Figure 1c shows a seasonal

    uctuation in soft drink sales, with sales going up during the

    warmer months and down during the colder months over a

    period of four years.

    Stochastic Variables

    In addition to deterministic variables, time series data can be

    inuenced by stochastic variables. These variables are caused

    by randomness or include an element of chance or probability

    and are caused by unpredictable factors. Stochastic variables can

    be either irregular or random. Examples of irregular stochastic

    variables include natural disasters such as earthquakes or oods,

    political disturbances such as war or changes in the political party

    in charge, strikes, and other external factors. Other unpredictable

    or random factors include situations such as high absenteeism

    due to an epidemic that could affect a businesss protability.

    The real world includes many variables both deterministic and

    stochastic that can affect time series data, and most real world

    situations and models include both types of variables.

    Applications

    Approaches to Time Series Modeling

    Regression Methods

    There are several approaches to modeling time series data. One of

    the primary tools for analyzing time series data and developing a

    mathematical model of real world situations is regression meth-ods. These are a family of statistical techniques that are used to

    develop a mathematical model for use in predicting one variable

    from the knowledge of another variable. Regression methods

    include both linear and nonlinear techniques. In linear regres-

    sion, a line of best t is tted to a data set to estimate the effect

    of a single independent variable. The slope of the line shows the

    impact of the independent variable on the dependent variable.

    Other regression methods are available for use in multivariate

    and nonlinear situations.

    Smoothing Techniques

    Nave Forecasting

    Another family of methods frequently used in building models

    from time series data is smoothing techniques. One approach to

    smoothing time series data is through nave forecasting models

    These are simple models that assume that future outcomes

    are best predicted by the more recent data in the time series.

    Under this philosophy, for example, one would assume that las

    months sales were a better predictor of next months sales than

    were the sales from six months ago. It should be borne in mind

    however, that nave forecasting models do not consider the pos-

    sibility of trends, business cycles, or seasonal uctuations. For

    example, if ten gross of widgets were sold last month, a nave

    model would conclude that ten gross of widgets would also be

    sold next month. Because of this assumption, nave forecasting

    models work better on data that are reported more frequently

    (e.g., daily or weekly) or in situations without trends or season

    ality. One danger in nave model forecasts lies in the fact tha

    they are often based on the observations of one time period. As a

    result, they can easily become a function of irregular uctuationin data (e.g., Acme corporation needed a one-time purchase of

    widgets to set up their new operations facility which accounts for

    the one-time high demand for widgets in the previous month).

    Averaging Models

    Smoothing of time series data can also be done using averaging

    models. These techniques take into account data from severa

    time periods, thereby neutralizing the problem of nave models in

    which the forecast is overly sensitive to irregular uctuations as

    illustrated in the previous example. In the simple average model

    forecasts for an upcoming time period are made using the aver

    age of the values for a specied number of previous time periods

    (e.g., the forecast of widgets sales for next month might be theaverage number of widgets sold per month over the past six

    months). In the moving average approach, however, the average

    value from previous time periods is used to forecast future time

    periods and is updated in each ensuing time period by including

    the new values not available in the previous average and dropping

    out the date from the earliest time periods. Although the moving

    average approach has the advantage of taking into account the

    most recent data available, it can be difcult to choose the opti

    mal length of time over which to compute the moving average

    Another drawback of the moving average approach is that it does

    not take in to account the effects of trends, business cycles, and

    seasonal uctuations. Because of this fact, analysts often use a

    weighted moving average which gives more weight to some timeperiods in the series than to others. For example, if three months

    ago the company introduced a newly redesigned product into the

    marketplace, the analyst might believe that the past three months

    reect the markets reaction to the new design and be better

    able to forecast the continuing reaction than if s/he did not have

    this information. Similarly, prediction of sales from last years

    holiday season may be better predictors of next years holiday

    sales than sales during the summer months. A third approach to

    smoothing time series data comprises exponential smoothing

  • 8/13/2019 29964572

    4/7

    Time Series Essay by Ruth A. Wienclaw, Ph.D.

    EBSCO Research Starters Copyright 2008 EBSCO Publishing Inc. All Rights Reserved Page 4

    techniques. Exponential techniques use weight data from previ-

    ous time periods with exponentially decreasing importance so

    that the new forecast is a product of the current forecast and the

    current actual value.

    Autoregression

    Another approach to modeling time series data is autoregres-

    sion, a multiple regression technique in which future values ofthe dependent variable are predicted from past values of the vari-

    able. This technique takes advantage of the relationship of values

    to the values of previous time periods. The independent vari-

    ables are time-lagged versions of the dependent variable so that

    one tries to forecast a future value of a variable from knowledge

    of that variables value in previous time periods. This approach

    can be particularly useful for locating both seasonal and cyclic

    effects.

    Mixed/Integrated Techniques

    Times series data can be modeled using mixed or integrated

    techniques that utilize both moving average and autoregres-

    sive techniques. One of these approaches is the autoregressiveintegrated moving average (ARIMA) model (also called the

    Box-Jenkins model). The ARIMA model is an integrated tool

    for understanding and forecasting using time series data that

    has both an autoregressive and a moving average component.

    ARIMA modeling techniques are powerful and frequently result

    in a better model than either the use of moving averages or

    autoregressive techniques alone. For example, ARIMA can be

    used to determine the length of the weights (i.e., how much of

    the past should be used to predict the next observation) and the

    values of these weights.

    Autocorrelation

    Models based on time series data can produce spurious results

    when the error terms of the model are correlated with each other

    in a situation referred to as autocorrelation or serial correlation.

    Autocorrelation can be problematic in the use of regression anal-

    ysis because regression analysis assumes that error terms are not

    correlated because they are either independent or random. When

    autocorrelation occurs, the estimates of the regression coefcients

    may be inefcient and both the variance of the error terms and

    the true standard deviation may be signicantly underestimated.

    In addition, the autocorrelation effect means that the condence

    intervals and t and F tests are no longer strictly applicable. A

    number of methods are available to determine whether or not

    autocorrelation is present in time series data (e.g., the Durbin-Watson test). Autocorrelated data can sometimes be corrected by

    the addition of independent variables or transforming variables.

    Determining Which Forecast to Use

    As discussed above, there are a number of techniques available

    to forecast stationary time series data (i.e., those that show no

    signicant trend, cyclic, or seasonal effects). However, these

    approaches often yield different results. To help determine which

    forecast better models a given set of data, it is necessary to deter-

    mine the amount of error produced by each technique. This is

    the difference between the forecasted value of a variable and the

    actual value of a variable. A number of techniques are available

    for doing this including mean error, mean absolute deviation

    mean square error, mean percentage error, and mean absolute

    percentage error. Data that are inuenced by trends can be ana

    lyzed by a number of approaches, including linear regression

    and regression using quadratic models. However, if these data

    are also inuenced by seasonal uctuations, other data analy

    sis techniques are more appropriate. One of the most frequently

    used of these techniques is decomposition, in which the time

    series data are broken down into the four component factors

    of trend, business cycle, seasonal uctuation, and irregular or

    random uctuation.

    Modeling Technologies

    Model building and forecasting using time series data is a com

    plicated process that is part art and part science. However, there

    are now a number of software packages available that can help

    in this task. The techniques and concomitant software for devel-

    oping forecasting models continue to improve. However, these

    techniques and tools are still not highly accurate for erraticpatterns with short life cycles. No matter how rened or power

    ful the tools, some randomness or noise will always remain in

    time series data. It is this noise that limits the upper limit of the

    forecast. In addition, forecasting cannot be done wholly based

    on mathematics. Human beings will always have to determine

    which variables to include, where the line of best t lies, what

    periods to consider, and how to interpret the results. This can be

    both a strength and a weakness.

    Terms & Concepts

    Autocorrelation:A problem occurring over time in regression

    analysis when the error terms of the forecasting model are cor-related. Also called serial correlation.

    Autoregression:A multiple regression technique used in fore-

    casting in which future values of the variable are predicted from

    past values of the variable.

    Autoregressive Integrated Moving Average (ARIMA): An

    integrated tool for understanding and forecasting using time

    series data. The ARIMA model has both an autoregressive and a

    moving average component. The ARIMA model is also referred

    to as the Box-Jenkins model.

    Business Cycle:A continually recurring variation in total eco-

    nomic activity. Such expansions or contractions of economic

    activity tend to occur across most sectors of the economy at the

    same time.

    Data:(sing. datum) In statistics, data are quantiable observa

    tions or measurements that are used as the basis of scientic

    research.

    Deterministic Variables:Variables for which there are specic

    causes or determiners. These include trends, business cycles, and

    seasonal uctuations.

  • 8/13/2019 29964572

    5/7

    Time Series Essay by Ruth A. Wienclaw, Ph.D.

    EBSCO Research Starters Copyright 2008 EBSCO Publishing Inc. All Rights Reserved Page 5

    Decomposition:The process of breaking down time series data

    into the component factors of trends, business cycles, seasonal

    uctuations, and irregular or random uctuations.

    Forecasting:In business, forecasting is the science of estimat-

    ing or predicting future trends. Forecasts are used to support

    managers in making decisions about many aspects of the busi-

    ness including buying, selling, production, and hiring.

    Moving Average: A method used in forecasting in which the

    average value from previous time periods is used to forecast

    future time periods. The average is updated in each ensuing time

    period by including the new values not available in the previous

    average and dropping out the date from the earliest time peri-

    ods.

    Seasonal Fluctuation:Changes in economic activity that occur

    in a fairly regular annual pattern. Seasonal uctuations may be

    related to seasons of the year, the calendar, or holidays.

    Stationarity:The condition of a random process where its sta-

    tistical properties do not vary with time.

    Stochastic:Involving chance or probability. Stochastic variables

    are random or have an element of chance or probability associ-

    ated with their occurrence.

    Time Series Data: Data gathered on a specic characteris-

    tic over a period of time. Time series data are used in business

    forecasting. To be useful, time series data must be collected at

    intervals of regular length.

    Trend: The persistent, underlying direction in which something

    is moving in either the short, intermediate, or long term. Identi-

    cation of a trend allows one to better plan to meet future needs.

    Variable: An object in a research study that can have more than

    one value. Independent variables are stimuli that are manipu-

    lated in order to determine their effect on the dependent variables

    (response). Extraneous variables are variables that affect the

    response but that are not related to the question under investiga-

    tion in the study.

    Bibliography

    Black, K. (2006).Business statistics for contemporary decision

    making(4th ed.). New York: John Wiley & Sons.

    Gilliland, M. & Leonard, M. (2006). Forecasting software the

    past and the future. Country Commerce, 33-36. Retrieved

    September 11, 2007, from EBSCO Online Database

    Business Source Complete.http://search.ebscohost.com/

    login.aspx?direct=true&db=bth&AN=21164212&site=eh

    ost-live

    Mastrangelo, C. M., Simpson, J. R., & Montgomery, D. C.

    (2001). Time series analysis. In Saul I. Gass, S. I. &

    Harris, C. M. (eds),Encyclopedia of Operations Research

    and Management Science(pp. 828-833). New York: Wiley

    Retrieved September 11, 2007, from EBSCO Online

    Database Business Source Complete. http://search.ebsco-

    host.com/login.aspx?direct=true&db=bth&AN=21891965

    &site=ehost-live

    Nazem, S. M. (1988).Applied time series analysis for business

    and economic forecasting. New York: Marcel Dekker.

    Pindyck, R. S. & Rubinfeld, D. L. (1998). Econometric models

    and economic forecasts.Boston: Irwin/McGraw-Hill.

    Suggested Reading

    Craighead, C. W. (2004). Right on target for time-series

    forecasting.Decision Sciences Journal of Innovative

    Education,2(2), p207-212. Retrieved September 12,

    2007, from EBSCO Online Database Business Source

    Complete. http://search.ebscohost.com/login.aspx?direct=t

    rue&db=bth&AN=13279969&site=ehost-live

    De Luna, X. (2001). Guaranteed-content prediction intervals

    for non-linear autoregressions.Journal of Forecasting,

    20(4), 265-72. Retrieved September 12, 2007, from

    EBSCO Online Database Business Source Complete.

    http://search.ebscohost.com/login.aspx?direct=true&db=bt

    h&AN=13638439&site=ehost-live

    Jarvis, C. H. & Stuart, N. (2001). Accounting for error when

    modelling with time series data: Estimating the develop-

    ment of crop pests throughout the year. Transactions in

    GIS,5(4), 327-343. Retrieved September 12, 2007, from

    EBSCO Online Database Business Source Complete.

    http://search.ebscohost.com/login.aspx?direct=true&db=bt

    h&AN=5439610&site=ehost-live

    Ng, S. & Vogelsang, T. J. (2002). Forecasting autoregres-

    sive time series in the presence of deterministic compo-

    nents.Econometrics Journal, 5(1), 196-224. Retrieved

    September 12, 2007, from EBSCO Online Database

    Business Source Complete. http://search.ebscohost.com/

    login.aspx?direct=true&db=bth&AN=6719925&site=eho

    st-live

  • 8/13/2019 29964572

    6/7

    Time Series Essay by Ruth A. Wienclaw, Ph.D.

    EBSCO Research Starters Copyright 2008 EBSCO Publishing Inc. All Rights Reserved Page 6

    Essay by Ruth A. Wienclaw, Ph.D.

    Ruth A. Wienclaw holds a Doctorate in industrial /organizational psychology with a specialization in organization development from

    the University of Memphis. She is the owner of a small business that works with organizations in both the public and private sectors,

    consulting on matters of strategic planning, training, and human /systems integration.

  • 8/13/2019 29964572

    7/7