Attitude Measurement Techniques-unit3

Embed Size (px)

Citation preview

  • 8/12/2019 Attitude Measurement Techniques-unit3

    1/23

    LESSON -3

    DATA COLLECTION AND ATTITUDE MEASUREMENT

    CONTENTS

    3.1INTRODUCTION3.2NON-DISGUISED, NON-STRUCTURED TECHNIQUES3.3DISGUISED, NON-STRUCTURED TECHNIQUES3.4RELIABILITY AND VALIDITY OF MEASUREMENTS3.5NON- DISGUISED, STRUCTURED TECHNIQUES3.6CONCEPT TESTING

    3.1 INTRODUCTION

    Definition of attitude: Attitude has been defined by Gene F. Summers as a predisposition to

    respond to an idea or an object.In marketing, this refers to the consumers predisposition about

    the product or service. If it is favorable, then the consumer is likely to purchase the product or

    service.

    Attitudes about products or services are composed of three elements:

    Beliefs such as the products strength or the economy of the product or service Emotional feelings such as likes or dislikes Readiness to respond to the product or service, i.e. to buy it.

    These three elements combine together to form an image of the product or service in the mind of

    the consumer. When the car manufacturer, the movie producer or the insurance company refers

    to the companys image, they are referring to some general averages of many individuals

    attitudes towards the company. Attitude measurement is commonly referred to as scaling.

    3.2NON-DISGUISED, NON-STRUCTURED TECHNIQUES

    The essence of these methods is that the purpose of the interview is not a secret and that there is

    no fixed structure for conducting the interview.

  • 8/12/2019 Attitude Measurement Techniques-unit3

    2/23

    Qualitative Research

    The most common method of obtaining information about the behavior, attitudes and other

    characteristics of people is to ask them. However it is not always possible, or desirable to use

    direct questioning to obtain information. People may be either unwilling or unable to give

    answers to questions they consider to be an invasion of their privacy, that adversely affect their

    self-perception or prestige, that are embarrassing that concern motivations that they do not fully

    understand or cannot verbalize, or for other reasons. Therefore additional approaches to

    obtaining such information may be necessary.

    Depth interviews and Projective techniques are frequently used by marketing researchers when

    direct questioning is impractical, more costly, or less accurate. These techniques generally

    referred to as Qualitative research.

    Depth interviews

    Individual depth interviews typically require 30-45minutes. The interviewer does not have a

    specific set of pre-specified questions that must be asked according to the order imposed by a

    questionnaire. Instead, there is freedom to create questions, to probe those responses that appear

    relevant, and generally to try to develop the best set of data in any way practical. However the

    interviewer must follow one rule; one must not consciously try to affect the content of the

    answers given by the respondents. The respondent must feel free to reply to the various

    questions, probes, and other, subtler, ways of encouraging responses in the manner deemed most

    appropriate.

    Subject of interest is discussed in detail. There is no fixed pattern for eliciting information from the respondents. Generally conducted by highly trained interviewers. They must be thorough in probing

    the respondents.

    The interviewee is asked about the subject of his choice, coffee, for example, and anattempt is made to explore the respondents attitudes in depth byprobing extensively into

    any other areas which may come up.

    Interviewers have a general series of topics that they will introduce perhaps such topicsas coffee, or sleep, and will introduce them from time to time if the respondent does not

    bring them up.

  • 8/12/2019 Attitude Measurement Techniques-unit3

    3/23

    Tone of the interview is permissive and the respondent is allowed to talk as much as helikes.

    The interviewer must not influence the answers of the respondent. The interpretation of the answers is very subjective and knowledge of human behavior is

    required to analyze the information received. Individual depth interviews uses three

    questioning techniques namely:

    Chapter 8: Scales and Attitude Measurement

    1. Laddering involves having respondents identify attributes that distinguish brands by asking

    questions. Each distinguishing attribute is then probed to determine why it is important or

    meaningful. These reasons are then probed to determine why it is important, and so forth. The

    purpose is to uncover the network of meanings associated with the product, brand, or concept.

    2. Hidden-issue questioning focuses on individual respondents feelings about sensitive issues.

    Analysis on focus on common underlying themes across respondents. These themes can then be

    used to guide advertising development

    3. Symbolic questioning requires respondents to describe the opposites of the product/ activity

    of interest or a specific attribute of the product/ activity. Individual depth interviews have been

    found to generate more and higher quality ideas on a per respondent basis than either focus or

    minigroups. They are particularly appropriate when:

    1. Detailed probing of an individuals behavior, attitude or needs is required;

    2. The subject matter under discussion is likely to be of a highly confidential nature (e. g.

    personal investment)

    3. The subject matter is of an emotionally charged or embarrassing nature;

    4. Certain strong, socially acceptable norms exist (e.g. baby feeding) and the need to conform in

    a group discussion may influence responses;

    5. Where highly detailed understanding of complicated behavior or decision making pattern (e.g.

    planning the family holiday) are required; or

    The interviews are with professional people or with people on the subject of their jobs 9 e.g.

    finance directors)

    Focus group discussions (F.G.Ds):

  • 8/12/2019 Attitude Measurement Techniques-unit3

    4/23

    The standard focus group interview in the United States involves 8 and 12 individuals and lasts

    about 2 hours. Normally each group is designed to reflect the characteristics of a particular

    market segment. The respondents are selected according to the relevant sampling plan and meet

    at a central location that generally has facility for taping and/ or filming the interviews. In

    Europe, focus tend to consist of 6 to 8 respondents, vary in length from 1.5 to 4 hours and are

    often conducted in the home of the recruiter. Otherwise the interviewers are similar. The

    discussion itself is led by a moderator. The moderator attempts to progress through three

    stages during the interviewer:

    (1) Establish rapport with the group, structure the rules of group interaction, and set objectives;

    (2) provoke intense discussion in the relevant areas; and

    (3) summarize the groups responses todetermine the extent of agreement. The general either

    the moderator or a second person prepares a summary of each session after analyzing the

    sessions transcript.

    Focus Group Interviews can be applied to:

    1. Basic- need studies for product idea creation,

    2. New product idea or concept exploration,

    3. Product positioning studies,

    4. Advertising and communications research,

    5. Background studies on consumers frames or reference,

    6. Establishment of consumer vocabulary as a preliminary step in questionnaire development

    7. Determination of attitudes and behavior.

    Advantages

    1. Each individual is able to expand and refine their opinions in the interaction with the other

    members. This process provides more detailed and accurateinformation than could be derived

    from each separately.

    2. A group interview situation is generally more exciting and offers more stimulation to the

    participants than the standard depth interviews.

    3. The security of being in a crowd encourages some members to speak out when they otherwise

    would not.

    4. As the questions raised by the moderator are addressed to the entire group rather than an

    individual the answer contains a degree of spontaneity that is not produced by other techniques.

  • 8/12/2019 Attitude Measurement Techniques-unit3

    5/23

    5. Focus groups can be used successfully with children over five. They are also very useful with

    adults in developing countries where literacy rates are low and survey research is difficult. 88

    6. A final major advantage of focus groups is that executives often observe the interview (from

    behind mirrors) or watch films of the interview.

    Disadvantages

    1. Since focus group interviews last 1.5 to 3 hours and take place at a central location, securing

    cooperation from a random sample is difficult.

    2. Those who attend group interviews and actively participate in them are likely to be different in

    many respects from those who do not.

    3. There are chances that participants may go along with the popular opinion instead of

    expressing their own which may be contrary to the popular opinions.

    4. The presence of a one-way mirror and /or an observer(s) has been found to distort participants

    responses.

    5. The moderator can introduce serious biases in the interview by shifting topics too rapidly

    verbally or nonverbally encouraging certain answers, failing to cover specific areas, and so forth.

    6. Focus groups are expensive on a per respondent basis.

    Minigroups

    Minigroups consist of a moderator and 4 and 5 respondents rather than the 8 to 12 used in most

    focus groups. They are used when the issue being investigated requires more extensive probing

    than is possible in a larger group. Minigroups do not allow the collection of a confidential or

    highly sensitive data as might be possible in an individual depth interview. However, they do

    allow the researcher to obtain substantially depth of response on the topics that are covered.

    Further the intimacy of the small group often allows discussion of quite sensitive issues.

    The advantages and disadvantages of minigroups are similar to those of standard focus groups,

    but on a smaller scale.

    In principle, these interviews are the same as the previous ones, excepting that they areconducted in groups rather than for individuals.

    This method is therefore less expensive and less time consuming than the depthinterviews.

    This method is advantageous because it gives excellent leads to consumer attitudes thatno other method can give.

  • 8/12/2019 Attitude Measurement Techniques-unit3

    6/23

    Another advantage of this method is that each respondent receives stimulation forresponding from his group members and so the interviewer need not prompt the

    interviewee to answer.

    The disadvantage here is that one or two members could dominate in the group andothers might not get a chance to answer. This would again make it an individual effort.

    3.3DISGUISED, NON-STRUCTURED TECHNIQUES

    The essence of these methods is that the interviewee either does not know that his attitude is

    being studied, or does not know for which company the survey is being done, or sometimes he

    does not know both. It involves using various vague stimuli to which the respondent is asked to

    respond. In doing so, it is believed that the respondent reveals several elements of his/ her

    attitude that he would not have revealed in the face of direct questions.

    These tests are not difficult to administer because they are like games played with the

    respondents. Generally, respondents seem to enjoy the exercise.

    Projective Techniques

    Projective Techniques are based on the theory that the description of vague objects requires

    interpretation and this interpretation can only be based on the individuals own background,

    attitudes, and values. The more vague or ambiguous the object to be described the move one

    must reveal of oneself in order to complete the description.

    The following general categories of projective techniques are described:

    Association, Completion, Construction and expression.

    Association Techniques

    Association techniques require the subject to respond to the presentation of a stimulus with the

    first things that come to mind. The word association technique requires the respondent to give

    the first word or thought that comes to mind after researcher presents a word or phrase. In free

    association only the first word or thought is required. In successive word association, the

    respondent is asked to give a series of words or thoughts that occur after hearing a given word.

    The respondent is generally read a number of relatively neutral terms to establish the technique.

    Then the words of interest to the researcher are presented, each separated by several neutral

    terms. The order of presentation of the key words is randomized to prevent any position or order

  • 8/12/2019 Attitude Measurement Techniques-unit3

    7/23

    bias from affecting the results. The most common approach to analyzing the resulting data is to

    analyze the frequency with a particular word or category or word is given in response to the

    word of interest to the researcher.

    Word association techniques are used in testing potential brand names and occasionally for

    measuring attitudes about particular products, product attributes,

    brands, packages or advertisements.

    Completion Techniques

    This technique requires the respondent to complete an incomplete stimulus. Two types of

    completion are of interest to marketing researchers-

    Sentence completion, as the name implies, involves requiring the respondent to complete a

    sentence. In most sentence completion tests the respondents are asked to complete the sentence

    with a phrase. Generally they are told to use the first thought that comes to their mind or

    anything that makes sense. Because theindividual is not required directly to associate himself

    or herself with the answer conscious or subconscious defenses are more likely to be relaxed and

    allow a more revealing answer.

    Story completion is an expanded version of sentence completion. As the name suggests part of a

    story is told and the respondent is asked to complete it.

    Construction Techniques

    This technique requires the respondent to produce or construct something generally a story,

    dialogue, or description. They are similar to completion techniques except that less initial

    structure is provided.

    Cartoon techniques present cartoon-type drawings of one or more people in a particular situation.

    One or more of the individuals are shown with a sentence in bubble form above their heads and

    one of the others is shown with a blank bubble that the respondent is to fill in. Instead of

    having the bubble show replies or comments, it can be drawn to indicate the unspoken thoughts

    of one or more of the characters. This device allows the respondent to avoid any restraints that

    might be felt against having even a carton character speak as opposed to think certain thoughts.

    Third- person techniques allow the respondent to project attitudes onto some vague third person.

    This third person is generally an average woman, your neighbors, the guys where you

    work, most doctors or the like. Thus instead of asking the respondent why he or she did

  • 8/12/2019 Attitude Measurement Techniques-unit3

    8/23

    something or what he or she thinks about something the researcher asks what friends, neighbors

    or the average person thinks about the issue.

    Picture response, another useful construction technique, involves using pictures to elicit stories.

    These pictures are usually relatively vague, so that the respondent must use his or her

    imagination to describe what is occurring.

    Fantasy scenario requires the respondent to make up a fantasy about the product or brand.

    Personification asks the respondent to create a personally for the products or brands.

    Expressive Techniques

    Role-playing is the only expressive technique utilized to any extent by marketing researchers. In

    role playing the consumer is asked to assume the role or behavior of an object or another person,

    such as a sales representative for a particular department store. The role-playing customer can

    then be asked to try to sell a given product to a number of different consumers who raise

    varying objections. The means by which the role player attempts to overcome these objections

    can reveal a great deal about his or her attitudes. Another version of the technique involves

    studying the role-players attitudes on what type of people should shop at the storein question.

    Problems

    As projective techniques generally require personal interviews with highly trained interviewers

    and interpreters to evaluate the responses, they tend to be very expensive.

    Small sample sizes can increase the probability of substantial sampling error. The reliance on

    small samples often has been accompanied by non-profitability selection procedures.

    Some of the projective techniques require the respondents to engage in behavior that may well be

    strange to them; this is particular true for techniques such as role plays. Thus there is reason

    enough to believe that there might be an error in the findings. Measurement error is also a serious

    issue with respect to projective techniques. The possibility of interpreter bias is obvious.

    Word Association

    One of the oldest and simplest projection techniques. Respondents are presented with a number of different words, one at a time.After each

    word, they are asked to give the first word that comes to mind.

    The assumption here is that through free words, the respondents will indicatetheir innerfeelings about the subject.

  • 8/12/2019 Attitude Measurement Techniques-unit3

    9/23

    Responses are timed so that those responses that respondents reason outare identifiedand taken into account in the analysis. The time limit is usually 5 seconds.

    The usual way of constructing such a test is to choose many stimulating and neutralwords. The words are read out to the respondent one at a time, and the interviewer

    essentially records the first word association by therespondent.

    Respondents should not be asked to write their responses because then the interviewerwill not know if the responses were spontaneous or whether the respondent took time to

    think out the responses.

    An example of such a test is: who would eat a lot of oatmeal? The first response isathletes. This means that the respondent feels that the product is more suited for

    sportspersons. More words on the same topic will reveal more about the respondents

    attitude about the product.

    While analyzing the results of word-association tests, responses are arranged along suchlines as favorable - unfavorable and pleasant unpleasant.

    Sentence Completion

    The respondent is given a number of incomplete sentences and asked to complete them. The rule here too, is that respondent must fill in the first thought that comes to mind. Responses are timed. Here the interviewer gets more information than the word association technique. However, it is difficult to disguise the motive of the study from the respondent, who is

    usually able to diagnose the investigators purpose of study.

    For example, a man who reads Sportstar is ------------------------------------------.

    The sentences can be worded in either first or third person. No evidence suggests that one of

    these approaches could be better than the other.

    Story Completion

    Respondents are given a half-completed story. This is enough to draw their attention to aparticular issue, but the ending is left vague, so that responses can be varied.

    This technique is very versatile and has numerous applications to marketing problems. The findings about products/ services give companies inputs to determine advertising and

    promotional themes and product characteristics.

    Pictorial Techniques

  • 8/12/2019 Attitude Measurement Techniques-unit3

    10/23

    These are similar to story completion method, except that here pictures are used as the stimuli.

    The two main methods used here are

    a) Thematic Apperception Tests (TAT)

    b) Cartoon method

    TAT

    Clinical psychologists have long used this method. Here the respondent is shown many ambiguous pictures and he is asked to spin stories

    about them.

    The interviewer may ask questions to help the respondent to think. For example what ishappening here? makes the answer focused towards an action. Or which one is the

    aggressor? makes the respondent think about the picture as one of aggression. The

    reason that respondents must be asked such prompting questions is that the pictures are

    very abstract and general and as such are open to very broad and irreverent

    interpretations. So some amount of focus is needed to channel the respondents thinking.

    Each subject in the pictures is a medium through which the respondent projects his

    feelings, ideas, emotions and attitudes. The respondent attributes these feelings to the

    characters because he sees in the picture something related to himself.

    Responses differ widely and analysis depends upon the ambiguity of the picture, theextent to which the respondent is able to guess the conclusions and the vagueness of the

    support questions asked by the interviewer.

    Cartoon Tests

    They are a version or modification of the TAT, but they are simpler to administer and analyze.

    Cartoon Characters are shown in a specific situation pertinent to a problem.

    One or more balloons indicating the conversation of the characters is left open. The respondent

    has to then fill these balloons and then analyzed.

    3.4RELIABILITY AND VALIDITY OF MEASUREMENTS

    The terms validity, reliability & accuracy are often used interchangeably. But each of these has a

    specific meaning based on the type of measurement error that is present.

    There are two types of measurement errors:

    1. Systematic errors

    2. Variable errors

  • 8/12/2019 Attitude Measurement Techniques-unit3

    11/23

    A systematic error, also known as bias, is one that occurs in a consistent manner each time

    something is measured. For e.g.: A biased question would produce an error in the same direction

    each time it is asked. Such an error would be a systematic error.

    A variable error is one that occurs randomly each time something is measured. For

    e.g.: A response that is less favorable than the true feeling because the respondent was in a bad

    mood (temporary characteristic) would not occur each time that individuals attitude is

    measured. In fact, an error in the opposite direction (overly favorable) would occur if the

    individual were in a good mood. This represents a variable error.

    The term reliability is used to refer to the degree of variable error in a measurement.

    We define reliability as the extent to which a measurement is free of variable errors.

    This is reflected when repeated measures of the same stable characteristic in the same objects

    show limited variation.

    A common conceptual definition for validity is the extent to which the measure provides an

    accurate representation of what one is trying to measure. In this conceptual definition, validity

    includes both systematic and variable error components. However, it is more useful to limit the

    meaning of the term validity to refer to the degree of consistent or systematic error in a

    measurement. Therefore we define validity as the extent to which a measurement is free from

    systematic error.

    Measurement accuracy is then defined as the extent to which a measurement is free from

    systematic and variable error. Accuracy is the ultimate concern of the researcher since a lack of

    accuracy may lead to incorrect decisions.

    Reliability

    There are various operational approaches for estimation of reliability. The following table

    summarizes these approaches.

    Approaches to assessing reliability

    No. Approach Description

    Test-retest reliability

    Applying the same measure to the same objects a second time

    Alternative-forms reliability

    Measuring the same objects by two instruments that are designed to be as nearly alike as possible

    Internal-comparison reliability

  • 8/12/2019 Attitude Measurement Techniques-unit3

    12/23

    Comparing the responses among the various items on a multiple-item index designed to measure

    a homogeneous concept

    Scorer reliability Comparing the scores assigned by two or more judges No one approach is the

    best; several different assessment approaches should generally be used. The selection of one or

    more means of assessing a measuresreliability depends on the errors likely to be present and the

    cost of each assessment method in the situation at hand.

    Test-Retest Reliability

    Test-retest reliability estimates are obtained by repeating the measurement using the same

    instrument using the same instrument under as nearly equivalent conditions as possible. The

    results of the two administrations are then compared and the degree of correspondence is

    determined. The greater the differences, the lower the reliability A number of practical and

    computational difficulties are involved in measuring test retest reliability.

    They are:

    Some items can be measured only once. For e.g.: It would not be possible to remeasurean individuals initial reaction to a new advertising slogan.

    In many situations, the initial measurement may alter the characteristic being measured.Thus an attitude survey may focus the individuals attention on the topic and cause new

    or different attitudes to be formed about it.

    There may be some form of a carryover effect from the first measure. The retaking of ameasure may produce boredom, anger, or attempts to remember the answers given in the

    initial measurement.

    Factors extraneous to the measuring process may cause shifts in the characteristic beingmeasured. For e.g.: A favorable experience with the brand during the period between the

    test and the retest might cause a shift in individual ratings of that brand.

    Alternative-Form Reliability

    Alternative-form reliability estimates are obtained by applying two equivalent forms of the

    measuring instrument to the same subjects. As in test-retest reliability, the results of the two

    instruments are compared on an item-by-item basis and the degree of similarity is determined.

    The basic logic is the same as in test-retest approach.

    Two primary problems are associated with this approach.

    They are:

  • 8/12/2019 Attitude Measurement Techniques-unit3

    13/23

    The extra time, expense and trouble involved in obtaining two equivalent measures. More

    importantly, the problem of constructing two truly equivalent forms Thus a low degree of

    response similarity may reflect either an unreliable instrument or nonequivalent forms. Despite

    these difficulties, researchers should use alternative measures of important concepts whenever

    possible to allow assessment of reliability (and validity) as well as to improve accuracy (by using

    the data from both the measures)

    3. Internal-Comparison Reliability

    Internal-comparison reliability is estimated by the intercorrelation among the scores of the items

    on a multiple-item index. All items on the index must be designed to measure precisely the same

    thing. For e.g.: measures of store image generally involve assessing a number of specific

    dimensions of the store, such as price level, merchandise, service, and location. Because these

    are somewhat independent, an internal-comparison measure of reliability is not appropriate

    across dimensions. However, it can be used within each dimension if several items are used to

    measure each dimension.

    Split-half reliability is the simplest type of internal comparison. It is obtained by comparing the

    results of half the items on a multi-item measure with the results from the remaining items. The

    usual approach to split-half reliability involves dividing the total number of items into two

    groups on a random basis and computing a measure of similarity.

    A better approach to internal comparison is known as coefficient alpha. This measurement, in

    effect, produces the mean of all possible split-half coefficients resulting from different splitting

    of the measurement instrument. Coefficient alpha can range from 0 to 1. A value of .6 or less is

    usually viewed as unsatisfactory.

    4. Scorer Reliability:

    Marketing researchers frequently rely on judgment to classify a consumers response. This

    occurs, for e.g., when projective techniques, focus groups, observation, or open ended questions

    are used. In these situations, the judges or scorers, may be unreliable, rather than the instrument

    or respondent. To estimate the level of scorer reliability, each scorer should have some of the

    items him or her scores judged independently by another scorer. The correlation between the

    various judges is a measure of scorer reliability.

    Validity

  • 8/12/2019 Attitude Measurement Techniques-unit3

    14/23

    Validity like reliability is concerned with error. However it is concerned with consistent or

    systematic error rather than variable error. A valid measurement reflects only the characteristics

    of interest and random error. There are three basic types of validity. They are:

    1. Content validity

    2. Construct validity and

    3. Criterion-related validity (predictive and concurrent)

    These are summarized in the table below:

    Basic Approaches to Validity Assessment

    No. Approach Description

    1 Content validation Involves assessing the representativeness or the sampling adequacy of the

    items contained in the measuring instrument

    2 Criterion-related validation

    Involves inferring an individuals score or standing on some measurement, called a criterion,

    from the measurement at hand

    a. Concurrent validation Involves assessing the extent to which obtained score may be used to

    estimate an individuals presentstanding with respect to some other variable

    b Predictive validation Involves assessing the extent to which obtained score may be used to

    estimate an individuals futurestanding with respect to the criterion variable

    3 Construct validation Involves understanding the meaning of the obtained measurements

    Content Validity

    Content validity estimates are essentially systematic, but subjective, evaluations of the

    appropriateness of the measuring instrument for the task at hand. The term face validity has a

    similar meaning. However, face validity generally refers to nonexpertjudgments of individuals

    completing the instrument and/or executives who must approve its use. This does not mean that

    face validity is not important. Respondents may refuse to cooperate or may fail to treat seriously

    measurements that appear irrelevant to them. Managers may refuse to approve projects that

    utilize measurements lacking in face validity. Therefore, to the extent possible, researchers

    should strive for face validity.

    The most common use of face validity is with multi-item measures. In this case, the researchers

    or some other individual or group of individuals assesses the representativeness, or sampling

    adequacy, of the included items in light of the purpose of the measuring instrument. Thus, an

  • 8/12/2019 Attitude Measurement Techniques-unit3

    15/23

    attitude scale designed to measure the overall attitude towards a shopping center would not be

    considered to have content validity if it omitted any major attributes such as location, layout and

    so on. Content validation is the most common form of validation in applied marketing research.

    Criterion-Related Validity:

    Criterion-related validity can take two forms, based on the time period involved. They are:

    1. Concurrent validity and

    2. Predictive validity

    Concurrent validity is the extent to which one measure of a variable can be used to estimate an

    individuals current score on a different measure of the same, or a closely related variable. For

    e.g.: a researcher may be trying to relate social class to the use of savings and loan associations.

    In a pilot study, the researcher finds a useful relationship between attitudes towards savings and

    loan associations and social class, as defined by Warners ISC scale. The researcher now wishes

    to test this relationship further in a national mail survey. Unfortunately, Warners ISC is difficult

    to use in a mail survey. Therefore, the researcher develops brief verbal descriptions of each of

    Warners six social classes. Respondents will be asked to indicate the social class that best

    describes their household. Prior to using this measure, the researcher should assess its concurrent

    validity with the standard ISC scale.

    Predictive validity is the extent to which an individuals future level on some variable can be

    predicted by his or her performance on a current measurement of the same or a different variable.

    Predictive validity is the primary concern of the applied marketing researcher.

    Some of the predictive validity questions that confront marketing researchers are:

    (a) Will a measure of attitudes predict future purchases?

    (b) Will a measure of sales in a controlled store test predict future market share?

    (c) Will a measure of initial sales predict future sales? and

    (d) Will a measure of demographic characteristics of an area predict the success of a branch bank

    in the area?

    Construct Validity

    Construct validityunderstanding the factors that underline the obtained measurement is the

    most complex form of validity. It involves more than just knowing how well a given measure

    works; it also involves knowing why it works.

  • 8/12/2019 Attitude Measurement Techniques-unit3

    16/23

    Construct validity requires that the researcher have a sound theory of the nature of the concept

    being measured and how it relates to the other concepts. A number of approaches exist for

    assessing construct validity of which the most common is called multi trait-multi method matrix

    approach. These multiple measures (by methods as different from each other as possible) of

    multiple traits or concepts can be analyzed by the Campbell-Fiske procedure, confirmatory factor

    analysis, or direct product model.

    These techniques generally involve ensuring that the measure correlates positively with other

    measures of the same construct (convergent validity), does not correlate with theoretically

    unrelated constructs (discriminant validity), correlates in the theoretically predicted way with

    measures of different but related constructs (nomological validity), and correlates highly with

    itself (reliability). For e.g.: suppose we develop a multi-item scale to measure the tendency to

    purchase prestige brands. Our theory suggests that this tendency is caused by three personality

    variables. They are:

    1. Low self-focus

    2. High need for status and

    3. High materialism

    We believe that it is unrelated to brand loyalty and the tendency to purchase new products.

    Evidence of construct validity would exist if our scale:

    1. Correlates highly with other measures of prestige brand preference such as reported purchases

    and classifications by friends (convergent validity);

    2. Has a low correlation with the unrelated constructs brand loyalty and tendency to purchase

    new products (discriminant validity);

    3. Has a low correlation with self-focus and high correlations with need for status and

    materialism (nomological validity); and

    4. Has a high level of internal consistency (reliability)

    3.5NON- DISGUISED, STRUCTURED TECHNIQUES

    The nonstructured techniques for attitude measurement are primarily of value in exploratory

    studies, where the researcher is looking for the salient attributes of given products and the

    important factors surrounding purchase decisions as seen by the consumer. Structured techniques

    can provide a more objective measurement system, one that is more comparable to a scale or

  • 8/12/2019 Attitude Measurement Techniques-unit3

    17/23

    yardstick. The term scaling has been applied to the efforts to measure attitudes objectively, and a

    number of useful scales have been developed.

    Nominal Data

    A set of data is said to be nominal if the values / observations belonging to it can be assigned a

    code in the form of a number where the numbers are simply labels. You can count but not order

    or measure nominal data. For example, in a data set males could be coded as 0, females as 1;

    marital status of an individual could be coded as Y if married, N if single.

    Ordinal Scales

    They are the simplest attitude measuring scales used in marketing research.

    They serve to rank respondents according to some characteristics such as favorabiliy to a certain

    brand, or to rank items such as brands in order of consumer preference.

    They do not measure the degree of favorability of the different rankings. All the scale tells is that

    the individual or item has more, less, or the same amount of the characteristic being measured as

    some other time.

    They are the most widely used type of scales in marketing research.

    A set of data is said to be ordinal if the values / observations belonging to it can be ranked (put in

    order) or have a rating scale attached. You can count and order, but not measure, ordinal data.

    The categories for an ordinal set of data have a natural order, for example, suppose a group of

    people were asked to taste varieties of biscuit and classify each biscuit on a rating scale of 1 to 5,

    representing strongly dislike, dislike, neutral, like, strongly like. A rating of 5 indicates more

    enjoyment than a rating of 4, for example, so such data are ordinal. However, the distinction

    between neighboring points on the scale is not necessarily always the same. For instance, the

    difference in enjoyment expressed by giving a rating of 2 rather than 1 might be much less than

    the difference in enjoyment expressed by giving a rating of 4 rather than 3.

    Interval Scales

    They separate individuals or items by rank order but measure the distance between rankpositions in equal units.

    Such a scale permits the researcher to say that the position 4 is above position 3 on thescale, and also the distance from position 5 to 4 is same as from 4 to 3.

    Such a scale however does not permit conclusions that position 6 is twice as strong asposition 3 because no zero position has been established. An interval scale is a scale of

  • 8/12/2019 Attitude Measurement Techniques-unit3

    18/23

    measurement where the distance between any two adjacent units of measurement (or

    'intervals') is the same but the zero point is arbitrary. Scores on an interval scale can be

    added and subtracted but cannot be meaningfully multiplied or divided. For example, the

    time interval between the starts of years 1981 and 1982 is the same as that between 1983

    and 1984, namely 365 days. The zero point, year 1 AD, is arbitrary; time did not begin

    then. Other examples of interval scales include the heights of tides, and the measurement

    of longitude.

    Ratio Scales

    If one measures the distance between two points as four feet and between two otherpoints as two feet, it is possible say that one distance is twice that of the other because

    each distance is measured from an absolute zero. A scale that permits such measurements

    is called ratio scale.

    While ratio scales are common in physical science, the measurement of attitudes is stillso crude that they are of little significance in marketing research.

    Semantic Differential Scale

    It is a special type of graphic scale, which is increasingly being used, in marketingresearch.

    It establishes a connection between the brand and company image studies and alsopermits the development of descriptive profiles that facilitates comparison of competitive

    items.

    The unique characteristics of semantic differential is the use of bipolar scales to rate anyproduct, company or concept of interest.

    Respondents are given a group of these scales and asked to check on each one point thatindicates their opinion of the subject in question.

    Each scales consist of two opposing adjectives such as good/bad, clean/dirty mostpopular/ least popular, etc. which are separated by a continuum divided into seven

    segments.

    Respondents are asked to check the segment that represents the degree of thecharacteristics involved that most closely coincided with their opinion of the product or

    item being rated.

  • 8/12/2019 Attitude Measurement Techniques-unit3

    19/23

    It is best when used for image descriptive purposes and is not recommended for overallattitude measurement.

    The advantage of using semantic differential is its simplicity, while producing resultscomparable with those of the more complex scaling methods

    The method is easy and fast to administer, but it is also sensitive to small differences inattitude, highly versatile, reliable and generally valid.

    The Constant Sum Scale

    The constant sum scale requires the respondent to divide a constant sum, generally 10 or 100,

    among two or more objects or attributes on order to reflect the respondents relative preference

    for each object, the importance of the attribute, or the degree to which an object contains each

    attribute.

    The constant sum scale can be used in two cases:

    1. For two objects at a time (paired comparison) or

    2. More than two objects at a time (quadric comparison)

    Advantages

    When rank order data is used the researcher has no way of knowing the characteristic which is of

    overwhelming importance or of knowing a characteristic which is not much more important than

    other characteristics.

    Disadvantage

    A disadvantage could be that individuals could occasionally misassign points such that the total

    is more than, or less than 100. This can be adjusted for by dividing each point allocation by the

    actual total and multiplying the result by 100.

    Thurstone Scale

    It is one of the Multi Item Scales developed by L.L. Thurstones method of Equal Appearing

    Intervals on the concept that, even though people could not assign quantitative measures to their

    own attitudes, they could tell the difference between the attitudes represented by two different

    statements and could identify items that were approximately halfway between the two. The

    procedure is as follows 1. Collect a large number of statements (perhaps as may as several

    hundred) related to the attitude in question 2. Have a number of judges (perhaps 20 or more) sort

  • 8/12/2019 Attitude Measurement Techniques-unit3

    20/23

    the statements independently into 11 piles that vary from the most favorable statement to neutral

    statements to most unfavorable statements.

    3. Study the frequency distribution of ratings for each statement and eliminate those statements

    that the different judges have given widely scattered ratings that are in a number of different

    piles

    4. Determine the scale value of each of the remaining statements that is, the number of the pile

    in pile in which the median of the distribution falls

    5. Select one of the two statements from each of the 11 piles for the final scale.

    Those statements with the narrowest range of rating are preferred as the most reliable.

    Advantages

    It is important to note that there are 11 attitude positions because in a scale with odd number of

    parameters, it is easier to identify a neutral position.

    Disadvantages

    1. Thurstone scales are not widely used for Marketing Research because the are time consuming

    during preparation

    2. The ratings may be influenced by the Judges personal attitude

    3. Different individuals can obtain exactly the same score from agreeing with quite different

    items.

    4. It does not obtain information about the intensity of agreement with the ratings

    Likert Scale

    These scales are sometimes referred to as summated scales. It requires a respondent to indicate a

    degree of agreement or disagreement with each of a series of statements related to the attitude

    object.

    For Example:

    The service at a retail store is very important to me:

    ____ Strongly Agree ____ Agree ____ neither Agree nor Disagree ____ Disagree

    ____ Strongly Disagree

    To analyze a Likert Scale, each response category is assigned a numerical value.

    These examples could be assigned values such as Strongly Agree=1, through

  • 8/12/2019 Attitude Measurement Techniques-unit3

    21/23

    Strongly Disagree=5 or the scoring could be reversed. or a 2 through +2 system could be used.

    They can be analyzed on an item-by-item basis, or they can be summed to form a single score for

    each individual.

    Advantages

    1. It is relatively easy to construct and administer.

    2. Instructions that accompany the scale are easily understood; hence it can be used for mail

    surveys and interviews with children.

    Disadvantages

    1. It takes a longer time to complete as compared to Semantic Differential Scales,etc.

    2. Care needs to be taken when using Likert Scales in cross cultural research, as there may be

    cultural variations in willingness to express disagreement.

    Comparison of Thurstone and Likert Scale

    It is obvious that these two scales have a lot in common. They have been widely used in the past.

    Due to the ordinal nature of the Likert scales, many individuals feel they it may be more reliable

    that the Thurstone Scale.

    DISGUISED, STRUCTURED TECHNIQUES

    The basis premise underlying such tests is that the respondents will reveal their attitudes by the

    extent to which their answers to the objective questions vary from the correct answers to

    objective questions that vary from the correct answers. Respondents are provided with questions

    that they are no able to answer correctly. Thus, they are forced to guess at the answers. The

    direction and extent of these guessing errors is assumed to reveal their attitudes on the subject.

    For example, individuals tend to gather information that supports their attitudes and, therefore,

    the extent and kind of information individuals possess on a given subject indicate something of

    their attitude.

    For example:

    How much do you think it cost for the hot cereal alone in a average bowl of cereal such as youd

    serve at the breakfast?

    Do corn flakes cost less or more per bowl than cereal?

  • 8/12/2019 Attitude Measurement Techniques-unit3

    22/23

    3.6CONCEPT TESTING

    Attitude Scale: Sets of rating scales used to measure one or more dimensions of an individuals

    attitude toward some object. Attitude scales are constructed using likert, semantic differential or

    Stapel scales.

    Concurrent Validity: A measure of how accurate a measure of an object, state or event is now

    as opposed to how accurate it will be in the future (predictive validity), one measure of

    concurrent validity is how comparable the results of Instrument A and Instrument B are when

    both are used to measure the same characteristics in the same object at the same point of time.

    Constant Sum Scale: The constant sum scale requires the respondent to divide a constant sum,

    generally 10 or 100, among two or more objects or attributes on order to reflect the respondents

    relative preference for each object, the importance of the attribute, or the degree to which an

    object contains each attribute.

    Construct Validity: Understanding the factors that underlie the obtained measurement. It

    involves knowing how well and why a given measure works by having a sound theory of the

    nature of the concept being measured and how it relates to other concepts.

    Depth interview: An interviewing procedure in which the interviewer does not have a

    prespecified list of questions. The interviewer is free to create questions and probe responses that

    appear relevant. Respondents are free to respond to questions in any way they think appropriate.

    Types of depth interviews include individual, mini group and focus group.

    External Validity: The ability of the results from an experiment to predict the results in the

    actual situation.

    Face Validity: A form of content validity that exists when non experts such as respondents or

    executives judge the measuring instrument as appropriate for the task at hand.

    Free Word Association: A projective technique that requires the respondent to give the first

    word or thought that comes to mind after the researcher presents a word or phrase.

    Internal Validity: The degree of replicability of an experiment or assurance that experimental

    results are due to the variables manipulated in the experiment in that specific environment.

    Interval Scale: Numbers are used to rank items such that numerically equal distances on the

    scale represent equal distances in the property being measured. The location of the zero point and

    the unit of measurement is determined by the researcher; consequently, ratios calculated on data

    from interval scales are not meaningful.

  • 8/12/2019 Attitude Measurement Techniques-unit3

    23/23

    Ordinal Scale: A rating scale in which numbers, letters, or other symbols are used to assign

    ranks to items. An ordinal scale requires the respondent to indicate if one item has more or less

    of a characteristic than another item. The magnitude of difference between the items is not

    estimated.

    Predictive Validity: The extent to which the future level of some variable can be predicted by a

    current measurement of the same or a different variable.

    Projective Technique: The technique of inferring a subjects attitudes or values based on his or

    her description of vague objects requiring interpretation. Common types used in market research

    include cartoon, picture-response, third person and sentence completion.

    Ratio Scale: A rating scale in which items are ranked so that numerically equal scale distances

    represent equal distances in the property being measured. These scales have a natural and known

    zero point.

    Reliability: The extent of variable error in a measurement. Reliability exists when repeated

    measures of the same stable characteristics in the same objects or persons show limited variation.

    Scorer Reliability: The extent of agreement among judges (scorers) working independently to

    categorize a series of objects. The higher the degree of agreement between the judges, the greater

    the reliability of the categorization

    Semantic Differential Scale: An attitude scaling device, it requires the respondent to rate the

    attitude object on a number of itemized, seven-point rating scales bounded at each end by one of

    two bipolar adjectives or phrases.

    Sentence Completion Technique: A projective technique requiring the subject to complete a

    sentence using the first phrase that comes to mind. The subject is not required to associate

    himself or herself with the response.

    Split Half Reliability: A measure of reliability in which the results form half the items on a

    multi-item measure are compared with the results for the remaining items. If there is a substantial

    variation between the groups, the reliability of the instrument is in doubt.

    Validity: The amount of systematic error in a measurement.