Chapter 14Basic Data Analysis
TRUE/FALSE
1. Coding is the process of assigning a numerical score or other character symbol to previously edited
data.
2. Discrete coding can be used for dichotomous responses like “yes “or “no.”
3. Coding using the rule that a male appearing in an advertisement is coded as “1,” a female is “2,” a
child is “3,” and an animal is “4,” is an example of class coding.
4. The researcher examining descriptive statistics for any particular variable is using univariate statistics.
5. The type of measurement scale used in the research study determines the possible statistical tests that
can be used appropriately with the resulting data.
6. All statistics that are appropriate to use for higher-order scales (ratio scales are the highest) are also
appropriate to use with lower-order scales (nominal scales are the lowest).
7. A planogram is a graphical way of showing the frequency distribution in which the height of a bar
corresponds to the frequency of a category.
8. Tabulation refers to the orderly arrangement of data in a table or other summary format.
9. A simple tabulation of a variable’s frequency distribution is sometimes called a marginal tabulation.
10. Cross-tabulation allows the inspection and comparison of differences among groups based on interval
or ratio measures.
11. A contingency table is a data matrix that displays the frequency of some combination of possible
responses to multiple variables.
12. The row and column totals in a contingency table are called subtotals because they are less than the
total.
13. Researchers usually are most interested in the marginals in a contingency table.
14. A 3 x 4 table represents a contingency table with twelve variables.
15. The reference base is the number of respondents or observations (in a row or column) used as a basis
for computing percentages in a cross-tabulation.
16. A common form of elaboration analysis is to do cross-tabulation of data within subgroups of the
sample under study.
17. When a third variable inserted into the analysis changes the results when two other variables were
studied previously, this third variable is called a moderator variable.
18. The process of changing data from their original form to a format that more closely fits the research
objectives of the research study is called data transformation.
19. Combining the data from adjacent categories of a Likert-scale item is a common form of data
transformation.
20. A category split means respondents below the observed median go into one category and respondents
above the median go into another.
21. When a data set is bimodal, a median split of the data will lead to error.
22. Index numbers allow researchers to track a variable’s value over time and compare a variable with
other variables.
23. Monovariate statistical analysis tests hypotheses involving only one variable.
24. Multivariate statistic analyses require five or more variables.
25. A statistical test’s significance level or p-value becomes a key indicator of whether or not a hypothesis
can be supported.
26. A significance level is a critical probability associated with a statistical hypothesis test that indicates
how likely it is that an inference supporting a difference between an observed value and some
statistical expectation is true.
27. The term p-value stands for power-value.
28. The researcher using sampling runs the risk of committing two types of errors: primary errors and
secondary errors.
29. A Type I error occurs when a condition that is true in the population is rejected based on statistical
observations.
30. The ttest is most appropriate for small samples (e.g., smaller than 30).
MULTIPLE CHOICE
1. The assignment of numbers to edited data is known as _____.
a.
editing
b.
adjusting with a plug value
c.
coding
d.
all of the above
2. Assigning males the value of zero and females the value of one in a database to record the gender of
the respondents is an example of _____.
a.
coding
b.
test tabulation
c.
editing
d.
verification
3. Codes are rules for ______ data.
a.
interpreting
b.
classifying
c.
recording
d.
all of these choices
4. Assigning a “1” or “0” code where each number represents an alternate response such as “yes” or
“no,” is an example of _____.
a.
editing
b.
dummy coding
c.
reverse coding
d.
plugging
5. A survey question asked respondent how much of the Super Bowl they watched by asking them to
check one of the four following choices: “all of it,” “most of it,” “some of it,” or “none of it.” How
many dummy variables would a researcher need to dummy code this question?
a.
one
b.
two
c.
three
d.
four
6. Which of the following is an alternative to dummy coding using the values of -1 and +1 to represent
two categories of responses?
a.
reverse coding
b.
inverse coding
c.
effects coding
d.
verified coding
7. Coding that assigns numbers to categories in an arbitrary way merely as a means of identifying some
characteristic is called _____.
a.
nominal coding
b.
descriptive coding
c.
labeling
d.
class coding
8. The transformation of raw data into a form that makes the data easier to understand and to interpret is
called _____.
a.
descriptive analysis
b.
outlier analysis
c.
computer mapping
d.
creating a box and whisker plot
9. All of the following are examples of descriptive statistics EXCEPT _____.
a.
p-value
b.
average
c.
range
d.
standard deviation
10. The researcher examining descriptive statistics for any particular variable is using which type of
statistics?
a.
multivariate
b.
interval
c.
nominal
d.
univariate
11. Which of the following is a graphical way of showing the frequency distribution in which the height of
a bar corresponds to the frequency of a category?
a.
perceptual map
b.
histogram
c.
contingency table
d.
frequency chart
12. Arranging data into a table is called _____.
a.
tabulation
b.
frequency
c.
analysis
d.
interpretation
13. Counting the number of responses to questions in a survey by hand is called _____.
a.
index analysis
b.
tallying
c.
elaboration analysis
d.
moderator analysis
14. Arranging data so that the number of times each category occurs is called a _____ table.
a.
cross-tabulation
b.
frequency
c.
percentage
d.
pre-coding
15. A researcher is creating a table to present to a client that gives the number of respondents selecting the
various product concepts of interest in the study. The client will use this information in the decision
of which product concept to pursue. This table is referred to as a _____.
a.
frequency table
b.
concept table
c.
results table
d.
decision table
16. Counting the number of responses to different parts of a question in a survey and arranging these
responses in a frequency distribution is called _____.
a.
elaboration analysis
b.
spurious analysis
c.
marginal tabulation
d.
index analysis
17. If 60 males are asked if they recognize the brand name, “Focus,” and 35 of them correctly identify the
product as a model of Ford’s product line within Ford Motor Co., the proportion of males in the
sample who recognize this brand name is approximately _____.
a.
0.60
b.
0.58
c.
0.35
d.
0.79
18. Which of the following is the appropriate technique for addressing research questions involving
relationships among multiple less-than interval variables?
a.
cross-tabulation
b.
ANOVA
c.
regression
d.
cluster analysis
19. If a business wants to know if more males or females purchase their product or a competitor’s product,
which of the following answers this question?
a.
regression analysis
b.
cluster analysis
c.
cross-tabulation
d.
interaction analysis
20. A researcher interested in a data matrix that displays the frequency of some combination of possible
responses to multiple variables should construct a _____.
a.
perceptual map
b.
contingency table
c.
regression equation
d.
marginal table
21. A researcher is examining the row and column totals in a contingency table of stores shopped at by
gender. What are these called?
a.
marginals
b.
subtotals
c.
totals
d.
running totals
22. If a researcher wants to summarize the responses of subjects by gender and awareness of a particular
brand (“Yes” or “No”), he or she would use a ______ contingency table.
a.
1 x 2
b.
2 x 2
c.
2 x 3
d.
normal
23. The number of respondents or observations (in a row or column) used as a basis for computing
percentages in a contingency table is referred to as a(n) _____.
a.
reference point
b.
moderator
c.
statistical base
d.
analytical point
24. An analysis of the basic cross-tabulation for each level of a variable not previously considered, such as
subgroups of the sample, is referred to as _____.
a.
group analysis
b.
cluster analysis
c.
ANOVA
d.
elaboration analysis
25. Breaking down the answer to the question: “Have you ever purchased a ticket online for an American
Airlines flight?” into subgroups based on gender and zip code is an example of _____.
a.
a box and whisker plot analysis
b.
an index number
c.
elaboration analysis
d.
interquartile analysis
26. When a third variable is included in the analysis that is studying the relationship between an
independent variable and a dependent variable, and this third variable changes the relationship between
the independent variable and the dependent variable in an important way, this third variable is called
a(n) _____.
a.
spurious variable
b.
moderator variable
c.
contingency variable
d.
outlier variable
27. It is hypothesized that an individual’s style of processing information (i.e., verbal or visual) will
influence the impact an advertising execution will have on attitudes toward the brand advertised.
Style of processing, then, is considered which type of variable?
a.
dependent variable
b.
external variable
c.
internal variable
d.
moderating variable
28. Another name for data transformation is _____.
a.
index analysis
b.
data conversion
c.
quadrant analysis
d.
data exchange
29. When a respondent’s answers to ten Likert-scale items are added up to form a total subtest score for
these questions, this is an example of _____.
a.
data indexing
b.
data transformation
c.
contingency analysis
d.
data indexing
30. When a researcher combines the “Strongly Disagree” and “Disagree” responses on a Likert scale item
to a single “Strongly Disagree/Disagree” percentage, this is an example of _____.
a.
data indexing
b.
collapsing the data
c.
the outlier effect
d.
a box and whisker plot
31. Dividing a data set into two categories by placing respondents below the median in one category and
respondents above the median in another is called a(n) _____.
a.
median split
b.
mediation
c.
elaboration analysis
d.
bimodal split
32. Data with which type of distribution are appropriate for division based on the median split?
a.
normally distributed
b.
unimodal distribution
c.
bimodal distribution
d.
uniform distribution
33. Scores or observations recalibrated to indicate how they relate to a base number are referred to as
_____.
a.
index numbers
b.
rank orders
c.
elaborated numbers
d.
real numbers
34. If the price of Brand A is transformed to a value of 100 and Brand B’s value is 75, indicating its price
is 25 percent lower than Brand A’s price, 100 and 75 represent _____.
a.
nominal rankings
b.
bivariate statistics
c.
p-values
d.
index numbers
35. Index numbers require which level of measurement?
a.
nominal
b.
interval
c.
ratio
d.
ranked
36. Which of the following is used to simplify and clarify data?
a.
tables
b.
graphs
c.
charts
d.
all of these choices
37. Which type of statistical analysis test hypotheses involves only one variable?
a.
primary statistical analysis
b.
bivariate statistical analysis
c.
univariate statistical analysis
d.
monovariate statistical analysis
38. Which of the following becomes a key indicator of whether or not a hypothesis can be supported?
a.
critical value
b.
significance level
c.
chi-square
d.
degrees of freedom
39. A researcher is testing a hypothesis and is examining the data to determine whether or not the observed
relationship between a salesperson’s age and sales are significantly related. Which of the following
will he use to determine the significance level of the results?
a.
b.
c.
y-value
d.
p-value
40. The two types of errors researcher run the risk of committing when sampling are _____.
a.
primary and secondary errors
b.
Type I and Type II errors
c.
critical and noncritical errors
d.
Type A and Type B errors
41. Which type of error occurs when the researcher concludes a relationship exists when in fact one does
not exist?
a.
Type I
b.
Type II
c.
Type A
d.
Type B
42. When a researcher sets an acceptable significance level a priori (), he or she is determining how
much tolerance he or she has for which type of error?
a.
Type I
b.
Type II
c.
Type A
d.
Type B
43. Failing to identify a hypothesized difference using a sample result when one really does exist in the
population is which type of error?
a.
primary
b.
secondary
c.
Type I
d.
Type II
44. When the probability of a Type II error is ______ , the probability of a Type I error is _____.
a.
reduced, increased
b.
reduced; reduced
c.
increased; increased
d.
none of these choices
45. Which test is appropriate for testing hypotheses involving an observed mean against a specified value?
a.
F-ratio
b.
µ-test
c.
t-test
d.
-test
COMPLETION
1. Assigning numbers to edited data is known as _____.
2. _____ analysis is the elementary transformation of data in a way that describes the basic characteristics
such as central tendency, distribution, and variability.
3. The orderly arrangement of data into a table is known as ______.
4. The arrangement of data into a row-and-column format that gives the number of responses for each
category of the variable is known as a(n) ______ table.
5. When tabulation is done by hand to create a frequency table, this is called ______ the data.
6. The appropriate technique for addressing research questions involving relationships among multiple
less-than interval variables is _____.
7. When the responses to two survey questions are presented as a cross-tabulation of the two variables
studied in these two questions, the result is called a(n) ______ table.
8. A two-way contingency table is referred to as a(n) _____ table.
9. The _____ is the number of respondents or observations used as a basis for computing percentages.
10. An analysis of the cross-tabulations for each level of a variable as it applies to sub-groups of the
sample is known as ______ analysis.
11. When a third variable changes in important ways the relationship between an independent variable and
a dependent variable, this third variable is called a(n) ______ variable.
12. The process of changing data from its original form to a format that more closely matches the research
objectives of the study is called data ______.
13. Researchers often perform a(n) _____ to collapse a scale with multiple response points into two
categories.
14. Scores or observations recalibrated to indicate how they relate to a base number are called _____
numbers.
15. The type of statistical analysis that deals with a hypothesis about a single variable (e.g. market share)
is called ______ statistical analysis.
16. Analyses that test hypotheses and models involving multiple (three or more) variables or sets of
variables are referred to as _____ statistical analyses.
17. The critical probability associated with a statistical hypothesis test that indicates how likely it is that an
inference supporting a difference between an observed value and some statistical expectation is true is
called _____.
18. The term _____ stands for probability-value.
19. A(n) ____ error occurs when a condition that is true in the population is rejected based on statistical
observation.
20. A univariate _____ is appropriate for testing hypotheses involving an observed mean from a small
sample (i.e., n < 30) against a specified value.
ESSAY
1. Explain what coding is and describe why and how dummy coding is done.
2. What is descriptive analysis and how is it used in marketing research?
3. Explain what a contingency table is and discuss how it is useful in marketing research.
(2)
Compute the standard error.
4. Explain what an index number is and how it is computed. What level of measurement is required to
compute index numbers?
5. Compare and contrast Type I errors and Type II errors and explain which one is of more concern to
researchers.
6. A restaurant owner is interested in knowing if the average number of visits by regular customers is
greater than three times per month. What is the appropriate test to address this research question?
Describe how this test is performed.
0.05.