Exhibit 21.2 illustrates an important property of p-values – as the observed value gets
further from the benchmark, the p-value gets smaller, meaning that the chance of the
mean actually equaling the benchmark is smaller.
In discussing confidence intervals, statisticians use the term confidence level, or
confidence coefficient, to refer to the level of probability associated with an interval
estimate.
However, when discussing hypothesis testing, the terminology is changed to a
significance level, α (the Greek letter alpha).
An Example of Hypothesis Testing
The example illustrates the conventional statistical approach to testing a univariate
hypothesis with an interval or ratio variable.
The Pizza-In restaurant is concerned about its store image, one aspect of which is the
friendliness of the service.
A sample of 225 customers is asked to indicate their perception of the service on a
5-point scale, where 1 indicates “very unfriendly” and 5 indicates “very friendly” service.
Suppose Pizza-In believes the service has to be different from 3.0 before they can make a
decision about expansion.
The researcher formulates the null hypothesis that the population mean is equal to 3.0:
H0 :
= 3.0
The alternative hypothesis is that the mean does not equal 3.0:
H1 :
3.0
More practically, the researcher is likely to write the substantive hypothesis (as it would
be stated in a research report or proposal) as:
H1: Customer perceptions of friendly service are significantly greater than three.
The substantive hypothesis matches the “alternative” phrasing, and in practical terms,
is the only hypothesis formally stated.
Next, the researcher must decide on a significance level, which corresponds to a region of
rejection on a normal sampling distribution shown in Exhibit 21.1.
The shaded area shows the region of rejection when
= .025 in each tail of the curve.
The values within the unshaded area are called acceptable at the 95 percent confidence
level (or 5 percent significance level, or 0.05 alpha level), and if we find that our sample
mean lies within this region of acceptance, we conclude that the means are not different
from the expected value
More precisely, we fail to reject the null hypothesis.
In other words, the range of acceptance:
1. identifies those acceptable values that reflect a difference from the hypothesized
mean in the null hypothesis and
2. shows the range within which any difference is so minuscule that we would conclude
that this difference was due to random sampling error rather than to a false null
hypothesis.
The Pizza-In restaurant hired research consultants who collected a sample of 225
interviews.
The mean friendliness score on the 5-point scale was 3.78.