21
and the intercept and slope are both statistically significant. What does the regression imply about the
relationship between past performance and present performance? What values would the slope and
intercept have to take on for the future performance to be as good as the past performance, on average?
(b) Being somewhat puzzled about the results, you call your econometrics professor and describe the
results to her. She says that she is not surprised at all, since this is an example of “Galton’s Fallacy.” She
explains that Sir Francis Galton regressed the height of offspring on the mid–height of their parents and
found a positive intercept and a slope between zero and one. He referred to this result as “regression
towards mediocrity.” Why do you think econometricians refer to this result as a fallacy?
(c) Your professor continues by mentioning that this is an example of errors–in–variables bias. What does
she mean by that in general? In this case, why would batting averages be measured with error? Are
baseball statisticians sloppy?
(d) The top three performers in terms of highest batting averages in 1997 were Tony Gwynn (.372), Larry
Walker (.366), and Mike Piazza (.362). Given your answers for the previous questions, what would be
your predictions for the 1998 season?
11) Your textbook compares the results of a regression of test scores on the student–teacher ratio using a
sample of school districts from California and from Massachusetts. Before standardizing the test scores
for California, you get the following regression result:
= 698.9 – 2.28×STR
n = 420, R2 = 0.051, SER = 18.6
In addition, you are given the following information: the sample mean of the student–teacher ratio is
19.64 with a standard deviation of 1.89, and the standard deviation of the test scores is 19.05.
a. After standardizing the test scores variable and running the regression again, what is the value of the
slope? What is the meaning of this new slope here (interpret the result)?
b. What will be the new intercept? Now that test scores have been standardized, should you interpret the
intercept?
c. Does the regression R2 change between the two regressions? What about the t–statistic for the slope
estimator?