3) Consider the following panel data regression with a single explanatory variable
Yit = β0 + β1Xit + uit.
In each of the examples below, you will be adding entity and time fixed effects. Indicate the total number
of coefficients that need to be estimated.
(a) The effect of beer taxes on the fatality rate, annual data, 1982–1988, nine U.S. regions (New England,
Pacific, Mid–Atlantic, East North Central, etc.).
(b) The effect of the minimum wage on teenage employment, annual data, 1963–2000, five Canadian
Regions (Atlantic Provinces, Quebec, Ontario, Prairies, British Columbia).
(c) The effect of savings rates on per capita income, data for three decades (1960–1969, 1970–1979, 1980–
1989; one observation per decade), 104 countries of the world.
(d) The effect of pitching quality in baseball (as measured by the Team ERA) on the winning percentage,
annual data, 1998–1999 season, 1999–2000 season, 30 teams.
4) Your textbook modifies the four assumptions for the multiple regression model by adding a new
assumption. This represents an extension of the cross–sectional data case, where errors are uncorrelated
across entities. The new assumption requires the errors to be uncorrelated across time, conditional on the
regressors as well (cov(uit, uis Xit, Xis) = 0 for t ≠ s.).
(a) Discuss why there might be correlation over time in the errors when you use U.S. state panel data.
Does this mean that you should not use OLS as an estimator?
(b) Now consider pairs of adjacent states such as Indiana and Michigan, Texas and Arkansas, New York
and Connecticut, etc. Is it likely that the fifth assumption will hold here, even though the
“contemporaneous” errors are correlated? If not, can you still use OLS for estimation?