Applied Statistics and Probability for Engineers, 7th edition 2017
12.2.2 Consider the linear regression model from Exercise 12.1.2. Is the second-order term necessary in the regression model?
Computer output for the model with ratio and ratio squared is shown below. The P-value for the test of whether the
coefficient of ratio squared equals zero is 0.309 > 0.05. Therefore, there is not sufficient evidence that the ratio
squared variable is useful to the model.
The regression equation is
viscosity = 0.198 + 1.37 ratio − 1.28 ratio2
Predictor Coef SE Coef T P
Constant 0.1979 0.4466 0.44 0.676
ratio 1.367 1.488 0.92 0.400
ratio2 −1.280 1.131 −1.13 0.309
S = 0.146606 R−Sq = 37.5% R−Sq(adj) = 12.5%
Analysis of Variance
12.2.3 Consider the regression model of Exercise 12.1.3 attempting to predict the percent of engineers in the workforce from
various spending variables.
(a) Are any of the variables useful for prediction? (Test an appropriate hypothesis).
(b) What percent of the variation in the percent of engineers is accounted for by the model?
(c) What might you do next to create a better model?
12.2.4 You have fit a regression model with two regressors to a data set that has 20 observations. The total sum of squares is
1000 and the model sum of squares is 750.
(a) What is the value of R2 for this model?
(b) What is the adjusted R2 for this model?
(c) What is the value of the F-statistic for testing the significance of regression? What conclusions would you draw
about this model if
= 0.05? What if
= 0.01?
(d) Suppose that you add a third regressor to the model and as a result, the model sum of squares is now 785. Does it
seem to you that adding this factor has improved the model?