MicrosoftВ® Office ExcelВ® 2007: Data Analysis and Business Modeling (Bpg -- Other)

What is the best prediction equation?   We find in the Coefficients column (column B of the summary output) that the best equation of Form 1 that can be used to predict monthly cost is

Predicted monthly cost=35,102.90+2.07(AMade)+4.18(BMade)+4.79(CMade)

A natural question is, which of our independent variables are useful for predicting monthly cost? After all, if we had chosen the number of games won by the Seattle Mariners during a one month period as an independent variable, we would expect that this variable would have little effect on predicting monthly operating cost. When you run a regression, each independent variable has a p-value between 0 and 1. Any independent variable with a p-value (see column E) of less than or equal to 0.15 is considered to be useful for predicting the dependent variable. Thus, the smaller the p-value, the higher the predictive power of the independent variable. Our three independent variables have p-values of 0.23 (for A Made), 0.025 (for B Made), and 0.017 (for C Made). These p-values may be interpreted as follows:

Our p-values indicate that A Made does not add much predictive power to B Made and C Made, which means that if we know B Made and C Made, we can predict monthly operating cost about as well as we can if we include A Made as an independent variable. Therefore, we can opt to delete A Made as an independent variable and use just B Made and C Made for our prediction. I copied our data to the worksheet titled A Removed and deleted the A Made column (column C). I then adjusted the Input X range to be C3:D22. In the worksheet titled NoA, you can see the regression output shown in Figure 47-5 and Figure 47-6.

Figure 47-5: Multiple regression output with A Made data removed as an independent variable

Figure 47-6: Residual output calculated when A Made data is removed as an independent variable

We see that A Made and B Made each have very low p-values (0.002 and 0.007, respectively). These values indicate that both these independent variables have useful predictive power. Using the new coefficients in column B, we can now predict monthly operating cost using the equation

Predicted monthly operating cost=35,475.3+5.32(BMade)+5.42(CMade)

Категории