Section 7 Case Study - Marketing Plan
7.1 Data Overview
The Advertising
data set consists of the sales of that product in 200 different markets, along with advertising budgets for the product in each of those markets for three different media: TV
, radio
, and newspaper
. The data are displayed in Figure 7.1.
The plot displays sales, in thousands of units, as a function of TV
, radio
, and newspaper
budgets, in thousands of dollars, for 200 different markets. In each plot we show the simple least squares fit of sales to that variable, as described in previous section. In other words, each blue line represents a simple model that can be used to predict sales using TV
, radio
, and newspaper
, respectively.
7.2 Important Questions
Suppose that in our role as statistical consultants we are asked to suggest, on the basis of this data, a marketing plan for next year that will result in high product sales. Here are a few important questions that we might seek to address:
- Is there a relationship between advertising sales and budget?
This question can be answered by fitting a multiple regression model (7.1) of sales onto TV
, radio
, and newspaper
, as:
\[\begin{equation} \Large sales = \beta_0 + \beta_1 × TV + \beta_2 × radio + \beta_3 × newspaper + \epsilon. \tag{7.1} \end{equation}\]
and testing the hypothesis \[ H_0: \beta_{TV} = \beta_{radio} = \beta_{newspaper} = 0 \]
F-statistic can be used to determine whether or not we should reject this null hypothesis. In this case, the p-value corresponding to the F-statistic in Table 6.1 is very low, indicating clear evidence of a relationship between advertising and sales.
Quantity | Value |
---|---|
Residual standard error | 1.69 |
\(R^2\) | 0.897 |
F-statistic | 570 |
2.How strong is the relationship?
First, the RSE estimates the standard deviation of the response from the
population regression line. For the Advertising
data, the RSE is 1,681 units while the mean value for the response is 14,022, indicating a percentage error of roughly 12%. Second, the \(R^2\) statistic records the percentage of variability in the response that is explained by the predictors. The predictors explain almost 90% of the variance in sales. The RSE and \(R^2\) statistics are displayed in Figure 6.1.
- Which media contribute to sales?
To answer this question, we can examine the p-values associated with each predictor’s t-statistic. In the multiple linear regression displayed in Figure 6.3, the p-values for TV
and radio
are low, but the p-value for newspaper
is not. This suggests that only TV
and radio
are related to sales.
Coefficient | Std. error | t-statistic | p-value | |
---|---|---|---|---|
Intercept |
2.939 | 0.3119 | 9.42 | < 0.0001 |
TV |
0.046 | 0.0014 | 32.81 | < 0.0001 |
radio |
0.189 | 0.0086 | 21.89 | < 0.0001 |
newspaper |
-0.001 | 0.0059 | -0.18 | 0.8599 |
- How large is the effect of each medium on sales?
The standard error of \(\hat{\beta_j}\) can be used to construct confidence intervals for \(\beta_j\). For the Advertising
data, the 95% confidence intervals are as follows: (0.043, 0.049) for TV
, (0.172, 0.206) for radio
, and (−0.013, 0.011) for newspaper
.
The confidence intervals for TV
and radio
are narrow and far from zero, providing evidence that these media are related to sales. But the interval for newspaper
includes zero, indicating that the variable is not statistically significant given the values of TV
and radio
.
We saw in previous section that collinearity can result in very wide standard errors.
Could collinearity be the reason that the confidence interval associated with newspaper is so wide?
The VIF scores are 1.005, 1.145, and 1.145 for TV
,radio
, and newspaper
, suggesting no evidence of collinearity.
In order to assess the association of each medium individually on sales, we can perform three separate simple linear regressions. Results are shown in Table 6.3, 6.4 and 6.5.
Coefficient | Std. error | t-statistic | p-value | |
---|---|---|---|---|
Intercept | 7.0325 | 0.4578 | 15.36 | <0.0001 |
TV |
0.0475 | 0.0027 | 17.67 | <0.0001 |
Coefficients of the least squares model for the regression of number of units sold on TV
advertising budget. An increase of $1,000 in the TV
advertising budget is associated with an increase in sales by around 50 units.
Coefficient | Std. error | t-statistic | p-value | |
---|---|---|---|---|
Intercept | 9.312 | 0.563 | 16.54 | <0.0001 |
radio |
0.203 | 0.02 | 9.92 | <0.0001 |
Coefficient | Std. error | t-statistic | p-value | |
---|---|---|---|---|
Intercept | 12.351 | 0.621 | 19.88 | <0.0001 |
newspaper |
0.055 | 0.017 | 3.30 | 0.00115 |
Coefficients of the simple linear regression model for number of units sold on:
Top:
radio
advertising budget.Bottom:
newspaper
advertising budget.
A $1,000 increase in spending on radio
advertising is associated with an average increase in sales by around 203 units, while the same increase in spending on newspaper
advertising is associated with an average increase in sales by around 55 units.
There is evidence of an extremely strong association between TV
and sales
and between radio
and sales
. There is evidence of a mild association between newspaper
and sales
, when the values of TV
and radio
are ignored.
- How accurately can we predict future sales?
The response can be predicted using (7.2):
\[\begin{equation} \Large y = \hat{\beta_0} + \hat{\beta_1}x_1 + \hat{\beta_2}x_2 + · · · + \hat{\beta_0}x_p. \tag{7.2} \end{equation}\]
The accuracy associated with this estimate depends on whether we wish to predict an individual response, \(Y = f(X) + \epsilon\), or the average response, \(f(X)\). If the former, we use a prediction interval, and if the latter, we use a confidence interval. Prediction intervals will always be wider than confidence intervals because they account for the uncertainty associated with \(\epsilon\), the irreducible error.
- Is the relationship linear?
In Considerations section, we saw that residual plots can be used in order to identify non-linearity. If the relationships are linear, then the residual plots should display no pattern. In the case of the Advertising
data, we observe a non-linear effect in Figure 7.2, though this effect could also be observed in a residual plot. We also discussed the inclusion of transformations of the predictors in the linear regression model in order to accommodate non-linear relationships.
From the pattern of the residuals, we can see that there is a pronounced non-linear relationship in the data. The positive residuals (those visible above the surface), tend to lie along the 45-degree line, where TV
and radio
budgets are split evenly. The negative residuals (most not visible), tend to lie away from this line, where budgets are more lopsided.
- Is there synergy among the advertising media?
The standard linear regression model assumes an additive relationship between the predictors and the response. An additive model is easy to interpret because the effect of each predictor on the response is unrelated to the values of the other predictors. However, the additive assumption may be unrealistic for certain data sets. A small p-value associated with the interaction term indicates the presence of such relationships.
Figure 7.2 suggested that the Advertising
data may not be additive. Including an interaction term in the model results in a substantial increase in \(R^2\), from around 90% to almost 97%.