Section 7 Case Study - Marketing Plan

7.1 Data Overview

The Advertising data set consists of the sales of that product in 200 different markets, along with advertising budgets for the product in each of those markets for three different media: TV, radio, and newspaper. The data are displayed in Figure 7.1.

The plot displays sales, in thousands of units, as a function of TV, radio, and newspaper budgets, in thousands of dollars, for 200 different markets. In each plot we show the simple least squares fit of sales to that variable, as described in previous section. In other words, each blue line represents a simple model that can be used to predict sales using TV, radio, and newspaper, respectively.

Figure 7.1: the Advertising data set

7.2 Important Questions

Suppose that in our role as statistical consultants we are asked to suggest, on the basis of this data, a marketing plan for next year that will result in high product sales. Here are a few important questions that we might seek to address:

Is there a relationship between advertising sales and budget?

This question can be answered by fitting a multiple regression model (7.1) of sales onto TV, radio, and newspaper, as:

\[\begin{equation} \Large sales = \beta_0 + \beta_1 × TV + \beta_2 × radio + \beta_3 × newspaper + \epsilon. \tag{7.1} \end{equation}\]

and testing the hypothesis \[ H_0: \beta_{TV} = \beta_{radio} = \beta_{newspaper} = 0 \]

F-statistic can be used to determine whether or not we should reject this null hypothesis. In this case, the p-value corresponding to the F-statistic in Table 6.1 is very low, indicating clear evidence of a relationship between advertising and sales.

**Table 6.1:** More information about the least squares model for the regression of number of units sold on TV, newspaper, and radio advertising budgets in the Advertising data
Quantity	Value
Residual standard error	1.69
$R^2$	0.897
F-statistic	570

2.How strong is the relationship?

First, the RSE estimates the standard deviation of the response from the population regression line. For the Advertising data, the RSE is 1,681 units while the mean value for the response is 14,022, indicating a percentage error of roughly 12%. Second, the $R^2$ statistic records the percentage of variability in the response that is explained by the predictors. The predictors explain almost 90% of the variance in sales. The RSE and $R^2$ statistics are displayed in Figure 6.1.

Which media contribute to sales?

To answer this question, we can examine the p-values associated with each predictor’s t-statistic. In the multiple linear regression displayed in Figure 6.3, the p-values for TV and radio are low, but the p-value for newspaper is not. This suggests that only TV and radio are related to sales.

**Table 6.2.** More information about the least squares model for the regression of number of units sold on TV, newspaper, and radio advertising budgets in the Advertising data
	Coefficient	Std. error	t-statistic	p-value
`Intercept`	2.939	0.3119	9.42	< 0.0001
`TV`	0.046	0.0014	32.81	< 0.0001
`radio`	0.189	0.0086	21.89	< 0.0001
`newspaper`	-0.001	0.0059	-0.18	0.8599

How large is the effect of each medium on sales?

The standard error of $\hat{\beta_j}$ can be used to construct confidence intervals for $\beta_j$. For the Advertising data, the 95% confidence intervals are as follows: (0.043, 0.049) for TV, (0.172, 0.206) for radio, and (−0.013, 0.011) for newspaper.

The confidence intervals for TV and radio are narrow and far from zero, providing evidence that these media are related to sales. But the interval for newspaper includes zero, indicating that the variable is not statistically significant given the values of TV and radio.

We saw in previous section that collinearity can result in very wide standard errors.

Could collinearity be the reason that the confidence interval associated with newspaper is so wide?

The VIF scores are 1.005, 1.145, and 1.145 for TV,radio, and newspaper, suggesting no evidence of collinearity.

In order to assess the association of each medium individually on sales, we can perform three separate simple linear regressions. Results are shown in Table 6.3, 6.4 and 6.5.

**Table 6.3**. Simple regression of sales on TV
	Coefficient	Std. error	t-statistic	p-value
Intercept	7.0325	0.4578	15.36	<0.0001
`TV`	0.0475	0.0027	17.67	<0.0001

Coefficients of the least squares model for the regression of number of units sold on TV advertising budget. An increase of $1,000 in the TV advertising budget is associated with an increase in sales by around 50 units.

**Table 6.4**. Simple regression of sales on radio
	Coefficient	Std. error	t-statistic	p-value
Intercept	9.312	0.563	16.54	<0.0001
`radio`	0.203	0.02	9.92	<0.0001

**Table 6.5**. Simple regression of sales on newspaper
	Coefficient	Std. error	t-statistic	p-value
Intercept	12.351	0.621	19.88	<0.0001
`newspaper`	0.055	0.017	3.30	0.00115

Coefficients of the simple linear regression model for number of units sold on:

Top: radio advertising budget.
Bottom: newspaper advertising budget.

A $1,000 increase in spending on radio advertising is associated with an average increase in sales by around 203 units, while the same increase in spending on newspaper advertising is associated with an average increase in sales by around 55 units.

There is evidence of an extremely strong association between TV and sales and between radio and sales. There is evidence of a mild association between newspaper and sales, when the values of TV and radio are ignored.

How accurately can we predict future sales?

The response can be predicted using (7.2):

\[\begin{equation} \Large y = \hat{\beta_0} + \hat{\beta_1}x_1 + \hat{\beta_2}x_2 + · · · + \hat{\beta_0}x_p. \tag{7.2} \end{equation}\]

The accuracy associated with this estimate depends on whether we wish to predict an individual response, $Y = f(X) + \epsilon$, or the average response, $f(X)$. If the former, we use a prediction interval, and if the latter, we use a confidence interval. Prediction intervals will always be wider than confidence intervals because they account for the uncertainty associated with $\epsilon$, the irreducible error.

Is the relationship linear?

In Considerations section, we saw that residual plots can be used in order to identify non-linearity. If the relationships are linear, then the residual plots should display no pattern. In the case of the Advertising data, we observe a non-linear effect in Figure 7.2, though this effect could also be observed in a residual plot. We also discussed the inclusion of transformations of the predictors in the linear regression model in order to accommodate non-linear relationships.

Figure 7.2: A linear regression fit to sales using TV and radio as predictors

From the pattern of the residuals, we can see that there is a pronounced non-linear relationship in the data. The positive residuals (those visible above the surface), tend to lie along the 45-degree line, where TV and radio budgets are split evenly. The negative residuals (most not visible), tend to lie away from this line, where budgets are more lopsided.

Is there synergy among the advertising media?

The standard linear regression model assumes an additive relationship between the predictors and the response. An additive model is easy to interpret because the effect of each predictor on the response is unrelated to the values of the other predictors. However, the additive assumption may be unrealistic for certain data sets. A small p-value associated with the interaction term indicates the presence of such relationships.

Figure 7.2 suggested that the Advertising data may not be additive. Including an interaction term in the model results in a substantial increase in $R^2$, from around 90% to almost 97%.