Chapter 2 Simple Linear Regression

2.1 Getting started

Putting text here

2.2 Foundation

Putting text here

2.3 Inference

Putting text here

2.4 Prediction

2.5 Checking conditions

2.6 Partitioning variability

2.7 Derivation for slope and intercept

This section contains the mathematical details for deriving the least-squares estimates of the slope ( $β_{1}$ ) and intercept ( $β_{0}$ ). We obtain the estimates, ${\hat{β}}_{1}$ and ${\hat{β}}_{0}$ , by finding the values that minimize the sum of squared residuals (SSR).

$S S R = \sum_{i = 1}^{n} [y_{i} - {\hat{y}}_{i}]^{2} = \sum_{i = 1}^{n} [y_{i} - ({\hat{β}}_{0} + {\hat{β}}_{1} x_{i})]^{2} = \sum_{i = 1}^{n} [y_{i} - {\hat{β}}_{0} - {\hat{β}}_{1} x_{i}]^{2}$

Recall that we can find the values of ${\hat{β}}_{1}$ and ${\hat{β}}_{0}$ that minimize the SSR by taking its partial derivatives with respect to ${\hat{β}}_{1}$ and ${\hat{β}}_{0}$ and setting them to 0. Because the SSR is a convex quadratic function of ${\hat{β}}_{0}$ and ${\hat{β}}_{1}$ , the values at which both partial derivatives equal 0 minimize the sum of squared residuals. The partial derivatives are

$\begin{aligned} \frac{\partial SSR}{\partial {\hat{β}}_{1}} & = - 2 \sum_{i = 1}^{n} x_{i} (y_{i} - {\hat{β}}_{0} - {\hat{β}}_{1} x_{i}) \\ \frac{\partial SSR}{\partial {\hat{β}}_{0}} & = - 2 \sum_{i = 1}^{n} (y_{i} - {\hat{β}}_{0} - {\hat{β}}_{1} x_{i}) \end{aligned}$
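
As a quick sanity check on these two formulas, the sketch below compares the analytic partial derivatives with central finite-difference approximations of the SSR. The data values and the function names (`ssr`, `ssr_gradient`, `numeric_gradient`) are made up for illustration and do not come from the text.

```python
# Illustrative data (hypothetical; not from the text).
x = [1.0, 2.0, 3.0, 4.0, 5.0]
y = [2.1, 3.9, 6.2, 7.8, 10.1]


def ssr(b0, b1):
    """Sum of squared residuals for intercept b0 and slope b1."""
    return sum((yi - b0 - b1 * xi) ** 2 for xi, yi in zip(x, y))


def ssr_gradient(b0, b1):
    """Analytic partial derivatives of the SSR with respect to (b0, b1)."""
    d_b0 = -2 * sum(yi - b0 - b1 * xi for xi, yi in zip(x, y))
    d_b1 = -2 * sum(xi * (yi - b0 - b1 * xi) for xi, yi in zip(x, y))
    return d_b0, d_b1


def numeric_gradient(b0, b1, h=1e-5):
    """Central finite-difference approximation of the same derivatives."""
    d_b0 = (ssr(b0 + h, b1) - ssr(b0 - h, b1)) / (2 * h)
    d_b1 = (ssr(b0, b1 + h) - ssr(b0, b1 - h)) / (2 * h)
    return d_b0, d_b1


# Compare the two at an arbitrary trial point (b0 = 0.5, b1 = 1.5).
analytic = ssr_gradient(0.5, 1.5)
numeric = numeric_gradient(0.5, 1.5)
print("analytic:", analytic)
print("numeric: ", numeric)
```

The two printed pairs should agree up to floating-point rounding, since the SSR is quadratic in both coefficients.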

Let’s begin by deriving ${\hat{β}}_{0}$ .

$\begin{aligned} \frac{\partial SSR}{\partial {\hat{β}}_{0}} & = - 2 \sum_{i = 1}^{n} (y_{i} - {\hat{β}}_{0} - {\hat{β}}_{1} x_{i}) = 0 \\ & \Rightarrow - \sum_{i = 1}^{n} (y_{i} - {\hat{β}}_{0} - {\hat{β}}_{1} x_{i}) = 0 \\ & \Rightarrow - \sum_{i = 1}^{n} y_{i} + n {\hat{β}}_{0} + {\hat{β}}_{1} \sum_{i = 1}^{n} x_{i} = 0 \\ & \Rightarrow n {\hat{β}}_{0} = \sum_{i = 1}^{n} y_{i} - {\hat{β}}_{1} \sum_{i = 1}^{n} x_{i} \\ & \Rightarrow {\hat{β}}_{0} = \frac{1}{n} \left( \sum_{i = 1}^{n} y_{i} - {\hat{β}}_{1} \sum_{i = 1}^{n} x_{i} \right) \\ & \Rightarrow {\hat{β}}_{0} = \bar{y} - {\hat{β}}_{1} \bar{x} \end{aligned}$

Now, we can derive ${\hat{β}}_{1}$ using the ${\hat{β}}_{0}$ we just derived.

$\begin{aligned} \frac{\partial SSR}{\partial {\hat{β}}_{1}} & = - 2 \sum_{i = 1}^{n} x_{i} (y_{i} - {\hat{β}}_{0} - {\hat{β}}_{1} x_{i}) = 0 \\ & \Rightarrow - \sum_{i = 1}^{n} x_{i} y_{i} + {\hat{β}}_{0} \sum_{i = 1}^{n} x_{i} + {\hat{β}}_{1} \sum_{i = 1}^{n} x_{i}^{2} = 0 \\ & \Rightarrow - \sum_{i = 1}^{n} x_{i} y_{i} + (\bar{y} - {\hat{β}}_{1} \bar{x}) \sum_{i = 1}^{n} x_{i} + {\hat{β}}_{1} \sum_{i = 1}^{n} x_{i}^{2} = 0 \quad \text{(fill in } {\hat{β}}_{0} \text{)} \\ & \Rightarrow (\bar{y} - {\hat{β}}_{1} \bar{x}) \sum_{i = 1}^{n} x_{i} + {\hat{β}}_{1} \sum_{i = 1}^{n} x_{i}^{2} = \sum_{i = 1}^{n} x_{i} y_{i} \\ & \Rightarrow \bar{y} \sum_{i = 1}^{n} x_{i} - {\hat{β}}_{1} \bar{x} \sum_{i = 1}^{n} x_{i} + {\hat{β}}_{1} \sum_{i = 1}^{n} x_{i}^{2} = \sum_{i = 1}^{n} x_{i} y_{i} \\ & \Rightarrow n \bar{y} \bar{x} - {\hat{β}}_{1} n {\bar{x}}^{2} + {\hat{β}}_{1} \sum_{i = 1}^{n} x_{i}^{2} = \sum_{i = 1}^{n} x_{i} y_{i} \\ & \Rightarrow {\hat{β}}_{1} \sum_{i = 1}^{n} x_{i}^{2} - {\hat{β}}_{1} n {\bar{x}}^{2} = \sum_{i = 1}^{n} x_{i} y_{i} - n \bar{y} \bar{x} \\ & \Rightarrow {\hat{β}}_{1} \left( \sum_{i = 1}^{n} x_{i}^{2} - n {\bar{x}}^{2} \right) = \sum_{i = 1}^{n} x_{i} y_{i} - n \bar{y} \bar{x} \\ & \Rightarrow {\hat{β}}_{1} = \frac{\sum_{i = 1}^{n} x_{i} y_{i} - n \bar{y} \bar{x}}{\sum_{i = 1}^{n} x_{i}^{2} - n {\bar{x}}^{2}} \end{aligned}$
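
The closed-form results for ${\hat{β}}_{1}$ and ${\hat{β}}_{0}$ can be evaluated directly from the sums in the derivation. The following is a minimal sketch on a small made-up data set; the data and variable names are illustrative assumptions, not part of the text. It also checks that both partial derivatives vanish at the estimates, that is, that the residuals and the $x$-weighted residuals each sum to (numerically) zero.

```python
# Illustrative data (hypothetical; not from the text).
x = [1.0, 2.0, 3.0, 4.0, 5.0]
y = [2.1, 3.9, 6.2, 7.8, 10.1]

n = len(x)
x_bar = sum(x) / n
y_bar = sum(y) / n

# Sums that appear in the final expression for the slope.
sum_xy = sum(xi * yi for xi, yi in zip(x, y))
sum_xx = sum(xi ** 2 for xi in x)

# Slope, then intercept, exactly as in the derivation.
b1 = (sum_xy - n * y_bar * x_bar) / (sum_xx - n * x_bar ** 2)
b0 = y_bar - b1 * x_bar

# Both partial derivatives are zero at (b0, b1) exactly when these sums are zero.
residuals = [yi - b0 - b1 * xi for xi, yi in zip(x, y)]
sum_resid = sum(residuals)
sum_x_resid = sum(xi * ri for xi, ri in zip(x, residuals))

print("intercept:", b0)
print("slope:    ", b1)
print("sum of residuals:    ", sum_resid)
print("sum of x * residuals:", sum_x_resid)
```

For this data set the slope is approximately 1.99 and the intercept approximately 0.05, with both residual sums equal to zero up to rounding error.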

To write ${\hat{β}}_{1}$ in a form that’s more recognizable, we will use the following:

$\sum_{i = 1}^{n} x_{i} y_{i} - n \bar{y} \bar{x} = \sum_{i = 1}^{n} (x_{i} - \bar{x}) (y_{i} - \bar{y}) = (n - 1) Cov (x, y)$

$\sum_{i = 1}^{n} x_{i}^{2} - n {\bar{x}}^{2} = \sum_{i = 1}^{n} (x_{i} - \bar{x})^{2} = (n - 1) s_{x}^{2}$

where $Cov (x, y)$ is the covariance of $x$ and $y$ , and $s_{x}^{2}$ is the sample variance of $x$ ( $s_{x}$ is the sample standard deviation).
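
Both identities are easy to confirm numerically. The sketch below does so on a small made-up data set (the data and variable names are illustrative assumptions), using `statistics.variance` from the Python standard library for the sample variance $s_{x}^{2}$ . The last equality in the first identity is just the definition of the sample covariance, so only its first equality is checked.

```python
import statistics

# Illustrative data (hypothetical; not from the text).
x = [1.0, 2.0, 3.0, 4.0, 5.0]
y = [2.1, 3.9, 6.2, 7.8, 10.1]

n = len(x)
x_bar = sum(x) / n
y_bar = sum(y) / n

# First identity: raw cross-product form vs. centered form.
raw_xy = sum(xi * yi for xi, yi in zip(x, y)) - n * y_bar * x_bar
centered_xy = sum((xi - x_bar) * (yi - y_bar) for xi, yi in zip(x, y))

# Second identity: raw form vs. centered form vs. (n - 1) * sample variance.
raw_xx = sum(xi ** 2 for xi in x) - n * x_bar ** 2
centered_xx = sum((xi - x_bar) ** 2 for xi in x)
scaled_var = (n - 1) * statistics.variance(x)

print(raw_xy, centered_xy)
print(raw_xx, centered_xx, scaled_var)
```

Each printed line should show the same value repeated, up to floating-point rounding.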

Thus, applying the two identities above, we have

$\begin{aligned} {\hat{β}}_{1} & = \frac{\sum_{i = 1}^{n} x_{i} y_{i} - n \bar{y} \bar{x}}{\sum_{i = 1}^{n} x_{i}^{2} - n {\bar{x}}^{2}} \\ & = \frac{\sum_{i = 1}^{n} (x_{i} - \bar{x}) (y_{i} - \bar{y})}{\sum_{i = 1}^{n} (x_{i} - \bar{x})^{2}} \\ & = \frac{(n - 1) Cov (x, y)}{(n - 1) s_{x}^{2}} \\ & = \frac{Cov (x, y)}{s_{x}^{2}} \end{aligned}$

The correlation between $x$ and $y$ is $r = \frac{Cov (x, y)}{s_{x} s_{y}}$ . Thus, $Cov (x, y) = r s_{x} s_{y}$ . Plugging this into the expression for ${\hat{β}}_{1}$ above, we have

${\hat{β}}_{1} = \frac{Cov (x, y)}{s_{x}^{2}} = r \frac{s_{y} s_{x}}{s_{x}^{2}} = r \frac{s_{y}}{s_{x}}$
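
To illustrate that the covariance form and the correlation form give the same slope, here is a minimal sketch on a small made-up data set. The data and variable names are illustrative assumptions; the sample standard deviations come from `statistics.stdev` in the Python standard library, and the covariance is computed from its definition.

```python
import statistics

# Illustrative data (hypothetical; not from the text).
x = [1.0, 2.0, 3.0, 4.0, 5.0]
y = [2.1, 3.9, 6.2, 7.8, 10.1]

n = len(x)
x_bar = sum(x) / n
y_bar = sum(y) / n

# Sample standard deviations (n - 1 in the denominator).
s_x = statistics.stdev(x)
s_y = statistics.stdev(y)

# Sample covariance from its definition, then the correlation r.
cov_xy = sum((xi - x_bar) * (yi - y_bar) for xi, yi in zip(x, y)) / (n - 1)
r = cov_xy / (s_x * s_y)

# Two equivalent expressions for the slope estimate.
slope_from_cov = cov_xy / s_x ** 2
slope_from_r = r * s_y / s_x

print("Cov(x, y) / s_x^2:", slope_from_cov)
print("r * s_y / s_x:    ", slope_from_r)
```

Both expressions evaluate to approximately 1.99 for this data set.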