
Multiple Linear Regression

Multiple linear regression extends simple regression to model the response as a linear function of several predictors, using matrix algebra for a compact and general formulation.


The Matrix Formulation

Definition

The multiple linear regression model is $\mathbf{Y} = \mathbf{X}\boldsymbol{\beta} + \boldsymbol{\epsilon}$, where $\mathbf{Y} = (Y_1, \ldots, Y_n)^T$ is the $n \times 1$ response vector, $\mathbf{X}$ is the $n \times p$ design matrix (with rows $(1, x_{i1}, \ldots, x_{i,p-1})$), $\boldsymbol{\beta} = (\beta_0, \beta_1, \ldots, \beta_{p-1})^T$ is the $p \times 1$ parameter vector, and $\boldsymbol{\epsilon} \sim N(\mathbf{0}, \sigma^2 \mathbf{I}_n)$.
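As a concrete sketch of the setup above, the following builds a design matrix for hypothetical data with two predictors (the variable names x1, x2 are illustrative, not from the notes): the first column of ones carries the intercept $\beta_0$, giving $p = 3$ columns in total.

```python
import numpy as np

# Hypothetical data: n = 5 observations of two predictors
x1 = np.array([1.0, 2.0, 3.0, 4.0, 5.0])
x2 = np.array([2.0, 1.0, 4.0, 3.0, 5.0])

# Design matrix X: a column of ones for the intercept beta_0,
# then one column per predictor -> shape (n, p) with p = 3
X = np.column_stack([np.ones_like(x1), x1, x2])
```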

Definition

The OLS estimator minimizes $\|\mathbf{Y} - \mathbf{X}\boldsymbol{\beta}\|^2$ and is given by $\hat{\boldsymbol{\beta}} = (\mathbf{X}^T\mathbf{X})^{-1}\mathbf{X}^T\mathbf{Y}$, provided $\mathbf{X}^T\mathbf{X}$ is invertible (i.e., $\mathbf{X}$ has full column rank $p$). The fitted values are $\hat{\mathbf{Y}} = \mathbf{X}\hat{\boldsymbol{\beta}} = \mathbf{H}\mathbf{Y}$, where $\mathbf{H} = \mathbf{X}(\mathbf{X}^T\mathbf{X})^{-1}\mathbf{X}^T$ is the hat matrix.
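A minimal numerical check of these formulas on simulated data (the true coefficient values and noise level are arbitrary choices for illustration). Solving the normal equations $\mathbf{X}^T\mathbf{X}\hat{\boldsymbol{\beta}} = \mathbf{X}^T\mathbf{Y}$ with a linear solver is numerically preferable to forming the inverse explicitly:

```python
import numpy as np

rng = np.random.default_rng(0)
n, p = 50, 3
X = np.column_stack([np.ones(n), rng.normal(size=(n, 2))])  # full column rank
beta = np.array([1.0, 2.0, -0.5])                           # arbitrary true values
Y = X @ beta + rng.normal(scale=0.1, size=n)

# OLS estimate: solve (X^T X) beta_hat = X^T Y
beta_hat = np.linalg.solve(X.T @ X, X.T @ Y)

# Hat matrix H = X (X^T X)^{-1} X^T and fitted values Y_hat = H Y
H = X @ np.linalg.solve(X.T @ X, X.T)
Y_hat = H @ Y
```

The hat matrix is the orthogonal projector onto the column space of $\mathbf{X}$: it is symmetric, idempotent ($\mathbf{H}\mathbf{H} = \mathbf{H}$), and has trace equal to $p$.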


Properties

Example: Distribution of the OLS estimator

Under the normal model $\boldsymbol{\epsilon} \sim N(\mathbf{0}, \sigma^2\mathbf{I})$:

$\hat{\boldsymbol{\beta}} \sim N(\boldsymbol{\beta}, \sigma^2(\mathbf{X}^T\mathbf{X})^{-1})$

$\frac{(n-p)\hat{\sigma}^2}{\sigma^2} = \frac{\mathbf{e}^T\mathbf{e}}{\sigma^2} \sim \chi^2_{n-p}$

where $\mathbf{e} = \mathbf{Y} - \hat{\mathbf{Y}}$ is the residual vector and $\hat{\sigma}^2 = \mathbf{e}^T\mathbf{e}/(n-p)$.

Moreover, β^\hat{\boldsymbol{\beta}} and σ^2\hat{\sigma}^2 are independent.
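These quantities can be computed directly (simulated data again, with an arbitrary noise level $\sigma = 2$). The key algebraic fact behind the independence claim is that the residuals satisfy $\mathbf{X}^T\mathbf{e} = \mathbf{0}$, i.e., $\mathbf{e}$ lies in the orthogonal complement of the column space of $\mathbf{X}$, while $\hat{\boldsymbol{\beta}}$ is a function of the projection onto that column space:

```python
import numpy as np

rng = np.random.default_rng(1)
n, p = 40, 3
X = np.column_stack([np.ones(n), rng.normal(size=(n, 2))])
Y = X @ np.array([0.5, 1.0, -1.0]) + rng.normal(scale=2.0, size=n)

beta_hat = np.linalg.solve(X.T @ X, X.T @ Y)
e = Y - X @ beta_hat              # residual vector e = Y - Y_hat
sigma2_hat = (e @ e) / (n - p)    # unbiased estimator of sigma^2

# Residuals are orthogonal to every column of X: X^T e = 0
orthogonality = X.T @ e
```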


Inference

Remark: Testing individual coefficients

To test $H_0: \beta_j = 0$ (whether predictor $j$ contributes after accounting for all other predictors), use $t_j = \frac{\hat{\beta}_j}{\text{SE}(\hat{\beta}_j)} = \frac{\hat{\beta}_j}{\hat{\sigma}\sqrt{[(\mathbf{X}^T\mathbf{X})^{-1}]_{jj}}} \sim t_{n-p}$ under $H_0$. The overall F-test of $H_0: \beta_1 = \cdots = \beta_{p-1} = 0$ uses $F = \frac{SSR/(p-1)}{SSE/(n-p)} \sim F_{p-1, n-p}$.
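Both test statistics follow directly from the quantities already defined; a sketch on simulated data (true coefficients $(1, 3, 0)$ chosen so that predictor 1 clearly matters, which the t-statistic should reflect):

```python
import numpy as np

rng = np.random.default_rng(2)
n, p = 60, 3
X = np.column_stack([np.ones(n), rng.normal(size=(n, 2))])
Y = X @ np.array([1.0, 3.0, 0.0]) + rng.normal(size=n)

XtX_inv = np.linalg.inv(X.T @ X)
beta_hat = XtX_inv @ X.T @ Y
e = Y - X @ beta_hat
sigma2_hat = (e @ e) / (n - p)

# t-statistic for each coefficient: beta_hat_j / SE(beta_hat_j),
# with SE(beta_hat_j) = sigma_hat * sqrt([(X^T X)^{-1}]_jj)
se = np.sqrt(sigma2_hat * np.diag(XtX_inv))
t_stats = beta_hat / se

# Overall F-test: F = (SSR / (p-1)) / (SSE / (n-p))
sse = e @ e
ssr = np.sum((X @ beta_hat - Y.mean()) ** 2)
F = (ssr / (p - 1)) / (sse / (n - p))
```

Each $t_j$ would be compared against $t_{n-p}$ quantiles and $F$ against $F_{p-1,\,n-p}$ quantiles (e.g., via `scipy.stats`) to obtain p-values.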