Residual sum of squares

One explanatory variable

Summarize

Perspective

In a model with a single explanatory variable, RSS is given by:^[1]

$\operatorname {RSS} =\sum _{i=1}^{n}\left(y_{i}-f(x_{i})\right)^{2}$

where y_i is the i^th value of the variable to be predicted, x_i is the i^th value of the explanatory variable, and $f(x_{i})$ is the predicted value of y_i (also termed ${\hat {y_{i}}}$ ). In a standard linear simple regression model, $y_{i}=\alpha +\beta x_{i}+\varepsilon _{i}\,$ , where $\alpha$ and $\beta$ are coefficients, y and x are the regressand and the regressor, respectively, and ε is the error term. The sum of squares of residuals is the sum of squares of ${\widehat {\varepsilon \,}}_{i}$ ; that is

$\operatorname {RSS} =\sum _{i=1}^{n}\left({\widehat {\varepsilon }}_{i}\right)^{2}=\sum _{i=1}^{n}\left(y_{i}-({\widehat {\alpha \,}}+{\widehat {\beta }}\,x_{i})\right)^{2}$

where ${\widehat {\alpha \,}}$ is the estimated value of the constant term $\alpha$ and ${\widehat {\beta \,}}$ is the estimated value of the slope coefficient $\beta$ .

Remove ads

Matrix expression for the OLS residual sum of squares

Summarize

Perspective

The general regression model with $n$ observations and $k$ explanators, the first of which is a constant unit vector whose coefficient is the regression intercept, is

$y=X\beta +e$

where $y$ is an n × 1 vector of dependent variable observations, each column of the n × k matrix $X$ is a vector of observations on one of the k explanators, $\beta$ is a k × 1 vector of true coefficients, and $e$ is an n× 1 vector of the true underlying errors. The ordinary least squares estimator for $\beta$ is

${\begin{aligned}&X{\hat {\beta }}=y\\[1ex]\iff &X^{\operatorname {T} }X{\hat {\beta }}=X^{\operatorname {T} }y\\[1ex]\iff &{\hat {\beta }}=\left(X^{\operatorname {T} }X\right)^{-1}X^{\operatorname {T} }y.\end{aligned}}$

The residual vector ${\hat {e}}=y-X{\hat {\beta }}=y-X(X^{\operatorname {T} }X)^{-1}X^{\operatorname {T} }y$ ; so the residual sum of squares is:

$\operatorname {RSS} ={\hat {e}}^{\operatorname {T} }{\hat {e}}=\left\|{\hat {e}}\right\|^{2},$

(equivalent to the square of the norm of residuals). In full:

${\begin{aligned}\operatorname {RSS} &=y^{\operatorname {T} }y-y^{\operatorname {T} }X\left(X^{\operatorname {T} }X\right)^{-1}X^{\operatorname {T} }y\\[1ex]&=y^{\operatorname {T} }\left[I-X\left(X^{\operatorname {T} }X\right)^{-1}X^{\operatorname {T} }\right]y\\[1ex]&=y^{\operatorname {T} }\left[I-H\right]y,\end{aligned}}$

where $H$ is the hat matrix, or the projection matrix in linear regression.

Remove ads

Relation with Pearson's product-moment correlation

Summarize

Perspective

The least-squares regression line is given by

$y=ax+b,$

where $b={\bar {y}}-a{\bar {x}}$ and $a={\frac {S_{xy}}{S_{xx}}}$ , where $S_{xy}=\sum _{i=1}^{n}({\bar {x}}-x_{i})({\bar {y}}-y_{i})$ and $S_{xx}=\sum _{i=1}^{n}({\bar {x}}-x_{i})^{2}.$

Therefore,

${\begin{aligned}\operatorname {RSS} &=\sum _{i=1}^{n}\left(y_{i}-f(x_{i})\right)^{2}=\sum _{i=1}^{n}\left(y_{i}-(ax_{i}+b)\right)^{2}\\[1ex]&=\sum _{i=1}^{n}\left(y_{i}-ax_{i}-{\bar {y}}+a{\bar {x}}\right)^{2}=\sum _{i=1}^{n}\left[a\left({\bar {x}}-x_{i}\right)-\left({\bar {y}}-y_{i}\right)\right]^{2}\\[1ex]&=a^{2}S_{xx}-2aS_{xy}+S_{yy}=S_{yy}-aS_{xy}\\[1ex]&=S_{yy}\left(1-{\frac {S_{xy}^{2}}{S_{xx}S_{yy}}}\right)\end{aligned}}$

where $S_{yy}=\sum _{i=1}^{n}({\bar {y}}-y_{i})^{2}.$

The Pearson product-moment correlation is given by $r={\frac {S_{xy}}{\sqrt {S_{xx}S_{yy}}}};$ therefore, $\operatorname {RSS} =S_{yy}(1-r^{2}).$

Remove ads

Residual sum of squares

One explanatory variable

Matrix expression for the OLS residual sum of squares

Relation with Pearson's product-moment correlation

See also

References

Wikiwand - on