A number characterising the model prediction quality ( goodness of fit ):

$\begin{array}{l}\displaystyle R^2 = 1 - \frac{RMSD(x, \hat x)}{MSE(x, \bar x)}, \quad 0 \leq R^2 \leq 1\end{array}$

where

$\begin{array}{l}x = \{ x_1, \, x_2, \, x_3 , ... x_N \}\end{array}$	a variable represented by a discrete data set of numerical samples
$\begin{array}{l}\hat x = \{ \hat x_1, \, \hat x_2, \, \hat x_3 , ... \hat x_N \}\end{array}$	predictor of variable $\begin{array}{l}x\end{array}$ , represented by another discrete data set of numerical samples, with the same number of samples $\begin{array}{l}N\end{array}$ predicted at the same conditions as the original samples $\begin{array}{l}\{ x_1, \, x_2, \, x_3 , ... x_N \}\end{array}$
$\begin{array}{l}\bar x\end{array}$	mean value of the variable $\begin{array}{l}x\end{array}$ , which can be considered as some sort of extreme predictor with zero variability
$\begin{array}{l}RMSD(x, \hat x)\end{array}$	Root-Mean-Square Deviation between a variable $\begin{array}{l}x\end{array}$ and its predictor $\begin{array}{l}\hat x\end{array}$
$\begin{array}{l}MSE(x) = RMSD(x, \bar x)\end{array}$	mean square error between a variable $\begin{array}{l}x\end{array}$ and its mean value $\begin{array}{l}\bar x\end{array}$

The coefficient of determination $\begin{array}{l}R^2\end{array}$ normally ranges between:

0, indicating a course fit, trending to the mean average value

and

1, indicating a fine fit, fairly reproducing the variability of the $\begin{array}{l}x\end{array}$

Negative $\begin{array}{l}R^2\end{array}$ values indicate a substantial mismatch between variable $\begin{array}{l}x\end{array}$ and model prediction $\begin{array}{l}\hat x\end{array}$ .

Page tree

See also

Page tree

Coefficient of determination (R2)

See also