• Nakafa

    Nakafa

    Learn free and with quality.
Subject
    • Grade 10
    • Grade 11
    • Grade 12
Exercises
Holy
  • Quran
Articles
  • Politics
  • Community
  • About

Command Palette

Search for a command to run...

Statistics

Coefficient of Determination

What is the Coefficient of Determination?

After finding the best-fit linear regression line for our data, the next question is: how well does that line actually represent or explain our data?

The measure that answers this question is the Coefficient of Determination, denoted as r2r^2r2 (read: r-squared).

Simply put, r2r^2r2 tells us the proportion or percentage of the variation (ups and downs in values) in the dependent variable (Y) that can be explained by the variation in the independent variable (X) using our linear regression model.

Coefficient of Determination from a Scatter Diagram

The value of r2r^2r2 is closely related to how tightly the data points cluster around the regression line:

  1. High r2r^2r2 (approaching 1 or 100%)

    High r2r^2r2
    Data points are very close to the regression line.

    See how the data points above are very tightly packed and close to the regression line? This indicates a high r2r^2r2 value (for example, maybe around 0.95 or 95%). This means that most of the variation in Y values can be explained well by the regression line (or by variable X).

  2. Low r2r^2r2 (approaching 0 or 0%)

    Low r2r^2r2
    Data points are scattered far from the regression line.

    Compare this with this diagram. The points are more spread out from the regression line (the residual lines are longer). This indicates a low r2r^2r2 value (for example, maybe around 0.40 or 40%). This means that this regression line is not very good at explaining the variation in Y values; only a small portion of the variation in Y can be explained by X through this model.

Calculating the Coefficient of Determination

The easiest way to calculate r2r^2r2 is by squaring the Correlation Coefficient (rrr) that we learned about earlier.

r2=(r)2r^2 = (r)^2r2=(r)2

So, if you've already calculated the value of rrr, just square it!

Since the value of rrr is always between -1 and +1 (−1≤r≤1-1 \le r \le 1−1≤r≤1), the value of r2r^2r2 will always be between 0 and 1.

0≤r2≤10 \le r^2 \le 10≤r2≤1

Mathematically (using Sum of Squares):

The value of r2r^2r2 can also be calculated directly using the Sum of Squares values used to calculate rrr:

r2=(SSxy)2SSxxSSyyr^2 = \frac{(SS_{xy})^2}{SS_{xx} SS_{yy}}r2=SSxx​SSyy​(SSxy​)2​

Interpretation as a Percentage

The value of r2r^2r2 is often converted into a percentage (by multiplying by 100) for easier interpretation.

  • If r2=0.81r^2 = 0.81r2=0.81, it means that 81% of the total variation in variable Y can be explained by the variation in variable X through the linear regression model.
  • The remaining variation (1−r21 - r^21−r2 or 19% in this example) is explained by other factors not included in the model (could be other variables, or random error).

The higher the percentage of r2r^2r2, the better our linear regression model is at explaining the relationship between X and Y.

Previous

Product Moment Correlation

Next

Complex Number Concept

  • Coefficient of DeterminationLearn how r² measures how well your regression line explains data variation. Master coefficient of determination with visual examples and calculations.
On this page
  • What is the Coefficient of Determination?
  • Coefficient of Determination from a Scatter Diagram
  • Calculating the Coefficient of Determination
  • Interpretation as a Percentage
  • Comments
  • Report
  • Source code