Correlation (Extended)

Understand Pearson's correlation coefficient, interpret r values and distinguish causation from correlation. IB MYP Year 5 Extended Maths statistics guide.

Want help mastering this topic?
Work 1-on-1 with an IB expert tutor.
Book a session →

From Description to Measurement

While Standard level students describe correlation qualitatively, Extended students work with a numerical measure: Pearson's correlation coefficient (r). This gives a precise value for the strength and direction of a linear relationship between two variables.

Pearson's Correlation Coefficient

The value of r always lies between −1 and 1:

In MYP assessments, r is typically provided or calculated using a GDC. You are expected to interpret the value, not derive the formula from scratch.

Interpreting r in Context

Simply stating "r = 0.85, which is strong positive correlation" is insufficient at Extended level. A complete response should:

  1. State the value of r
  2. Describe the strength and direction
  3. Interpret what this means for the specific variables in the question

Causation vs Correlation

A strong correlation does not imply that one variable causes the other. This is one of the most important conceptual distinctions in statistics. MYP questions regularly test whether students can identify confounding variables or explain why correlation is not sufficient evidence of causation.

Example: Ice cream sales and drowning rates are positively correlated, but both are driven by a third variable — hot weather.

Common Mistakes

Frequently asked questions

Focuses on Pearson's correlation coefficient (r), used to measure linear association between two quantitative variables. You'll interpret r values between -1 and +1 for strength (weak, moderate, strong) and direction (positive, negative). Also distinguishes correlation from causation, reminding you that a strong r doesn't prove one variable causes the other. Sitting after scatter graphs in Extended, it builds on visual analysis by adding a numerical measure.
Students often confuse strength with direction. An r of -0.9 is stronger than +0.5, even though it's negative; the sign only tells you the slope direction. Another frequent slip: claiming causation from a high r value, e.g. saying ice cream sales cause drownings because r is near 0.8. Always describe r in two parts: direction and strength (using magnitude), then explicitly state whether a causal link is justified. Mention possible lurking variables.
Ready to start?
Book a free diagnostic.
Get started →

Related