Working with Sets of Data

Compare datasets using IQR, outliers and statistical measures in context. MYP Year 5 Standard Maths statistics guidance for Criterion D responses.

Want help mastering this topic?
Work 1-on-1 with an IB expert tutor.
Book a session →

Building on Basic Statistics

Once you can calculate measures for a single dataset, the next step is comparing two or more datasets. This topic develops your ability to evaluate data critically and draw meaningful conclusions — skills that are directly assessed in Criterion D.

Interquartile Range (IQR)

The interquartile range (IQR = Q3 − Q1) measures the spread of the middle 50% of data. Unlike the range, it is not affected by extreme values, making it a more reliable measure of consistency.

You need to be able to:

Identifying Outliers

An outlier is a value that lies significantly outside the general pattern of a dataset. In MYP assessments, you may be asked to identify outliers using the IQR fence method or simply by inspection of a box plot. Always consider whether an outlier reflects a genuine anomaly or a data entry error.

Comparing Datasets in Context

When comparing two datasets, structure your response around:

  1. A comparison of averages (e.g. "Dataset A has a higher median, suggesting…")
  2. A comparison of spread (e.g. "Dataset B has a larger IQR, indicating more variation…")
  3. A conclusion in context (e.g. "This suggests students in Group B performed less consistently…")

This three-part structure is what Criterion D responses reward.

Common Mistakes

Frequently asked questions

Moves beyond single-dataset summaries to comparing two sets side by side. You find quartiles (Q1, Q3) and the interquartile range (IQR = Q3 - Q1) as a measure of spread less affected by outliers than the range. You'll draw and read box plots, then use them to compare centre (median) and spread (IQR) between two groups. Sits after basic averages in the Standard pathway and prepares you for probability work that follows.
Students forget to order the data first. Always sort values smallest to largest before locating Q1 and Q3. The next slip is mishandling the median when n is even: split the ordered list into two equal halves at the median, then take Q1 as the median of the lower half and Q3 of the upper half. When comparing two box plots, always quote one centre statement (medians) and one spread statement (IQRs) in context for full marks.
Ready to start?
Book a free diagnostic.
Get started →

Related