Statistics

Distributions & Variation

1

Look at This Data Display (Set A)

Study the data display and answer the questions about its shape.

0–2 goals
3–4 goals
5–6 goals
7–8 goals
9–10 goals
1

Which group has the most teams?

2

Is the data bunched in the middle or spread out evenly?

3

Are there any groups with very few teams?

2

Look at This Data Display (Set B)

Study the data and answer.

0–5 min
6–10 min
11–15 min
16–20 min
1

Which time range is the most common?

2

Is the data clustered on one side?

3

How many people took more than 15 minutes?

3

Describe the Distribution (Set A)

Circle the best answer.

Most scores are in the middle with fewer at the ends. This is:

Spread out
Clustered in the middle
All the same

When data values are very different from each other, there is:

Low variation
High variation
No variation

If every student scored exactly 7/10, the variation is:

High
Low
Zero
4

Describe the Distribution (Set B)

Circle the best answer.

Most data is on the left side with a 'tail' to the right:

Symmetrical
Skewed right
Uniform

Data is evenly spread across all categories:

Clustered
Symmetrical
Spread out evenly

One value is much higher/lower than all others:

Typical value
An outlier
A mode
5

High or Low Variation? (Set A)

Sort each example.

Test scores: 85, 87, 86, 88, 84
Heights: 120 cm, 155 cm, 98 cm, 175 cm
Ages: 9, 9, 10, 10, 9
Shoe sizes: 2, 8, 4, 12, 6
Low variation
High variation
6

High or Low Variation? (Set B)

Sort each example.

Temperatures: 22°C, 23°C, 21°C, 22°C
Pocket money: $2, $50, $5, $100
Race times: 12.1 s, 12.3 s, 12.0 s, 12.2 s
Book pages: 50, 300, 20, 800
Low variation
High variation
7

True or False? (Distributions)

Circle TRUE or FALSE.

If all values are the same, there is no variation

TRUE
FALSE

A wider spread of data means more variation

TRUE
FALSE

An outlier is a value that fits perfectly with the rest

TRUE
FALSE

Most real-world data sets have some variation

TRUE
FALSE
8

Match Data Words to Meanings

Draw a line from each word to its meaning.

Variation
Distribution
Outlier
Cluster
Range
Difference between highest and lowest
How spread out values are
How data is spread across groups
A group of values close together
A value far from the rest
9

Order Data Sets by Variation

Number from 1 (least variation) to 3 (most variation).

?
Set A: 50, 50, 51, 49, 50
?
Set B: 10, 30, 50, 70, 90
?
Set C: 40, 45, 50, 55, 60
10

Find the Range (Set A)

Calculate the range for each data set. Range = highest − lowest.

Data: 5, 8, 12, 3, 7. Range = ___ − ___ = ___

Data: 20, 25, 22, 28, 21. Range = ___

Data: 100, 50, 75, 90, 60. Range = ___

11

Find the Range (Set B)

Calculate the range.

Data: 14, 14, 15, 14, 15. Range = ___

Data: 2, 45, 8, 10, 12. Range = ___

Which set above has more variation? How does the range tell you?

12

Range Calculations

Find the range (total = highest, partA = lowest, partB = range).

50
12
?
100
?
65
88
23
?
200
?
150
13

Identify Outliers

Circle the outlier (the value that does not fit with the rest).

Data: 10, 12, 11, 13, 50

10
13
50

Data: 80, 82, 5, 79, 81

5
80
82

Data: 25, 27, 26, 28, 25, 90

25
28
90
14

Identify Outliers (Set B)

Circle the outlier.

Data: 4, 5, 6, 4, 5, 30

4
6
30

Data: 100, 98, 102, 99, 15

15
99
102

Data: 55, 52, 58, 54, 200

52
58
200
15

Spread and Variation (Set A)

Circle the correct answer.

Data with a SMALL range has:

Lots of variation
Little variation
No values

Data: 5, 6, 7, 5, 6 — the range is:

2
5
7

Data: 10, 50, 20, 80, 30 — the range is:

30
50
70

Which set has MORE variation: {3, 4, 5} or {1, 50, 100}?

{3, 4, 5}
{1, 50, 100}
16

Spread and Variation (Set B)

Circle the correct answer.

Data: 20, 22, 21, 19, 20 — the range is:

2
3
22

Data: 8, 35, 12, 40, 5 — the range is:

27
32
35

A set with range 3 has ___ variation than a set with range 50:

More
Less
Same

Range = Highest − Lowest. If highest = 45 and lowest = 18, range = ?

25
27
63
17

Range Calculations (Set B)

Range = Highest − Lowest. Total = highest, partA = lowest, partB = range.

75
20
?
120
?
85
60
15
?
95
?
60
150
42
?
18

Range Calculations (Set C)

Find the missing number.

48
12
?
200
?
135
99
33
?
80
?
45
500
125
?
19

More or Less Variation?

Sort each data set: more or less variation?

Heights: 130 cm, 132 cm, 131 cm, 129 cm
Heights: 100 cm, 150 cm, 120 cm, 180 cm
Scores: 88, 90, 89, 91
Scores: 12, 95, 43, 78
Ages: 9, 10, 9, 10
Ages: 2, 15, 8, 45
Low variation (small range)
High variation (large range)
20

Sort: Cluster or Spread?

Sort each description.

Most values between 20 and 25
Values range from 5 to 95
All readings within 3 of the average
Largest value is 10 times the smallest
Range of 4
Range of 80
Clustered together
Spread out
21

Match Data to Range

Draw a line from each data set to its range.

{5, 8, 12, 3}
{20, 22, 21, 23}
{100, 10, 50, 80}
{7, 7, 8, 7}
1
3
9
90
22

Match Descriptions to Data

Draw a line to match.

Low variation
High variation
Has an outlier
Evenly spread
{50, 51, 52, 50}
{2, 80, 15, 99}
{10, 11, 12, 60}
{10, 20, 30, 40}
23

Distribution of Test Scores

Use the graph to answer questions about the distribution.

0-5
6-10
11-15
16-20
1

Which score range was most common?

2

How many students scored 10 or less?

3

Is the data clustered or spread out?

4

How many students took the test?

24

Distribution of Heights

Use the graph to answer.

120-129 cm
130-139 cm
140-149 cm
150-159 cm
1

Which height range was most common?

2

Where is the data clustered?

3

What is the range of the group counts?

25

Variation in Everyday Life

Think about variation in the real world.

Give an example of data with LOW variation (e.g. temperatures in one day):

Give an example of data with HIGH variation:

Why is it useful to know if data has high or low variation?

26

Calculate the Range

Find the range for each data set.

Data: 15, 22, 18, 30, 10. Highest = ___, Lowest = ___, Range = ___

Data: 5, 5, 6, 5, 7, 5. Highest = ___, Lowest = ___, Range = ___

Data: 100, 45, 80, 60, 95. Highest = ___, Lowest = ___, Range = ___

Which set above has the most variation? How do you know?

27

Distribution of Pocket Money

Record the data in a tally chart.

ItemTallyTotal
$0-$2
$3-$5
$6-$8
$9+
28

Range Patterns

These are ranges for data sets. Continue the pattern.

2
4
6
8
?
?
?
5
10
15
20
?
?
?
29

Describe the Data (Set A)

Look at each set of data and describe the distribution.

Spelling test scores: 4, 5, 7, 7, 8, 8, 8, 9, 9, 10. Where are most scores? Much variation?

Number of books read: 1, 1, 2, 5, 5, 5, 6, 6, 12. Spread out or bunched?

30

Describe the Data (Set B)

Describe each data set.

Heights (cm): 130, 132, 131, 133, 130, 132. Describe the variation.

Goals scored: 0, 1, 1, 2, 2, 3, 3, 3, 4, 10. Is there an outlier?

31

Compare Two Data Sets (Set A)

Compare these two sets of data.

Set A: 45, 47, 48, 50, 51, 52, 55. Set B: 20, 35, 48, 50, 63, 78, 95. Which has more variation?

32

Compare Two Data Sets (Set B)

Compare these data sets.

Class A test scores: 70, 72, 71, 73, 70. Class B test scores: 50, 60, 70, 80, 90. Which class had more consistent results?

Explain what 'consistent results' means using the word 'variation'.

33

Range as a Measure of Variation

The range = highest value − lowest value.

Data: 3, 7, 12, 5, 9. Range = ___ − ___ = ___

Data: 22, 24, 23, 25, 22. Range = ___

Which data set above has more variation? How does the range help you tell?

34

What Does the Shape Tell Us?

Circle the best description.

Data: 1, 2, 5, 8, 8, 9, 9, 10. Most values are:

Low
In the middle
High

Data: 1, 1, 2, 2, 3, 7, 8, 9. The distribution is:

Bunched at the low end
Evenly spread
Bunched at the high end

Data: 5, 5, 5, 5, 5. The shape is:

Spread out
Uniform (all the same)
Skewed
35

Investigate Variation

Answer these investigation questions.

If you measured the height of everyone in your class, would you expect high or low variation? Why?

If you rolled a dice 100 times, would you expect each number to appear exactly the same number of times?

Give an example of data you would expect to have very low variation.

36

Challenge: Create Data Sets

Create your own data sets to match these descriptions.

Write 8 numbers that have LOW variation:

Write 8 numbers that have HIGH variation:

Write 8 numbers with one outlier:

37

Home Activity: Investigating Variation

Explore variation in data at home!

  • 1Measure the temperature at 9 am every day for a week. Is there much variation?
  • 2Count how many steps you take to walk to 5 different places. How spread out are the numbers?
  • 3Ask 10 people their favourite number between 1 and 10. Are the answers bunched or spread?
  • 4Roll a dice 30 times. Do all numbers appear the same amount? Describe the variation.
38

Understanding the Median

Find the median (middle value) of each dataset.

3, 7, 9, 14, 21 — median = ___

4, 8, 11, 15, 17, 22 — median = ___

24, 16, 9, 31, 7 — first put in order: ___. Median = ___

How does the median help us understand a dataset?

39

Understanding the Mean (Average)

Find the mean of each dataset.

4, 8, 6, 10, 7 — total = ___, mean = ___

15, 20, 25, 10 — total = ___, mean = ___

Temperatures over 5 days: 18, 22, 25, 21, 19 — mean temperature = ___

40

Range of a Dataset

Find the range (largest − smallest) of each dataset.

5, 12, 3, 9, 17, 8 — range = ___

24, 16, 32, 9, 41 — range = ___

If a dataset has a large range, what does that tell you about the data?

41

Sort: Low or High Variation?

Sort each dataset by whether it shows low or high variation.

5, 6, 5, 6, 5, 6
3, 15, 8, 42, 1, 67
99, 100, 101, 99, 100
10, 40, 70, 100, 5, 80
12, 13, 12, 14, 11
2, 200, 50, 150, 1
Low variation (close together)
High variation (spread out)
42

Interpret Distributions (Set A)

Circle the best description for each dataset.

Test scores: 72, 75, 71, 74, 73, 76. This distribution is:

Clustered (bunched)
Spread out
Bimodal

Heights: 130, 145, 162, 170, 131. This distribution is:

Clustered
Spread out
Uniform

Daily steps: 8000, 8100, 7900, 8050, 8000. This distribution is:

Clustered
Spread out
Increasing

Pocket money: $2, $15, $8, $30, $5. This distribution is:

Clustered
Spread out
Uniform
43

Describing Distributions

Describe each distribution in your own words.

Daily rainfall (mm): 0, 0, 12, 0, 0, 45, 0, 2, 0, 0. Describe the distribution.

Marks on a test (out of 20): 14, 16, 13, 15, 17, 14, 16, 15. Describe the distribution.

Which dataset has more variation? Explain how you can tell.

44

Plant Growth Over 5 Weeks

Each leaf icon = 2 cm of growth. Read and answer.

Week 1
Week 2
Week 3
Week 4
Week 5
1

In which week did the plant grow the most?

2

What was the total growth over 5 weeks?

3

Describe the pattern of growth. Is it consistent?

4

Calculate the mean growth per week.

45

Hours of Sleep Survey

10 students recorded how many hours they slept. Read the tally.

ItemTallyTotal
Less than 8 hours
8 hours
9 hours
More than 9 hours
46

Outliers in Data

An outlier is a value that is very different from the others.

Scores: 15, 16, 14, 17, 15, 2, 16. Which score is an outlier? ___. How does it affect the mean?

Masses: 3.2 kg, 3.5 kg, 3.1 kg, 25 kg, 3.4 kg. Circle the outlier. Why might this value be here?

47

Comparing Two Datasets

Compare these two sets of test results.

Class A: 60, 65, 70, 68, 72, 65. Class B: 45, 80, 55, 90, 40, 88. Find the range for each class.

Which class shows more variation? How do you know?

Find the mean for each class. Which class performed better on average?

48

Stem-and-Leaf Plots

A stem-and-leaf plot organises data by tens (stems) and ones (leaves).

Data: 23, 25, 31, 35, 38, 42, 44, 45, 51. Complete the stem-and-leaf plot: 2|___ 3|___ 4|___ 5|___

What is the median of this dataset?

Describe the distribution — is the data spread or clustered?

49

Challenge: Making Predictions

Use data distributions to make predictions.

A shop sells 30, 45, 38, 50, 42 ice creams on 5 days. Predict sales for day 6. Justify your prediction.

If a new data point of 100 is added to the set, how would the mean and range change?