Statistics

Statistical Investigation & Data Display

1

Types of Data

Sort each example into the correct data type.

Favourite sport
Number of siblings
Height in cm
Eye colour
Number of pets
Temperature at noon
Type of music preferred
Time to run 100 m
Categorical
Discrete numerical
Continuous numerical
2

Match Graph to Data Type

Draw a line to match each data type to its best graph.

Categorical data
Numerical data distribution
Changes over time
Two numerical variables
Scatter plot
Bar or column graph
Histogram or stem-and-leaf
Line graph
3

Interpret the Stem-and-Leaf Plot

The stem-and-leaf plot shows test scores: Stem | Leaves — 4|2 5 8 — 5|1 3 6 9 — 6|0 4 7 — 7|2 5. Answer each question.

How many students sat the test?

13
12
14

What is the highest score?

75
72
67

How many students scored 50 or more?

11
10
13
4

Identify Outliers

Look at each data set. Which value is an outlier?

Data: 12, 14, 13, 15, 11, 45

45
11
15

Data: 80, 82, 79, 81, 3, 83

3
83
79

Data: 5, 6, 5, 4, 6, 5, 55

55
4
6
5

Investigation Steps

Sort the steps of a statistical investigation in correct order.

Collect and organise data
Pose a statistical question
Draw conclusions and communicate findings
Display data in a graph
First
Second
Third
Fourth
6

Conduct a Mini-Investigation

Plan and describe a statistical investigation.

Write a statistical question you could investigate in your class (e.g., 'How many hours of screen time per day do students have?').

How would you collect the data? What type of data is it?

Which graph would you use to display your results? Explain why.

What is one question you could answer from your results?

7

Reading Dot Plots

A dot plot shows daily steps (in thousands): values are 6, 6, 7, 7, 7, 8, 9, 9, 10, 12. Answer each question.

What is the most common value (mode)?

7
6
9

What is the range?

6
7
5

How many data values are at or above 9?

3
4
2

Which value could be considered an outlier?

12
10
6
8

Match Graph Features to Descriptions

Draw a line to match each graph feature to what it shows.

A gap between one value and the rest
The middle of the distribution
How spread out the data is
The most common category
Mode
Outlier
Range or IQR
Median or centre
9

Identifying Misleading Graphs

Circle the feature that makes each graph potentially misleading.

A bar chart where the y-axis starts at 80 instead of 0

Truncated axis
Wrong colours
Too many bars

A pie chart where all slices appear equal but the percentages given add to 110%

Percentages don't add to 100%
The chart is too small
No title shown

A line graph with uneven time intervals on the x-axis

Unequal intervals
Too many labels
Missing legend
10

Comparing Two Data Sets

Two classes recorded how many books they read in a month.

Class A: 2, 3, 3, 4, 5, 5, 6, 7, 7, 8. Class B: 1, 1, 2, 3, 4, 4, 5, 5, 10, 10. Find the range for each class.

Find the median for each class.

Which class has more consistent reading habits? Use the statistics to explain.

11

Range and Spread

Sort each statement about data spread as true or false.

A larger range always means the data is more useful
Range = highest value − lowest value
Two data sets with the same range must have the same median
Range is a measure of spread
An outlier can greatly increase the range
True
False
12

Collect and Display Data

Conduct your own mini data investigation at home.

  • 1Ask 10 people (friends, family) how many hours of sleep they got last night. Record the data and draw a dot plot.
  • 2Count the number of words in 10 consecutive sentences in a book. Find the range and median.
  • 3Record the daily high temperature for 7 days. Display the data in a line graph and describe the trend.
  • 4Collect data on the ages of 10 items in your home (e.g., books, appliances). What type of graph best shows this data?
17

Constructing a Stem-and-Leaf Plot

Construct a stem-and-leaf plot for the given data.

Data: 34, 41, 29, 35, 28, 47, 31, 45, 38, 27, 43, 33. Construct a back-to-back or single stem-and-leaf plot. Find the range and median.

Draw here
18

Frequency Tables

Use the frequency table to answer each question. Score | Freq 3 | 2 4 | 5 5 | 7 6 | 4 7 | 2

Total number of students:

20
25
19

Most common score (mode):

5
4
6

What fraction of students scored 7?

2/20 = 1/10
2/7
1/5
20

Bar Chart vs Histogram

Compare bar charts and histograms.

What is the key difference between a bar chart and a histogram? When would you use each one?

A class measured their hand spans: 14, 15, 15, 16, 16, 16, 17, 17, 18, 19 cm. Would you display this in a bar chart or histogram? Explain and sketch the appropriate graph.

Draw here
24

Dot Plot Analysis

Analyse the following dot plot data.

A dot plot shows the number of goals scored by a football team each game: 0, 0, 1, 1, 1, 2, 2, 2, 2, 3, 3, 4, 7. Describe the distribution: is it symmetric or skewed? What is the outlier? What does the mode tell us about this team's scoring?

28

Identifying Bias in Surveys

Evaluate potential sources of bias in each survey design.

A survey asks: 'Don't you agree that students should have more homework?' What is biased about this question? Rewrite it as an unbiased question.

To find students' favourite subject, a school surveys only students attending the maths extension class. What is the sampling bias? How would you fix it?

TipBias occurs when the sample or question design systematically favours one result over another.
29

Appropriate Graph for Data Type

Circle the most appropriate graph for each situation.

Showing the proportion of time spent on different activities

Pie chart
Line graph
Histogram

Showing individual test scores for 20 students

Dot plot
Pie chart
Line graph

Showing how school enrolment changed from 2010 to 2024

Line graph
Bar chart
Pie chart

Comparing favourite foods of Year 7 students

Bar chart
Histogram
Stem-and-leaf
30

Two-Way Tables

Interpret a two-way table and answer questions.

A survey of 40 students shows: 15 boys like sport (5 don't); 12 girls like sport (8 don't). Complete a two-way table. What fraction of all students like sport?

Draw here

From the table: are boys or girls more likely to like sport? Express your answer as a percentage.

TipTwo-way tables (also called contingency tables) show the relationship between two categorical variables.
32

Statistical Terms Sort

Sort each term into the correct column.

Mean
Range
Dot plot
Median
Interquartile range
Histogram
Mode
Bar chart
Measure of Centre
Measure of Spread
Data Display
35

Designing a Fair Survey

Design a statistical investigation on a topic of your choice.

Choose a topic to investigate (e.g., sleep habits, sports participation, screen time). Write a clear statistical question. Describe how you would collect a random sample of at least 20 responses. What type of data will you collect (categorical, discrete, or continuous)?

Describe which graph you would use to display your results and explain why it is the most appropriate choice for your data type.

36

Data Sequence — Cumulative Frequency

A frequency table shows: Score 1: freq 3, Score 2: freq 5, Score 3: freq 7, Score 4: freq 4, Score 5: freq 1. Fill in the cumulative frequencies.

3
8
15
20
?
5
10
20
40
?
37

Interpreting a Histogram

Analyse a histogram and describe the distribution.

A histogram shows heights of students in intervals: 140–150 cm (5 students), 150–160 cm (12 students), 160–170 cm (15 students), 170–180 cm (8 students), 180–190 cm (2 students). Describe the shape of the distribution. What is the most common height interval? What is an estimate of the median?

TipA histogram shows how continuous data is distributed across intervals (called classes or bins).
40

Critiquing a Statistical Claim

Evaluate a statistical claim using your knowledge of data and graphs.

A newspaper headline says: 'Screen time up 200% — teenagers in crisis!' The article shows a bar chart with y-axis starting at 3 hours. Average screen time went from 3 hours to 5 hours. (a) Is the 200% claim correct? (b) How is the bar chart misleading? (c) What would a fair representation look like?

43

Cross-Curricular Data — Science Context

Apply statistics to a science investigation context.

A student measures plant growth over 8 days: 0, 1, 2, 3, 3, 4, 5, 6 cm. Display this data in a line graph. Calculate the range. What type of data is this? Describe the trend.

Draw here

Another plant's growth was: 0, 0, 0, 0, 4, 8, 12, 16 cm. Compare the two plants' growth patterns. Which grew more consistently? What caused the step change in the second plant?

44

Interpreting Pie Charts

A pie chart shows how 80 students travel to school: Walk (90°), Car (120°), Bike (30°), Bus (120°). Answer each question.

How many students walk? (90° out of 360°)

20
25
30

What fraction travel by car?

1/3
1/4
1/2

How many students cycle?

10
8
6
45

Create a Pie Chart

Construct a pie chart from survey results.

30 students chose their favourite season: Summer 12, Winter 6, Autumn 9, Spring 3. Calculate the angle for each sector. Draw the pie chart and label each sector with the percentage.

Draw here
TipTo find each sector angle: (frequency ÷ total) × 360°.
49

Scatter Plots — Introduction

Interpret a scatter plot showing study time vs. test score.

A scatter plot shows: as study hours increase from 1 to 6, test scores generally increase from 45 to 90. Describe the correlation (positive, negative, or none). Is it strong or weak?

A student studies 0 hours but scores 75. Is this consistent with the trend? What might explain it?

TipScatter plots show whether two numerical variables are related (correlated). Positive correlation: both increase together. Negative correlation: one increases as the other decreases.
51

Reading a Back-to-Back Stem Plot

Back-to-back stem plot for ages at two events: Left (Event A): 3 2 1 | 2 | 4 5 8 Left (Event A): 7 6 4 2 | 3 | 1 3 Left (Event A): 8 5 1 | 4 | 0 2 Answer each question.

Median age for Event A (read from oldest to youngest on left):

32
27
34

Which event has older attendees on average?

Event A
Event B
Both the same

What is the range of ages at Event B?

22
18
28
53

Grouped Frequency Tables

Organise data into a grouped frequency table.

Heights (cm): 152, 163, 155, 171, 148, 165, 158, 174, 160, 145, 168, 177, 151, 162, 169. Organise into groups: 140–149, 150–159, 160–169, 170–179. Complete the frequency table and draw a histogram.

Draw here
55

Describing Distributions

Describe the shape of each distribution using correct statistical language.

Describe the shape of this distribution of exam scores: most students scored around 70, fewer scored very high or very low. Is it symmetric, positively skewed, or negatively skewed?

A data set on reaction times shows most values clustered at 0.3 seconds with a long tail to the right up to 1.5 seconds. What distribution shape is this? What might cause the long tail?

TipKey distribution shapes: symmetric (bell-shaped), positively skewed (tail to the right), negatively skewed (tail to the left), bimodal (two peaks), uniform (flat).
56

Graph Types — Best Match

Sort each data scenario to the most appropriate graph.

Comparing favourite music genres of students
Monthly rainfall over a year
Arm span vs height of 20 students
Number of pets per family
Temperature every hour across a day
Study time vs exam score
Bar chart
Line graph
Scatter plot
58

Ethics in Data Collection

Consider ethical issues in statistical research.

A company collects data on its customers' shopping habits without telling them. List 3 ethical concerns with this approach. How should data be collected ethically?

A medical trial collects health data from 1000 people. Why is it important that participants give informed consent? What risks might they face if their data is misused?

60

Statistical Reasoning — Final Investigation

Design and reason through a complete investigation.

Choose a topic: does amount of sleep affect academic performance? OR does exercise frequency affect mood? Write a statistical question, describe your collection method, list 3 potential sources of bias, choose an appropriate graph, and describe what results you would expect to find and why.

TipThis open-ended investigation develops all statistical reasoning skills. It may take 20–30 minutes.
62

Choosing Mean, Median, or Mode

Circle the best measure of centre for each situation.

Most popular shoe size in a shop

Mode
Mean
Median

Typical house price in a suburb (with some very expensive houses)

Median
Mean
Mode

Average score on a fair test with no outliers

Mean
Mode
Median

Most common eye colour in a class

Mode
Mean
Median
64

Interpreting Data in Context

Interpret data from a realistic context and draw meaningful conclusions.

Australia's average annual rainfall has been recorded every year for 100 years. A line graph shows a slight downward trend. List 3 questions a scientist would ask to determine if this trend is statistically meaningful.

A school principal reports that the average class score improved from 68 to 72 between two years. One student's score improved from 40 to 80. Explain why this single student could have a large effect on the mean but not on the median.

66

Frequency Sequence

A frequency table grows by adding one more category. Fill in the missing cumulative total.

5
12
20
32
37
?
10
25
40
60
?
67

Infographic Design

Plan a data infographic.

You have data about 50 Year 7 students: favourite subjects (5 types), average study hours per week (range 0–15), and whether they play sport (Yes/No). Choose one graph type for each variable and explain your choice. Sketch one of the three graphs.

Draw here
71

Calculating and Interpreting Relative Frequency

Use relative frequency to describe experimental results.

A spinner is spun 200 times: Red 85, Blue 60, Green 55. Calculate the relative frequency of each colour. How close is each to the theoretical probability if the spinner were fair (1/3 each)?

72

Data Vocabulary

Circle the correct term for each definition.

The entire group being studied

Population
Sample
Variable
Census

A value that is much higher or lower than most of the data

Outlier
Mode
Range
Mean

Data collected in categories rather than numbers

Categorical
Continuous
Discrete
Numerical
73

Box Plot Introduction

Interpret a box plot.

A box plot for test scores shows: minimum 35, Q1 55, median 65, Q3 75, maximum 90. What is the interquartile range (IQR = Q3 − Q1)? What does the IQR tell you about the middle 50% of students?

Another class has the same median but a much larger IQR. What does this tell you about the spread of scores in that class?

TipBox plots (also called box-and-whisker plots) summarise data using 5 numbers: minimum, lower quartile (Q1), median, upper quartile (Q3), and maximum.
75

Advantages and Disadvantages of Graphs

Sort each statement into the correct column.

Shows trends over time clearly
Best for comparing categories
Shows the relationship between two numerical variables
Easy to compare heights of bars
Needs both variables to be numerical
Can show seasonal patterns
Bar Chart
Line Graph
Scatter Plot
78

Comparing Box Plots

Compare two data sets using box plots.

Class A results: Min 40, Q1 55, Median 70, Q3 80, Max 95. Class B results: Min 30, Q1 45, Median 65, Q3 85, Max 100. Compare the two classes on: (a) median score, (b) spread (IQR), (c) range, (d) which class performed better overall.

79

Box Plot Interpretation

A box plot shows: Min=10, Q1=20, Median=30, Q3=45, Max=70. Circle the correct answer.

IQR = Q3 − Q1 = ?

25
40
60

Range = Max − Min = ?

60
25
35

What fraction of data is above Q3?

25%
75%
50%
80

Statistical Investigation — Plan

Create a detailed plan for a statistical investigation.

Choose a real question to investigate (e.g., 'Does exercise improve mood?'). Write: (a) your statistical question, (b) what data you need, (c) how you would collect it, (d) how you would display it, and (e) what conclusion you might expect.

84

Data in the Media — Evaluation

Critically evaluate a statistical claim from the media.

A news article states: 'Our city has the highest average house price in the country at $1.2 million.' What questions should you ask before accepting this claim? Consider: which average was used, who collected the data, when was it collected, and what is the sample size?

86

Data Literacy — Conclusion

Reflect on what you have learned about statistical investigation.

List 5 key things you have learned about statistical investigation in this worksheet. Which one surprised you most?

Describe a real-world situation where you could apply the statistical investigation skills you have learned.

TipMetacognitive reflection — thinking about your own thinking — is one of the most powerful learning strategies. Encourage honest, thoughtful responses.
88

Reading Histograms

A histogram shows test scores grouped in intervals of 10 (50–59: 4 students, 60–69: 8, 70–79: 12, 80–89: 10, 90–99: 6). Circle the correct answer.

Total number of students:

40
45
50

Modal class (most frequent interval):

70–79
80–89
60–69

What percentage of students scored 80 or above?

40%
50%
30%
91

Newspaper Statistics Critique

Read and evaluate a statistical claim.

Headline: 'New study finds students who eat breakfast score 15% higher on tests.' List at least 4 questions you should ask before accepting this claim. What other factors might explain the difference?

92

Primary vs Secondary Data

Sort each data source into the correct column.

Results of a survey you designed
Australian Bureau of Statistics census
Experiment you conducted in class
Wikipedia article statistics
Observations you recorded
School's existing test score database
Primary Data
Secondary Data
93

Bar Chart Height Sequence

A bar chart shows monthly rainfall. Each month increases by 5 mm from the previous. Fill in the missing values.

20
25
30
40
45
?
10
20
30
50
60
?
96

Real-World Application — Climate Data

Apply statistical investigation to climate data.

Monthly average temperatures (°C) for a city: Jan 28, Feb 27, Mar 25, Apr 22, May 18, Jun 15, Jul 14, Aug 16, Sep 19, Oct 22, Nov 25, Dec 27. Find the range. Calculate the median. Display the data in a line graph. Describe the seasonal trend.

Draw here
98

Favourite Fruit Survey

Read the picture graph showing favourite fruits of 40 students, then answer the questions.

Apple
Banana
Watermelon
Grape
1

If each icon = 2 students, how many prefer banana?

2

What fraction of students prefer apple?

99

Match Data Term to Definition

Draw a line to match each statistical term to its definition.

Population
Sample
Variable
Census
A survey of EVERY member of the population
Any group or collection being studied
A subset selected from the population
A characteristic that can take different values
100

Order Data Investigation Steps

Sort these steps of a complete statistical investigation into the correct order.

Analyse data and draw conclusions
Pose a statistical question
Display data in an appropriate graph
Collect and organise the data
Step 1
Step 2
Step 3
Step 4