Learn on PengiIllustrative Mathematics, Grade 6Unit 8 Data Sets and Distributions

Lesson 1: Data, Variability, and Statistical Questions

In this Grade 6 Illustrative Mathematics lesson from Unit 8, students explore the foundational concepts of data by distinguishing between numerical data and categorical data, interpreting dot plots, and evaluating what makes a valid statistical question. Through a class survey activity, students collect real measurements such as height, travel time, and sleep hours, then classify survey responses by data type and unit of measurement.

Section 1

Classifying Data: Categorical vs. Quantitative

Property

Before creating any graph, you must identify the type of univariate data (data tracking exactly one variable) you are working with. Data falls into two main categories:

  • Categorical Data (Qualitative): Deals with descriptions, names, or labels. It categorizes items but cannot be meaningfully added, subtracted, or averaged.
  • Quantitative Data (Numerical): Deals with measurable quantities. These are true numbers where mathematical operations (like finding a mean or median) make logical sense.

Examples

  • Categorical Data: Eye color (brown, blue, green), favorite sport, or grade level.
  • Quantitative Data: Height in inches (62, 68, 71), number of pets (0, 2, 5), or test scores.
  • Disguised Categories: Sometimes categorical data uses numbers as labels, like zip codes or sports jersey numbers. You cannot meaningfully calculate an "average zip code," which proves it is categorical, not quantitative.

Explanation

The key to classifying variables is asking yourself: "Does it make sense to do math with these answers?" If you ask 10 people their favorite pet, you cannot calculate the "average pet"—you can only find the mode (the most popular category). Understanding whether your data is a descriptive label or a measurable quantity is the crucial first step because it completely dictates which type of graph you are allowed to use.

Section 2

Numerical Data: Quantities, Not Labels

Property

A set of numbers, D={d1,d2,...,dn}D = \{d_1, d_2, ..., d_n\}, is numerical data only if it represents measurable or countable quantities. A key test is whether the average, dˉ=din\bar{d} = \frac{{\sum d_i}}{n}, is a meaningful value. If the numbers are just labels, the data is categorical.

Examples

Section 3

Interpreting Dot Plot Shapes

Property

A dot plot displays numerical data by placing dots above a number line at each data value's location.
When interpreting dot plots, we analyze their shape by identifying key features: clusters (groups of data points close together), gaps (intervals with no data), peaks (values with the highest frequency), and outliers (values significantly separated from the main data).
These shape characteristics help us understand the distribution and patterns within the dataset.

Examples

Section 4

Introduction to Variability

Property

Variability is a measure of how much samples or data differ from each other.
Understanding variability in samplings allows students the opportunity to estimate or even measure the differences.
Gauging how far off an estimate or prediction might be is a way to address issues of variation in samples.

Examples

  • One random sample of 10 students reads an average of 3 books a month. A second sample of 10 students reads an average of 4 books. The difference is due to sampling variability.
  • A pollster asks two separate random groups of 100 people if they like a new movie. In the first group, 65% say yes. In the second group, 70% say yes. This 5% difference shows variability.
  • If you roll a die 12 times, you expect to get two 4s. But one time you might get one 4, and the next you might get three. This fluctuation is variability.

Explanation

Variability means that different samples from the same population won't be exactly alike. It's the natural, expected difference between samples. Understanding it helps you know how much you can trust your predictions and how much they might change.

Book overview

Jump across lessons in the current chapter without opening the full course modal.

Continue this chapter

Unit 8 Data Sets and Distributions

  1. Lesson 1Current

    Lesson 1: Data, Variability, and Statistical Questions

  2. Lesson 2

    Lesson 2: Dot Plots and Histograms

  3. Lesson 3

    Lesson 3: Measures of Center and Variability

  4. Lesson 4

    Lesson 4: Median and IQR

Lesson overview

Expand to review the lesson summary and core properties.

Expand

Section 1

Classifying Data: Categorical vs. Quantitative

Property

Before creating any graph, you must identify the type of univariate data (data tracking exactly one variable) you are working with. Data falls into two main categories:

  • Categorical Data (Qualitative): Deals with descriptions, names, or labels. It categorizes items but cannot be meaningfully added, subtracted, or averaged.
  • Quantitative Data (Numerical): Deals with measurable quantities. These are true numbers where mathematical operations (like finding a mean or median) make logical sense.

Examples

  • Categorical Data: Eye color (brown, blue, green), favorite sport, or grade level.
  • Quantitative Data: Height in inches (62, 68, 71), number of pets (0, 2, 5), or test scores.
  • Disguised Categories: Sometimes categorical data uses numbers as labels, like zip codes or sports jersey numbers. You cannot meaningfully calculate an "average zip code," which proves it is categorical, not quantitative.

Explanation

The key to classifying variables is asking yourself: "Does it make sense to do math with these answers?" If you ask 10 people their favorite pet, you cannot calculate the "average pet"—you can only find the mode (the most popular category). Understanding whether your data is a descriptive label or a measurable quantity is the crucial first step because it completely dictates which type of graph you are allowed to use.

Section 2

Numerical Data: Quantities, Not Labels

Property

A set of numbers, D={d1,d2,...,dn}D = \{d_1, d_2, ..., d_n\}, is numerical data only if it represents measurable or countable quantities. A key test is whether the average, dˉ=din\bar{d} = \frac{{\sum d_i}}{n}, is a meaningful value. If the numbers are just labels, the data is categorical.

Examples

Section 3

Interpreting Dot Plot Shapes

Property

A dot plot displays numerical data by placing dots above a number line at each data value's location.
When interpreting dot plots, we analyze their shape by identifying key features: clusters (groups of data points close together), gaps (intervals with no data), peaks (values with the highest frequency), and outliers (values significantly separated from the main data).
These shape characteristics help us understand the distribution and patterns within the dataset.

Examples

Section 4

Introduction to Variability

Property

Variability is a measure of how much samples or data differ from each other.
Understanding variability in samplings allows students the opportunity to estimate or even measure the differences.
Gauging how far off an estimate or prediction might be is a way to address issues of variation in samples.

Examples

  • One random sample of 10 students reads an average of 3 books a month. A second sample of 10 students reads an average of 4 books. The difference is due to sampling variability.
  • A pollster asks two separate random groups of 100 people if they like a new movie. In the first group, 65% say yes. In the second group, 70% say yes. This 5% difference shows variability.
  • If you roll a die 12 times, you expect to get two 4s. But one time you might get one 4, and the next you might get three. This fluctuation is variability.

Explanation

Variability means that different samples from the same population won't be exactly alike. It's the natural, expected difference between samples. Understanding it helps you know how much you can trust your predictions and how much they might change.

Book overview

Jump across lessons in the current chapter without opening the full course modal.

Continue this chapter

Unit 8 Data Sets and Distributions

  1. Lesson 1Current

    Lesson 1: Data, Variability, and Statistical Questions

  2. Lesson 2

    Lesson 2: Dot Plots and Histograms

  3. Lesson 3

    Lesson 3: Measures of Center and Variability

  4. Lesson 4

    Lesson 4: Median and IQR