Section 1
Classifying Data: Categorical vs. Quantitative
Property
Before creating any graph, you must identify the type of univariate data (data tracking exactly one variable) you are working with. Data falls into two main categories:
- Categorical Data (Qualitative): Deals with descriptions, names, or labels. It categorizes items but cannot be meaningfully added, subtracted, or averaged.
- Quantitative Data (Numerical): Deals with measurable quantities. These are true numbers where mathematical operations (like finding a mean or median) make logical sense.
Examples
- Categorical Data: Eye color (brown, blue, green), favorite sport, or grade level.
- Quantitative Data: Height in inches (62, 68, 71), number of pets (0, 2, 5), or test scores.
- Disguised Categories: Sometimes categorical data uses numbers as labels, like zip codes or sports jersey numbers. You cannot meaningfully calculate an "average zip code," which proves it is categorical, not quantitative.
Explanation
The key to classifying variables is asking yourself: "Does it make sense to do math with these answers?" If you ask 10 people their favorite pet, you cannot calculate the "average pet"—you can only find the mode (the most popular category). Understanding whether your data is a descriptive label or a measurable quantity is the crucial first step because it completely dictates which type of graph you are allowed to use.