StatsIQ · AP Statistics

Lesson 5: The Normal Distribution

Unit 1 · Phase 1 · Statistical Practice:** 3 — Analyze Data

Topics:** Density curves (area = proportion, total area = 1); the normal model N(μ, σ); the 68–95–99.7 (Empirical) Rule; standardizing with z-scores and interpreting them; finding proportions/probabilities from a normal model; finding a value given a percentile (inverse normal); assessing whether a normal model is reasonable.

Calculator:** TI-84 `normalcdf(lower, upper, μ, σ)` for proportions and `invNorm(area, μ, σ)` for values; using `−1E99` and `1E99` for open-ended tail bounds.

Objectives:

Use the 68–95–99.7 rule to estimate proportions for any normally distributed variable.
Standardize a value into a z-score and use `normalcdf` to find the proportion above, below, or between values.
Reverse the process with `invNorm` to find the data value at a given percentile.

(a) Warm-Up

A college admissions officer is staring at two students' files. One scored 1280 on the SAT. The other scored 28 on the ACT. The SAT and ACT are completely different tests on completely different scales — the SAT runs to 1600, the ACT only to 36. So here's the question that decides who looks stronger: which score is more impressive?

You can't compare 1280 to 28 directly; that's comparing apples to elevators. But you can ask: how far above average is each student, measured in standard deviations? That single idea — counting how many standard deviations a value sits from the mean — is the engine behind nearly everything in this lesson.

Most of the variables you'll meet in AP Stats (heights, test scores, battery lifetimes, reaction times) pile up in a familiar bell shape: lots of values near the middle, fewer out at the extremes. When a variable follows that bell shape, one model unlocks every "what proportion…?" question you can throw at it. Let's build it.

(b) Core Concept

Density curves: area is everything

Before the bell curve, meet its parent idea. A density curve is a smooth curve drawn over a histogram that describes the overall pattern of a distribution. It has two non-negotiable properties:

The curve is always on or above the horizontal axis (it never dips negative).
The total area under the curve equals exactly 1.

That second rule is the whole point. Because the total area is 1, the area under the curve over any interval equals the proportion of values that fall in that interval. Area = proportion. If 23% of the area sits to the left of some value, then 23% of the data is below that value. A density curve is an idealized description of a distribution — it smooths over the bumps of a real histogram so we can do clean math.

The normal model: N(μ, σ)

The most important density curve in statistics is the normal curve — the symmetric, bell-shaped one. A normal distribution is completely described by just two numbers, so we write it as N(μ, σ):

μ (mu) is the mean, which sits at the center and at the peak (and equals the median, by symmetry).
σ (sigma) is the standard deviation, which controls the spread — the distance from the center to the inflection points where the curve switches from bending down to bending out.

[GRAPH: A symmetric bell-shaped normal curve over a horizontal axis. Center peak labeled μ. Vertical dashed lines at μ−σ, μ−2σ, μ−3σ, μ+σ, μ+2σ, μ+3σ. The region between μ−σ and μ+σ is shaded, labeled "68% of the data." Axis ticks read μ−3σ, μ−2σ, μ−σ, μ, μ+σ, μ+2σ, μ+3σ.]

A larger σ makes the curve shorter and wider; a smaller σ makes it taller and narrower. But every normal curve has the same shape, which is why one rule works for all of them.

The 68–95–99.7 (Empirical) Rule

For any normal distribution, the proportion of data within a set number of standard deviations of the mean is fixed:

≈ 68% of values fall within 1 standard deviation of the mean (between μ−σ and μ+σ)
≈ 95% of values fall within 2 standard deviations (between μ−2σ and μ+2σ)
≈ 99.7% of values fall within 3 standard deviations (between μ−3σ and μ+3σ)

Because the curve is symmetric, you can slice these further. Outside ±2σ lies 100% − 95% = 5%, split evenly into 2.5% in each tail.

Worked empirical-rule example. Resting heart rates for healthy adults are approximately N(72, 8) beats per minute. What proportion of adults have a resting heart rate between 56 and 88 bpm?

Notice 56 = 72 − 2(8) and 88 = 72 + 2(8). So 56 and 88 are exactly 2 standard deviations below and above the mean. By the 95 part of the rule, about 95% of healthy adults have a resting heart rate between 56 and 88 bpm.

Push it further: what proportion is above 88? Above +2σ is half of the leftover 5%, so about 2.5%.

The empirical rule is your fast, no-calculator sanity check. But it only works for "nice" values landing on whole standard-deviation marks. For everything else, we standardize.

Standardizing: the z-score

The z-score answers the Warm-Up question directly: how many standard deviations is a value from the mean?

z = (x − μ) / σ

A positive z-score means the value is above the mean; negative means below; z = 0 means exactly average. A z-score has no units — it's pure "standard deviations from the mean" — which is precisely what lets us compare an SAT score to an ACT score.

Back to our two students. SAT scores are about N(1050, 200); ACT scores are about N(21, 5.7).

SAT 1280:  z = (1280 − 1050) / 200 = 230 / 200  = 1.15
ACT 28:    z = (28 − 21) / 5.7     = 7 / 5.7    ≈ 1.23

The ACT student sits 1.23 standard deviations above average versus 1.15 for the SAT student. The ACT score is (slightly) more impressive. Same scale, fair fight.

From z-score to proportion: normalcdf

Once you can standardize, you can find the exact proportion below, above, or between any values. On the TI-84, the workhorse is:

normalcdf(lower, upper, μ, σ) — returns the proportion of a normal distribution between lower and upper.

The catch is the bounds for open-ended tails. "Below" has no real lower limit, and "above" has no real upper limit. We feed the calculator a number so extreme it acts like infinity:

For a left tail ("less than x"): normalcdf(−1E99, x, μ, σ)
For a right tail ("greater than x"): normalcdf(x, 1E99, μ, σ)

(On the TI-84, type 1E99 using the 2nd → , (EE) key, not the caret. −1E99 is negative one followed by EE 99.)

Worked z-score-to-proportion example. A brand of AA battery has lifetimes that are approximately N(500, 50) minutes. What proportion of batteries last longer than 540 minutes?

By hand (the z-score story):

`

z = (540 − 500) / 50 = 40 / 50 = 0.80

`

We want the area to the right of z = 0.80.

On the calculator:

`

TI-84: normalcdf(540, 1E99, 500, 50)

Output: 0.2119

`

About 21.2% of batteries last longer than 540 minutes. (You can also feed the standardized version, normalcdf(0.80, 1E99, 0, 1), and get the same 0.2119 — using μ and σ directly just skips a step.)

That's the full toolkit for proportions. Now we run it backward.

From percentile to value: invNorm

Sometimes you know the proportion and want the value. "What battery lifetime marks the top 5%?" "What SAT score is the 90th percentile?" These are inverse normal problems, and they use a different command:

invNorm(area, μ, σ) — returns the value with that much area (proportion) to its left.

The single most important rule: invNorm always works with the area to the LEFT of the value you want. If a problem gives you a right-tail proportion, subtract from 1 first.

Worked inverse example. For the SAT, N(1050, 200), what score is the 90th percentile?

The 90th percentile has 0.90 of the area to its left — already a left area, so no subtraction needed.

`

TI-84: invNorm(0.90, 1050, 200)

Output: 1306.31

`

A score of about 1306 marks the 90th percentile. A student at the 90th percentile scored higher than 90% of test-takers.

Is a normal model even reasonable?

A normal model is an assumption, not a law of nature — and using it on the wrong data gives confidently wrong answers. Before you trust normalcdf, ask: is the distribution roughly symmetric and bell-shaped, with no strong skew or major outliers? A histogram or dotplot that's heavily right-skewed (incomes, home prices) or clearly bimodal is not well described by a normal model, and the empirical rule won't hold for it. Many real variables — heights, standardized test scores, measurement errors — are close enough to normal that the model works beautifully. Just don't apply it blindly.

(c) Worked Examples

Example 1 (Empirical rule — easy)

Problem. The heights of adult women in the U.S. are approximately N(64, 2.7) inches. Between what two heights do the middle 95% of women fall?

Strategy. The middle 95% is captured by ±2σ (the 95 part of the empirical rule).

Solution.

μ − 2σ = 64 − 2(2.7) = 64 − 5.4 = 58.6
μ + 2σ = 64 + 2(2.7) = 64 + 5.4 = 69.4

Interpretation. About 95% of adult women are between 58.6 and 69.4 inches tall.

Example 2 (Proportion between two values — medium)

Problem. Using the same model, N(64, 2.7) for women's heights, what proportion of women are between 60 and 66 inches tall?

Strategy. These aren't whole-σ marks, so the empirical rule won't pin it down. Standardize both endpoints, then use normalcdf.

Solution.

z₆₀ = (60 − 64) / 2.7 = −4 / 2.7  ≈ −1.48
z₆₆ = (66 − 64) / 2.7 =  2 / 2.7  ≈  0.74

TI-84: normalcdf(60, 66, 64, 2.7)
Output: 0.7013

Interpretation. About 70.1% of adult women are between 60 and 66 inches tall. (Sanity check: the interval spans roughly from −1.5σ to +0.7σ, so a value around 0.70 is exactly what we'd expect.)

Example 3 (Inverse normal — "find the cutoff")

Problem. A manufacturer wants to advertise a minimum battery life that 95% of its batteries beat. Lifetimes are N(500, 50) minutes. What lifetime should appear on the package?

Strategy. "95% beat it" means 95% lie above the cutoff, so only 5% lie below it. invNorm needs the LEFT area, which is 0.05.

Solution.

TI-84: invNorm(0.05, 500, 50)
Output: 417.76

Interpretation. About 418 minutes. If they advertise 418 minutes, roughly 95% of batteries will exceed that claim. (Trap avoided: feeding 0.95 would give 582 minutes — the top cutoff, the opposite of what's wanted.)

Example 4 (Comparing z-scores across different distributions — AP-style)

Problem. Maria scored 1280 on the SAT, N(1050, 200). Her friend Devon scored 28 on the ACT, N(21, 5.7). Colleges treat both as equivalent admissions data. Whose performance is stronger, and approximately what percentile did each reach?

Strategy. Standardize each score to compare on a common scale, then convert each z-score to a percentile with normalcdf (left tail).

Solution.

Maria (SAT):  z = (1280 − 1050) / 200 = 1.15
Devon (ACT):  z = (28 − 21) / 5.7    ≈ 1.23

Percentiles (proportion scoring at or below):
  normalcdf(−1E99, 1.15, 0, 1) = 0.8749   → ~87th percentile
  normalcdf(−1E99, 1.23, 0, 1) = 0.8907   → ~89th percentile

Interpretation. Devon's z-score (1.23) is higher than Maria's (1.15), so Devon's performance is slightly stronger, landing around the 89th percentile versus Maria's 87th. The raw scores (28 vs. 1280) are uncomparable, but z-scores put them on one honest scale.

(d) Common Mistakes

1. Forgetting to standardize (or comparing raw values across distributions). Students compare 1280 to 28 directly, or plug a raw value into a z-table without converting. Fix: whenever distributions differ, convert to z-scores first — z = (x − μ) / σ — so everything is in the same "standard deviations from the mean" currency.

2. Swapping invNorm and normalcdf. Use normalcdf when you have a value and want a proportion; use invNorm when you have a proportion and want a value. Fix: ask "what does the question give me, and what does it want back?" Given x → want area? normalcdf. Given area → want x? invNorm.

3. Using the wrong tail bound. Writing normalcdf(540, 500, 50) (only three arguments, no real upper bound) or using 0 instead of 1E99. Fix: a right tail needs a huge upper bound: normalcdf(540, 1E99, 500, 50). A left tail needs a huge negative lower bound: normalcdf(−1E99, x, μ, σ).

4. Feeding invNorm a right-tail area. For "the top 10%," students type invNorm(0.10, …) and get the bottom cutoff. Fix: invNorm always uses area to the LEFT. Top 10% means 0.90 to the left, so use invNorm(0.90, …).

5. Mixing up "% below" and "% above." Reporting the left-tail area when the question asked "what proportion exceed…". Fix: the two always sum to 1. If normalcdf gives you the area below and the question wants above, subtract from 1 (or reverse the bounds).

(e) Practice Problems

Assume each variable is approximately normal. MC items have exactly 4 choices.

Question 1

Adult male heights are N(69, 2.8) inches. The empirical rule says about 68% of men are between which two heights?

(A) 63.4 and 74.6
(B) 66.2 and 71.8
(C) 60.6 and 77.4
(D) 67.0 and 71.0

Question 2

For the same N(69, 2.8) model, what proportion of men are taller than 72 inches?

(A) 0.142
(B) 0.858
(C) 0.250
(D) 0.071

Question 3

IQ scores are N(100, 15). What proportion of people have an IQ above 130?

(A) 0.0228
(B) 0.9772
(C) 0.0456
(D) 0.1587

Question 4

Which TI-84 command finds the proportion of a N(500, 50) distribution that is less than 460?

(A) invNorm(460, 500, 50)
(B) normalcdf(460, 1E99, 500, 50)
(C) normalcdf(−1E99, 460, 500, 50)
(D) invNorm(0.460, 500, 50)

Question 5

A pizzeria's delivery times are N(28, 6) minutes. What proportion of deliveries take between 20 and 30 minutes?

(A) 0.539
(B) 0.461
(C) 0.369
(D) 0.631

Question 6

For the delivery model N(28, 6), the slowest 10% of deliveries take longer than what time? (Find the cutoff.)

(A) 20.3 min
(B) 35.7 min
(C) 33.8 min
(D) 28.0 min

Question 7

Reaction times in a driving simulator are N(0.21, 0.04) seconds. What proportion of drivers react faster than 0.15 seconds (i.e., less than 0.15)?

(A) 0.0668
(B) 0.9332
(C) 0.4332
(D) 0.1500

Question 8

Two students take different exams. Ana scores 82 on a test that is N(75, 7). Ben scores 88 on a test that is N(80, 10). Whose score is more impressive, and why?

(A) Ana, because her z-score (1.0) exceeds Ben's (0.8)
(B) Ben, because 88 > 82
(C) Ben, because his z-score (1.0) exceeds Ana's (0.8)
(D) They are equal, because both beat their means

Question 9

SAT scores are N(1050, 200). What score marks the 80th percentile?

(A) 1218
(B) 1218 only if you use invNorm(0.20)
(C) 882
(D) 1310

Question 10

A density curve has total area equal to ____, and the area over an interval equals the ____ of values in that interval.

(A) 100; count
(B) 1; proportion
(C) σ; mean
(D) μ; median

Question 11

Salaries at a large company are strongly right-skewed. Why is a normal model a poor choice here?

(A) The mean is too large
(B) Normal models require integer data
(C) The empirical rule and normalcdf assume a symmetric bell shape, which skewed data violate
(D) Salaries can't be standardized

(Interpretation) The 25th percentile of IQ scores, N(100, 15), is about 89.9. Write a sentence interpreting this value in context.

(Interpretation) A student computes normalcdf(72, 1E99, 69, 2.8) = 0.142 for adult male heights. Explain in one sentence what 0.142 means here.

Question 14

Cholesterol readings are N(200, 25) mg/dL. What proportion of readings fall between 150 and 250? (No calculator — use the empirical rule.)

(A) 0.68
(B) 0.95
(C) 0.997
(D) 0.34

Question 15

A teacher wants to give A's to the top 15% of scores on a test that is N(78, 9). What is the minimum score for an A?

(A) 68.7
(B) 87.3
(C) 90.0
(D) 78.0

🔑 Answer Key

1. (B) 66.2 and 71.8. The empirical rule's 68% covers ±1σ: 69 − 2.8 = 66.2 and 69 + 2.8 = 71.8.

- (A) is ±2σ (the 95% interval). (C) is ±3σ. (D) wrongly uses ±0.7 instead of ±σ.

2. (A) 0.142. z = (72 − 69)/2.8 = 1.07. Right tail: normalcdf(72, 1E99, 69, 2.8) = 0.1423.

- (B) 0.858 is the area below 72 — answers "shorter than." (C) 0.250 is a guess from the empirical rule misapplied. (D) 0.071 halves the correct answer.

3. (A) 0.0228. z = (130 − 100)/15 = 2.0. Above +2σ is half of the leftover 5%, ≈ 2.5%; exactly, normalcdf(130, 1E99, 100, 15) = 0.0228.

- (B) is the area below 130. (C) doubles the answer. (D) is the area above +1σ, not +2σ.

4. (C) normalcdf(−1E99, 460, 500, 50). "Less than 460" is a left tail: lower bound −1E99, upper bound 460. Output = 0.2119.

- (A)/(D) use invNorm, which returns a value, not a proportion. (B) has the bounds set for a right tail ("greater than 460").

5. (A) 0.539. z₂₀ = (20−28)/6 = −1.33, z₃₀ = (30−28)/6 = 0.33. normalcdf(20, 30, 28, 6) = 0.5393.

- (B) 0.461 is the complement (outside the interval). (C)/(D) come from dropping or misplacing an endpoint.

6. (B) 35.7 min. "Slowest 10% take longer than" = the 90th percentile (0.90 to the left). invNorm(0.90, 28, 6) = 35.69.

- (A) 20.3 is invNorm(0.10,…) — the fastest 10% cutoff, the wrong tail. (C) 33.8 misreads σ. (D) is the mean.

7. (A) 0.0668. z = (0.15 − 0.21)/0.04 = −1.50. Left tail: normalcdf(−1E99, 0.15, 0.21, 0.04) = 0.0668.

- (B) 0.9332 is "slower than 0.15" (the complement). (C) 0.4332 is the area between the mean and z = −1.5 only. (D) just restates the value.

8. (A) Ana, because her z-score (1.0) exceeds Ben's (0.8). Ana: z = (82 − 75)/7 = 1.0. Ben: z = (88 − 80)/10 = 0.8. Ana sits farther above her test's mean, so her score is more impressive.

- (B) compares raw scores across different scales — invalid. (C) reverses the z-scores. (D) ignores how far above the mean each scored.

9. (A) 1218. 80th percentile has 0.80 to the left: invNorm(0.80, 1050, 200) = 1218.4.

- (B) describes a wrong method (invNorm(0.20) gives the 20th percentile, 882). (C) 882 is the 20th percentile. (D) 1310 is roughly the 90th percentile.

10. (B) 1; proportion. A density curve has total area exactly 1, and area over an interval equals the proportion of data there.

- The other options confuse area with counts, parameters, or measures of center.

11. (C). normalcdf and the empirical rule are built on a symmetric bell shape; strong right skew violates that assumption, so the model's proportions would be wrong.

- (A) and (D) are false; (B) is not a requirement of normal models.

12. Sample answer: "About 25% of people have an IQ score below 89.9; equivalently, 89.9 is the score that separates the lowest-scoring 25% from the rest." (invNorm(0.25, 100, 15) = 89.88.)

13. Sample answer: "About 14.2% of adult men are taller than 72 inches" — the right-tail area is the proportion of the distribution above 72.

14. (B) 0.95. 150 = 200 − 2(25) = μ − 2σ and 250 = 200 + 2(25) = μ + 2σ, so 150 and 250 are exactly ±2σ, capturing the middle 95%.

- (A) 0.68 is the ±1σ interval (175 to 225). (C) 0.997 is ±3σ. (D) 0.34 is half of 68%.

15. (B) 87.3. Top 15% means 0.85 to the left: invNorm(0.85, 78, 9) = 87.33.

- (A) 68.7 is invNorm(0.15,…) — the bottom cutoff, wrong tail. (C) 90.0 is an arbitrary round number. (D) is the mean.

---

StatsIQ · Lesson 5 of 30 · Unit 1: Exploring One-Variable Data & Collecting Data · Phase 1: Data & Design

This lesson is for exam-preparation purposes and is not affiliated with or endorsed by the College Board. "AP" is a registered trademark of the College Board. All statistical computations in this lesson have been independently recomputed and reviewed for accuracy; normalcdf and invNorm outputs are reported to the stated decimal places.

← Lesson 4

Lesson 6 →