Topic Review — Interval Estimates for Proportions — Solutions
← Interval Estimates for Proportions
This review covers all lessons in this topic: confidence intervals for proportions, margin of error and sample size, and applications. Click each answer to reveal the worked solution.
Review Questions
- Define the margin of error in the context of a confidence interval for a proportion.
The margin of error (ME) is the maximum expected difference between the sample proportion p̂ and the true population proportion p, at a given confidence level. It equals the half-width of the confidence interval:
ME = z* × √(p̂(1−p̂)/n)
The CI can be written as p̂ − ME < p < p̂ + ME. - A sample of 300 people finds 192 prefer online shopping. Construct a 95% confidence interval for the true proportion.
p̂ = 192/300 = 0.64
Conditions: np̂ = 192 ≥ 5 ✓; n(1−p̂) = 108 ≥ 5 ✓
SE = √(0.64 × 0.36 / 300) = √(0.000768) ≈ 0.02771
ME = 1.96 × 0.02771 ≈ 0.05431
95% CI: (0.586, 0.694)
We are 95% confident the true proportion who prefer online shopping is between 58.6% and 69.4%. - Explain the correct interpretation of a 95% confidence interval. What does 95% refer to?
A 95% CI does not mean there is a 95% probability that the true p lies in this specific interval. Once computed, the interval either contains p or it doesn’t.
The 95% refers to the procedure: if we took many repeated random samples and constructed a CI from each, approximately 95% of those intervals would contain the true p. - State the three critical values z* and their associated confidence levels.
90% confidence: z* = 1.645
95% confidence: z* = 1.96
99% confidence: z* = 2.576
These values come from the standard normal distribution: z* is the value such that P(−z* < Z < z*) equals the confidence level. - A 95% CI for p is (0.33, 0.49). Find the sample proportion p̂, the margin of error, and back-calculate the approximate sample size.
p̂ = (0.33 + 0.49)/2 = 0.41
ME = (0.49 − 0.33)/2 = 0.08
Back-calculating n:
0.08 = 1.96 × √(0.41 × 0.59/n)
0.04082 = √(0.2419/n)
0.001666 = 0.2419/n
n ≈ 145.2 → n ≈ 146 - Check whether the normality conditions are met for n = 80 and p̂ = 0.06.
Conditions: np̂ ≥ 5 and n(1−p̂) ≥ 5
np̂ = 80 × 0.06 = 4.8 < 5 ✗
Conditions are NOT met. The expected number of successes (4.8) is below 5, so the normal approximation to the distribution of p̂ is unreliable. A CI based on the normal formula should not be used; a larger sample or alternative method is needed. - A researcher wants ME ≤ 0.04 with 95% confidence. No prior estimate of p is available. Find the minimum sample size.
Use conservative p̂ = 0.5:
n ≥ (1.96)² × 0.5 × 0.5 / (0.04)²
n ≥ 3.8416 × 0.25 / 0.0016
n ≥ 0.9604 / 0.0016 = 600.25
Minimum sample size: n = 601 - A manufacturer says its product has a 15% return rate. A random sample of 200 purchases finds 24 returns. Construct a 95% CI. Is the manufacturer’s claim consistent with the data?
p̂ = 24/200 = 0.12
SE = √(0.12 × 0.88 / 200) = √(0.000528) ≈ 0.02298
ME = 1.96 × 0.02298 ≈ 0.04503
95% CI: (0.075, 0.165)
The claimed value of 15% (0.15) lies inside the CI (0.075, 0.165). The data are consistent with the manufacturer’s claim. - Explain why using p̂ = 0.5 in the sample size formula gives the most conservative (largest) estimate of n.
The required sample size formula is n ≥ (z*)² × p̂(1−p̂) / ME². The factor p̂(1−p̂) is maximised when p̂ = 0.5, where p̂(1−p̂) = 0.25. Any other value of p̂ gives a smaller product (e.g., p̂ = 0.3 gives 0.21, p̂ = 0.1 gives 0.09). Using p̂ = 0.5 therefore produces the largest n, guaranteeing that the desired ME is achieved regardless of the actual value of p.
- A pilot study estimates p̂ ≈ 0.72. A researcher wants a 99% CI with ME ≤ 0.03. Find the required sample size.
z* = 2.576, ME = 0.03, p̂ = 0.72
n ≥ (2.576)² × 0.72 × 0.28 / (0.03)²
n ≥ 6.6355 × 0.2016 / 0.0009
n ≥ 1.3377 / 0.0009
n ≥ 1486.3
Minimum sample size: n = 1487 - Two schools are surveyed about homework preferences. School 1: n = 150, 87 prefer less homework. School 2: n = 150, 72 prefer less homework. Construct 95% CIs for each and determine whether there is evidence of a genuine difference.
School 1: p̂1 = 87/150 = 0.58
SE1 = √(0.58 × 0.42/150) = √(0.001624) ≈ 0.04030
95% CI: (0.58 − 0.079, 0.58 + 0.079) = (0.501, 0.659)
School 2: p̂2 = 72/150 = 0.48
SE2 = √(0.48 × 0.52/150) = √(0.001664) ≈ 0.04080
95% CI: (0.48 − 0.080, 0.48 + 0.080) = (0.400, 0.560)
The intervals overlap (School 1 lower: 0.501; School 2 upper: 0.560). There is insufficient evidence to conclude a genuine difference at the 95% level. - A current 95% CI uses n = 500. How large a sample is needed to reduce the margin of error by one third (to 2/3 of its current value)?
ME ∝ 1/√n
To get MEnew = (2/3) MEold:
1/√nnew = (2/3) × 1/√nold
√nnew = (3/2) × √nold
nnew = (3/2)² × nold = 2.25 × 500 = 1125
A sample of 1125 is needed to reduce the margin of error to two-thirds of the current value. - A survey of 600 Australians finds 420 support a new environmental policy.
- (a) Construct a 90% CI.
- (b) Construct a 99% CI.
- (c) Compare the widths. What trade-off does this illustrate?
p̂ = 420/600 = 0.70
SE = √(0.70 × 0.30 / 600) = √(0.00035) ≈ 0.01871
(a) 90% CI: ME = 1.645 × 0.01871 ≈ 0.03078
CI: (0.669, 0.731)
(b) 99% CI: ME = 2.576 × 0.01871 ≈ 0.04819
CI: (0.652, 0.748)
(c) Width of 90% CI ≈ 0.062; Width of 99% CI ≈ 0.096
The 99% CI is about 55% wider. This illustrates the trade-off between confidence and precision: higher confidence requires a wider interval for the same sample data. - An online news poll of 2000 respondents reports 58% support for a proposition with ME = ±2.2%. A statistician says the poll is unreliable. Give one reason why.
An online news poll suffers from voluntary response bias (self-selection bias). Only people who choose to respond participate — typically those with strong opinions. This makes the sample unrepresentative of the general population. The margin of error calculation assumes a random sample; applying it to a non-random sample produces a misleading measure of precision. The stated ±2.2% refers only to sampling variability, not to bias from non-random sampling.
- A 95% CI based on p̂ = 0.48 and n = 400 is constructed. Without recalculating, what would happen to this interval if the confidence level were changed to 90%? To 99%? Describe the effect on both the margin of error and the interval width.
The SE = √(0.48 × 0.52/400) ≈ 0.02497 is fixed (depends only on p̂ and n).
90% CI: z* decreases from 1.96 to 1.645. ME decreases from 1.96 × 0.02497 ≈ 0.04894 to 1.645 × 0.02497 ≈ 0.04108. The interval narrows (less certain, more precise).
99% CI: z* increases from 1.96 to 2.576. ME increases to 2.576 × 0.02497 ≈ 0.06432. The interval widens (more certain, less precise).
Summary: lower confidence → narrower interval; higher confidence → wider interval.