Review Solutions — Bivariate Data and Scatter Plots
-
Correlation basics. Fluency
- (a) r=0.91:
- (b) r=−0.35:
- (c) Stronger:
- (d) r=0 means unrelated?:
-
Lines of best fit. Fluency
- (a) ŷ=15+4(6):
- (b) Through (2,20) and (8,44):
- (c) Slope −3 meaning:
- (d) Actual 52, predicted 47:
-
r² and interpretation. Fluency
- (a) r=0.9, r²:
- (b) r²=0.36, find |r|:
- (c) r=−0.7, unexplained %:
- (d) r²=0.02, good predictor?:
-
Interpolation, extrapolation, causation. Fluency
- (a) x=18, range 5–25:
- (b) x=40:
- (c) Shoe size vs maths ability, ages 6–14:
- (d) High r with plausible causation:
-
Advertising vs sales scatter plot. Understanding
- (a) Correlation:
- (b) Equation:
- (c) r=0.98, r² in context:
- (d) Causation comment:
-
Two-way table (phone use vs sleep). Understanding
- (a) High users, good sleep %:
- (b) Low users, good sleep %:
- (c) Association?:
- (d) Causation? Better study design?:
-
Pages vs reading time (ŷ=3.5+0.12x). Understanding
- (a) 300-page book:
- (b) Slope meaning:
- (c) 700-page book:
- (d) Residual for 250-page reader (40 hrs):
-
Outlier investigation. Understanding
- (a) Outlier weakens r from 0.94 to 0.82:
- (b) Small sample impact:
- (c) Data entry error:
- (d) When to keep outlier:
-
Rainfall vs wheat yield (r=0.87). Problem Solving
- (a) r² and variation explained:
- (b) 450 mm prediction:
- (c) 800 mm — two reliability concerns:
- (d) Other variables affecting wheat yield:
-
Class size vs test score (r=−0.68). Problem Solving
- (a) Variables:
- (b) Slope meaning:
- (c) 30→20 students, predicted change:
- (d) Two confounders:
-
Mean point and line equation (perfect linear data). Problem Solving
- (a) ¯x and ¯y:
- (b) Slope estimate:
- (c) y-intercept:
- (d) Verify all 5 points; value of r:
-
Full bivariate report (exercise vs heart rate). Problem Solving
- (a) Full description:
- (b) Predict at 5 hrs:
- (c) Predict at 15 hrs:
- (d) Caution statement: