2.18 Tests for normality + the central limit theorem

Although a significant result was obtained for the Shapiro-Wilk test, we choose to perform a parametric t-test to compare the BMI between the groups of Walkability (see slides session 3 part III “Central limit theorem”).

t.test( BMI ~ Walkability, data=BEPAS)

Central limit theorem

If sample sizes per subgroup are large, the central limit theorem can be applied:
- The underlying distribution of a sample mean of an outcome variable, based on a sufficiently large sample, approximates a normal distribution, regardless of the underlying distribution of the outcome variable.
“Large” sample sizes:
- 30 or 40 per group when data are approximately normally distributed
- Hundreds-> ignore the distribution
Based on simulation studies: apply tests assuming normality (=parametric tests) only when the data are approximately normally distributed.
Reference: “Normality Tests for Statistical Analysis: A Guide for Non-Statisticians” Asghar Ghasemi and Saleh Zahediasl, 2012 Reference¹.