ConceptComplete

Common Hypothesis Tests

We survey the most important parametric hypothesis tests, each designed for specific situations regarding the population distribution, sample size, and known/unknown parameters.


Tests for Means

Definition

The Student's tt-test is used for testing the mean when the variance is unknown:

  • One-sample: H0:μ=μ0H_0: \mu = \mu_0. Test statistic: T=Xˉμ0S/ntn1T = \frac{\bar{X} - \mu_0}{S/\sqrt{n}} \sim t_{n-1} under H0H_0 (assuming normality).
  • Two-sample (equal variances): H0:μ1=μ2H_0: \mu_1 = \mu_2. Test statistic: T=Xˉ1Xˉ2Sp1/n1+1/n2T = \frac{\bar{X}_1 - \bar{X}_2}{S_p\sqrt{1/n_1 + 1/n_2}} where Sp2=(n11)S12+(n21)S22n1+n22S_p^2 = \frac{(n_1-1)S_1^2 + (n_2-1)S_2^2}{n_1 + n_2 - 2} is the pooled variance. Under H0H_0: Ttn1+n22T \sim t_{n_1+n_2-2}.
  • Paired: H0:μD=0H_0: \mu_D = 0 where Di=XiYiD_i = X_i - Y_i. Apply the one-sample tt-test to the differences.
ExampleOne-sample $t$-test

A manufacturer claims light bulbs last μ0=1000\mu_0 = 1000 hours. A sample of n=16n = 16 gives xˉ=980\bar{x} = 980, s=40s = 40. Test at α=0.05\alpha = 0.05: T=980100040/16=2010=2.0T = \frac{980 - 1000}{40/\sqrt{16}} = \frac{-20}{10} = -2.0 With t15,0.025=2.131t_{15, 0.025} = 2.131, since 2.0<2.131|-2.0| < 2.131, we fail to reject H0H_0.


Tests for Variances and Proportions

Definition

Key tests for other parameters:

  • Chi-squared test for variance: H0:σ2=σ02H_0: \sigma^2 = \sigma_0^2. Statistic: χ2=(n1)S2σ02χn12\chi^2 = \frac{(n-1)S^2}{\sigma_0^2} \sim \chi^2_{n-1}
  • F-test for equality of variances: H0:σ12=σ22H_0: \sigma_1^2 = \sigma_2^2. Statistic: F=S12/S22Fn11,n21F = S_1^2/S_2^2 \sim F_{n_1-1, n_2-1}
  • Z-test for proportion: H0:p=p0H_0: p = p_0. Statistic: Z=p^p0p0(1p0)/nN(0,1)Z = \frac{\hat{p} - p_0}{\sqrt{p_0(1-p_0)/n}} \sim N(0,1) approximately

Goodness of Fit

RemarkChi-squared goodness-of-fit test

Pearson's chi-squared test assesses whether observed frequencies match expected frequencies under a hypothesized distribution. The test statistic χ2=i=1k(OiEi)2Ei\chi^2 = \sum_{i=1}^k \frac{(O_i - E_i)^2}{E_i} has an approximate χk1p2\chi^2_{k-1-p} distribution, where kk is the number of categories and pp is the number of estimated parameters. Large values of χ2\chi^2 indicate poor fit.