Two sample t test - equal variances assumed overview

This page offers structured overviews of one or more selected methods. Add additional methods for comparisons (max. of 3) by clicking on the dropdown button in the right-hand column. To practice with a specific method click the button at the bottom row of the table

Two sample $t$ test - equal variances assumed	McNemar's test	Friedman test
Independent/grouping variable	Independent variable	Independent/grouping variable
One categorical with 2 independent groups	2 paired groups	One within subject factor ($\geq 2$ related groups)
Dependent variable	Dependent variable	Dependent variable
One quantitative of interval or ratio level	One categorical with 2 independent groups	One of ordinal level
Null hypothesis	Null hypothesis	Null hypothesis
H₀: $\mu_1 = \mu_2$ Here $\mu_1$ is the population mean for group 1, and $\mu_2$ is the population mean for group 2.	Let's say that the scores on the dependent variable are scored 0 and 1. Then for each pair of scores, the data allow four options: First score of pair is 0, second score of pair is 0 First score of pair is 0, second score of pair is 1 (switched) First score of pair is 1, second score of pair is 0 (switched) First score of pair is 1, second score of pair is 1 The null hypothesis H₀ is that for each pair of scores, P(first score of pair is 0 while second score of pair is 1) = P(first score of pair is 1 while second score of pair is 0). That is, the probability that a pair of scores switches from 0 to 1 is the same as the probability that a pair of scores switches from 1 to 0. Other formulations of the null hypothesis are: H₀: $\pi_1 = \pi_2$, where $\pi_1$ is the population proportion of ones for the first paired group and $\pi_2$ is the population proportion of ones for the second paired group H₀: for each pair of scores, P(first score of pair is 1) = P(second score of pair is 1)	H₀: the population scores in any of the related groups are not systematically higher or lower than the population scores in any of the other related groups Usually the related groups are the different measurement points. Several different formulations of the null hypothesis can be found in the literature, and we do not agree with all of them. Make sure you (also) learn the one that is given in your text book or by your teacher.
Alternative hypothesis	Alternative hypothesis	Alternative hypothesis
H₁ two sided: $\mu_1 \neq \mu_2$ H₁ right sided: $\mu_1 > \mu_2$ H₁ left sided: $\mu_1 < \mu_2$	The alternative hypothesis H₁ is that for each pair of scores, P(first score of pair is 0 while second score of pair is 1) $\neq$ P(first score of pair is 1 while second score of pair is 0). That is, the probability that a pair of scores switches from 0 to 1 is not the same as the probability that a pair of scores switches from 1 to 0. Other formulations of the alternative hypothesis are: H₁: $\pi_1 \neq \pi_2$ H₁: for each pair of scores, P(first score of pair is 1) $\neq$ P(second score of pair is 1)	H₁: the population scores in some of the related groups are systematically higher or lower than the population scores in other related groups
Assumptions	Assumptions	Assumptions
Within each population, the scores on the dependent variable are normally distributed The standard deviation of the scores on the dependent variable is the same in both populations: $\sigma_1 = \sigma_2$ Group 1 sample is a simple random sample (SRS) from population 1, group 2 sample is an independent SRS from population 2. That is, within and between groups, observations are independent of one another	Sample of pairs is a simple random sample from the population of pairs. That is, pairs are independent of one another	Sample of 'blocks' (usually the subjects) is a simple random sample from the population. That is, blocks are independent of one another
Test statistic	Test statistic	Test statistic
$t = \dfrac{(\bar{y}_1 - \bar{y}_2) - 0}{s_p\sqrt{\dfrac{1}{n_1} + \dfrac{1}{n_2}}} = \dfrac{\bar{y}_1 - \bar{y}_2}{s_p\sqrt{\dfrac{1}{n_1} + \dfrac{1}{n_2}}}$ Here $\bar{y}_1$ is the sample mean in group 1, $\bar{y}_2$ is the sample mean in group 2, $s_p$ is the pooled standard deviation, $n_1$ is the sample size of group 1, and $n_2$ is the sample size of group 2. The 0 represents the difference in population means according to the null hypothesis. The denominator $s_p\sqrt{\dfrac{1}{n_1} + \dfrac{1}{n_2}}$ is the standard error of the sampling distribution of $\bar{y}_1 - \bar{y}_2$. The $t$ value indicates how many standard errors $\bar{y}_1 - \bar{y}_2$ is removed from 0. Note: we could just as well compute $\bar{y}_2 - \bar{y}_1$ in the numerator, but then the left sided alternative becomes $\mu_2 < \mu_1$, and the right sided alternative becomes $\mu_2 > \mu_1$.	$X^2 = \dfrac{(b - c)^2}{b + c}$ Here $b$ is the number of pairs in the sample for which the first score is 0 while the second score is 1, and $c$ is the number of pairs in the sample for which the first score is 1 while the second score is 0.	$Q = \dfrac{12}{N \times k(k + 1)} \sum R^2_i - 3 \times N(k + 1)$ Here $N$ is the number of 'blocks' (usually the subjects - so if you have 4 repeated measurements for 60 subjects, $N$ equals 60), $k$ is the number of related groups (usually the number of repeated measurements), and $R_i$ is the sum of ranks in group $i$. Remember that multiplication precedes addition, so first compute $\frac{12}{N \times k(k + 1)} \times \sum R^2_i$ and then subtract $3 \times N(k + 1)$. Note: if ties are present in the data, the formula for $Q$ is more complicated.
Pooled standard deviation	n.a.	n.a.
$s_p = \sqrt{\dfrac{(n_1 - 1) \times s^2_1 + (n_2 - 1) \times s^2_2}{n_1 + n_2 - 2}}$	-	-
Sampling distribution of $t$ if H₀ were true	Sampling distribution of $X^2$ if H₀ were true	Sampling distribution of $Q$ if H₀ were true
$t$ distribution with $n_1 + n_2 - 2$ degrees of freedom	If $b + c$ is large enough (say, > 20), approximately the chi-squared distribution with 1 degree of freedom. If $b + c$ is small, the Binomial($n$, $P$) distribution should be used, with $n = b + c$ and $P = 0.5$. In that case the test statistic becomes equal to $b$.	If the number of blocks $N$ is large, approximately the chi-squared distribution with $k - 1$ degrees of freedom. For small samples, the exact distribution of $Q$ should be used.
Significant?	Significant?	Significant?
Two sided: Check if $t$ observed in sample is at least as extreme as critical value $t^$ or Find two sided $p$ value corresponding to observed $t$ and check if it is equal to or smaller than $\alpha$ Right sided: Check if $t$ observed in sample is equal to or larger than critical value $t^$ or Find right sided $p$ value corresponding to observed $t$ and check if it is equal to or smaller than $\alpha$ Left sided: Check if $t$ observed in sample is equal to or smaller than critical value $t^*$ or Find left sided $p$ value corresponding to observed $t$ and check if it is equal to or smaller than $\alpha$	For test statistic $X^2$: Check if $X^2$ observed in sample is equal to or larger than critical value $X^{2*}$ or Find $p$ value corresponding to observed $X^2$ and check if it is equal to or smaller than $\alpha$ If $b + c$ is small, the table for the binomial distribution should be used, with as test statistic $b$: Check if $b$ observed in sample is in the rejection region or Find two sided $p$ value corresponding to observed $b$ and check if it is equal to or smaller than $\alpha$	If the number of blocks $N$ is large, the table with critical $X^2$ values can be used. If we denote $X^2 = Q$: Check if $X^2$ observed in sample is equal to or larger than critical value $X^{2*}$ or Find $p$ value corresponding to observed $X^2$ and check if it is equal to or smaller than $\alpha$
$C\%$ confidence interval for $\mu_1 - \mu_2$	n.a.	n.a.
$(\bar{y}_1 - \bar{y}_2) \pm t^* \times s_p\sqrt{\dfrac{1}{n_1} + \dfrac{1}{n_2}}$ where the critical value $t^$ is the value under the $t_{n_1 + n_2 - 2}$ distribution with the area $C / 100$ between $-t^$ and $t^$ (e.g. $t^$ = 2.086 for a 95% confidence interval when df = 20). The confidence interval for $\mu_1 - \mu_2$ can also be used as significance test.	-	-
Effect size	n.a.	n.a.
Cohen's $d$: Standardized difference between the mean in group $1$ and in group $2$: $$d = \frac{\bar{y}_1 - \bar{y}_2}{s_p}$$ Cohen's $d$ indicates how many standard deviations $s_p$ the two sample means are removed from each other.	-	-
Visual representation	n.a.	n.a.
	-	-
Equivalent to	Equivalent to	n.a.
One way ANOVA with an independent variable with 2 levels ($I$ = 2): two sided two sample $t$ test is equivalent to ANOVA $F$ test when $I$ = 2 two sample $t$ test is equivalent to $t$ test for contrast when $I$ = 2 two sample $t$ test is equivalent to $t$ test multiple comparisons when $I$ = 2 OLS regression with one categorical independent variable with 2 levels: two sided two sample $t$ test is equivalent to $F$ test regression model two sample $t$ test is equivalent to $t$ test for regression coefficient $\beta_1$	Stuart-Maxwell test, with a categorical dependent variable consisting of two independent groups Cochran's Q test, with two related groups Two sided sign test: $b = W$, and $X^2 = z^2$	-
Example context	Example context	Example context
Is the average mental health score different between men and women? Assume that in the population, the standard deviation of mental health scores is equal amongst men and women.	Does a tv documentary about spiders change whether people are afraid (yes/no) of spiders?	Is there a difference in depression level between measurement point 1 (pre-intervention), measurement point 2 (1 week post-intervention), and measurement point 3 (6 weeks post-intervention)?
SPSS	SPSS	SPSS
Analyze > Compare Means > Independent-Samples T Test... Put your dependent (quantitative) variable in the box below Test Variable(s) and your independent (grouping) variable in the box below Grouping Variable Click on the Define Groups... button. If you can't click on it, first click on the grouping variable so its background turns yellow Fill in the value you have used to indicate your first group in the box next to Group 1, and the value you have used to indicate your second group in the box next to Group 2 Continue and click OK	Analyze > Nonparametric Tests > Legacy Dialogs > 2 Related Samples... Put the two paired variables in the boxes below Variable 1 and Variable 2 Under Test Type, select the McNemar test	Analyze > Nonparametric Tests > Legacy Dialogs > K Related Samples... Put the $k$ variables containing the scores for the $k$ related groups in the white box below Test Variables Under Test Type, select the Friedman test
Jamovi	Jamovi	Jamovi
T-Tests > Independent Samples T-Test Put your dependent (quantitative) variable in the box below Dependent Variables and your independent (grouping) variable in the box below Grouping Variable Under Tests, select Student's (selected by default) Under Hypothesis, select your alternative hypothesis	Frequencies > Paired Samples - McNemar test Put one of the two paired variables in the box below Rows and the other paired variable in the box below Columns	ANOVA > Repeated Measures ANOVA - Friedman Put the $k$ variables containing the scores for the $k$ related groups in the box below Measures
Practice questions	Practice questions	Practice questions

Two sample t test - equal variances assumed - overview