This page offers structured overviews of one or more selected methods. Add additional methods for comparisons (max. of 3) by clicking on the dropdown button in the right-hand column. To practice with a specific method click the button at the bottom row of the table
One or more quantitative of interval or ratio level and/or one or more categorical with independent groups, transformed into code variables
One or more quantitative of interval or ratio level and/or one or more categorical with independent groups, transformed into code variables
None
One within subject factor ($\geq 2$ related groups)
Dependent variable
Dependent variable
Dependent variable
Dependent variable
One categorical with 2 independent groups
One categorical with 2 independent groups
One categorical with $J$ independent groups ($J \geqslant 2$)
One of ordinal level
Null hypothesis
Null hypothesis
Null hypothesis
Null hypothesis
Model chi-squared test for the complete regression model:
H0: $\beta_1 = \beta_2 = \ldots = \beta_K = 0$
Wald test for individual regression coefficient $\beta_k$:
H0: $\beta_k = 0$
or in terms of odds ratio:
H0: $e^{\beta_k} = 1$
Likelihood ratio chi-squared test for individual regression coefficient $\beta_k$:
H0: $\beta_k = 0$
or in terms of odds ratio:
H0: $e^{\beta_k} = 1$
in the regression equation
$
\ln \big(\frac{\pi_{y = 1}}{1 - \pi_{y = 1}} \big) = \beta_0 + \beta_1 \times x_1 + \beta_2 \times x_2 + \ldots + \beta_K \times x_K
$. Here $ x_i$ represents independent variable $ i$, $\beta_i$ is the regression weight for independent variable $ x_i$, and $\pi_{y = 1}$ represents the true probability that the dependent variable $ y = 1$ (or equivalently, the proportion of $ y = 1$ in the population) given the scores on the independent variables.
Model chi-squared test for the complete regression model:
H0: $\beta_1 = \beta_2 = \ldots = \beta_K = 0$
Wald test for individual regression coefficient $\beta_k$:
H0: $\beta_k = 0$
or in terms of odds ratio:
H0: $e^{\beta_k} = 1$
Likelihood ratio chi-squared test for individual regression coefficient $\beta_k$:
H0: $\beta_k = 0$
or in terms of odds ratio:
H0: $e^{\beta_k} = 1$
in the regression equation
$
\ln \big(\frac{\pi_{y = 1}}{1 - \pi_{y = 1}} \big) = \beta_0 + \beta_1 \times x_1 + \beta_2 \times x_2 + \ldots + \beta_K \times x_K
$. Here $ x_i$ represents independent variable $ i$, $\beta_i$ is the regression weight for independent variable $ x_i$, and $\pi_{y = 1}$ represents the true probability that the dependent variable $ y = 1$ (or equivalently, the proportion of $ y = 1$ in the population) given the scores on the independent variables.
H0: the population proportions in each of the $J$ conditions are $\pi_1$, $\pi_2$, $\ldots$, $\pi_J$
or equivalently
H0: the probability of drawing an observation from condition 1 is $\pi_1$, the probability of drawing an observation from condition 2 is $\pi_2$, $\ldots$,
the probability of drawing an observation from condition $J$ is $\pi_J$
H0: the population scores in any of the related groups are not systematically higher or lower than the population scores in any of the other related groups
Usually the related groups are the different measurement points. Several different formulations of the null hypothesis can be found in the literature, and we do not agree with all of them. Make sure you (also) learn the one that is given in your text book or by your teacher.
Alternative hypothesis
Alternative hypothesis
Alternative hypothesis
Alternative hypothesis
Model chi-squared test for the complete regression model:
H1: not all population regression coefficients are 0
Wald test for individual regression coefficient $\beta_k$:
H1: $\beta_k \neq 0$
or in terms of odds ratio:
H1: $e^{\beta_k} \neq 1$
If defined as Wald $ = \dfrac{b_k}{SE_{b_k}}$ (see 'Test statistic'), also one sided alternatives can be tested:
H1 right sided: $\beta_k > 0$
H1 left sided: $\beta_k < 0$
Likelihood ratio chi-squared test for individual regression coefficient $\beta_k$:
H1: $\beta_k \neq 0$
or in terms of odds ratio:
H1: $e^{\beta_k} \neq 1$
Model chi-squared test for the complete regression model:
H1: not all population regression coefficients are 0
Wald test for individual regression coefficient $\beta_k$:
H1: $\beta_k \neq 0$
or in terms of odds ratio:
H1: $e^{\beta_k} \neq 1$
If defined as Wald $ = \dfrac{b_k}{SE_{b_k}}$ (see 'Test statistic'), also one sided alternatives can be tested:
H1 right sided: $\beta_k > 0$
H1 left sided: $\beta_k < 0$
Likelihood ratio chi-squared test for individual regression coefficient $\beta_k$:
H1: $\beta_k \neq 0$
or in terms of odds ratio:
H1: $e^{\beta_k} \neq 1$
H1: the population proportions are not all as specified under the null hypothesis
or equivalently
H1: the probabilities of drawing an observation from each of the conditions are not all as specified under the null hypothesis
H1: the population scores in some of the related groups are systematically higher or lower than the population scores in other related groups
Assumptions
Assumptions
Assumptions
Assumptions
In the population, the relationship between the independent variables and the log odds $\ln (\frac{\pi_{y=1}}{1 - \pi_{y=1}})$ is linear
The residuals are independent of one another
Often ignored additional assumption:
Variables are measured without error
Also pay attention to:
Multicollinearity
Outliers
In the population, the relationship between the independent variables and the log odds $\ln (\frac{\pi_{y=1}}{1 - \pi_{y=1}})$ is linear
The residuals are independent of one another
Often ignored additional assumption:
Variables are measured without error
Also pay attention to:
Multicollinearity
Outliers
Sample size is large enough for $X^2$ to be approximately chi-squared distributed. Rule of thumb: all $J$ expected cell counts are 5 or more
Sample is a simple random sample from the population. That is, observations are independent of one another
Sample of 'blocks' (usually the subjects) is a simple random sample from the population. That is, blocks are independent of one another
Test statistic
Test statistic
Test statistic
Test statistic
Model chi-squared test for the complete regression model:
$X^2 = D_{null} - D_K = \mbox{null deviance} - \mbox{model deviance} $
$D_{null}$, the null deviance, is conceptually similar to the total variance of the dependent variable in OLS regression analysis. $D_K$, the model deviance, is conceptually similar to the residual variance in OLS regression analysis.
Wald test for individual $\beta_k$:
The wald statistic can be defined in two ways:
Wald $ = \dfrac{b_k^2}{SE^2_{b_k}}$
Wald $ = \dfrac{b_k}{SE_{b_k}}$
SPSS uses the first definition.
Likelihood ratio chi-squared test for individual $\beta_k$:
$X^2 = D_{K-1} - D_K$
$D_{K-1}$ is the model deviance, where independent variable $k$ is excluded from the model. $D_{K}$ is the model deviance, where independent variable $k$ is included in the model.
Model chi-squared test for the complete regression model:
$X^2 = D_{null} - D_K = \mbox{null deviance} - \mbox{model deviance} $
$D_{null}$, the null deviance, is conceptually similar to the total variance of the dependent variable in OLS regression analysis. $D_K$, the model deviance, is conceptually similar to the residual variance in OLS regression analysis.
Wald test for individual $\beta_k$:
The wald statistic can be defined in two ways:
Wald $ = \dfrac{b_k^2}{SE^2_{b_k}}$
Wald $ = \dfrac{b_k}{SE_{b_k}}$
SPSS uses the first definition.
Likelihood ratio chi-squared test for individual $\beta_k$:
$X^2 = D_{K-1} - D_K$
$D_{K-1}$ is the model deviance, where independent variable $k$ is excluded from the model. $D_{K}$ is the model deviance, where independent variable $k$ is included in the model.
$X^2 = \sum{\frac{(\mbox{observed cell count} - \mbox{expected cell count})^2}{\mbox{expected cell count}}}$
Here the expected cell count for one cell = $N \times \pi_j$, the observed cell count is the observed sample count in that same cell, and the sum is over all $J$ cells.
Here $N$ is the number of 'blocks' (usually the subjects - so if you have 4 repeated measurements for 60 subjects, $N$ equals 60), $k$ is the number of related groups (usually the number of repeated measurements), and $R_i$ is the sum of ranks in group $i$.
Remember that multiplication precedes addition, so first compute $\frac{12}{N \times k(k + 1)} \times \sum R^2_i$ and then subtract $3 \times N(k + 1)$.
Note: if ties are present in the data, the formula for $Q$ is more complicated.
Sampling distribution of $X^2$ and of the Wald statistic if H0 were true
Sampling distribution of $X^2$ and of the Wald statistic if H0 were true
Find $p$ value corresponding to observed $X^2$ and check if it is equal to or smaller than $\alpha$
Wald-type approximate $C\%$ confidence interval for $\beta_k$
Wald-type approximate $C\%$ confidence interval for $\beta_k$
n.a.
n.a.
$b_k \pm z^* \times SE_{b_k}$
where the critical value $z^*$ is the value under the normal curve with the area $C / 100$ between $-z^*$ and $z^*$ (e.g. $z^*$ = 1.96 for a 95% confidence interval).
$b_k \pm z^* \times SE_{b_k}$
where the critical value $z^*$ is the value under the normal curve with the area $C / 100$ between $-z^*$ and $z^*$ (e.g. $z^*$ = 1.96 for a 95% confidence interval).
-
-
Goodness of fit measure $R^2_L$
Goodness of fit measure $R^2_L$
n.a.
n.a.
$R^2_L = \dfrac{D_{null} - D_K}{D_{null}}$
There are several other goodness of fit measures in logistic regression. In logistic regression, there is no single agreed upon measure of goodness of fit.
$R^2_L = \dfrac{D_{null} - D_K}{D_{null}}$
There are several other goodness of fit measures in logistic regression. In logistic regression, there is no single agreed upon measure of goodness of fit.
-
-
Example context
Example context
Example context
Example context
Can body mass index, stress level, and gender predict whether people get diagnosed with diabetes?
Can body mass index, stress level, and gender predict whether people get diagnosed with diabetes?
Is the proportion of people with a low, moderate, and high social economic status in the population different from $\pi_{low} = 0.2,$ $\pi_{moderate} = 0.6,$ and $\pi_{high} = 0.2$?
Is there a difference in depression level between measurement point 1 (pre-intervention), measurement point 2 (1 week post-intervention), and measurement point 3 (6 weeks post-intervention)?
SPSS
SPSS
SPSS
SPSS
Analyze > Regression > Binary Logistic...
Put your dependent variable in the box below Dependent and your independent (predictor) variables in the box below Covariate(s)
Analyze > Regression > Binary Logistic...
Put your dependent variable in the box below Dependent and your independent (predictor) variables in the box below Covariate(s)
Put your categorical variable in the box below Test Variable List
Fill in the population proportions / probabilities according to $H_0$ in the box below Expected Values. If $H_0$ states that they are all equal, just pick 'All categories equal' (default)
Analyze > Nonparametric Tests > Legacy Dialogs > K Related Samples...
Put the $k$ variables containing the scores for the $k$ related groups in the white box below Test Variables
Under Test Type, select the Friedman test
Jamovi
Jamovi
Jamovi
Jamovi
Regression > 2 Outcomes - Binomial
Put your dependent variable in the box below Dependent Variable and your independent variables of interval/ratio level in the box below Covariates
If you also have code (dummy) variables as independent variables, you can put these in the box below Covariates as well
Instead of transforming your categorical independent variable(s) into code variables, you can also put the untransformed categorical independent variables in the box below Factors. Jamovi will then make the code variables for you 'behind the scenes'
Regression > 2 Outcomes - Binomial
Put your dependent variable in the box below Dependent Variable and your independent variables of interval/ratio level in the box below Covariates
If you also have code (dummy) variables as independent variables, you can put these in the box below Covariates as well
Instead of transforming your categorical independent variable(s) into code variables, you can also put the untransformed categorical independent variables in the box below Factors. Jamovi will then make the code variables for you 'behind the scenes'
Frequencies > N Outcomes - $\chi^2$ Goodness of fit
Put your categorical variable in the box below Variable
Click on Expected Proportions and fill in the population proportions / probabilities according to $H_0$ in the boxes below Ratio. If $H_0$ states that they are all equal, you can leave the ratios equal to the default values (1)
ANOVA > Repeated Measures ANOVA - Friedman
Put the $k$ variables containing the scores for the $k$ related groups in the box below Measures