Non-Parametric Tests (The ones that we are going to discuss in here...)

1. One-sample tests & goodness-of-fit¶

Sign test¶

Tests whether the median equals a given value
Very weak assumptions
Mainly pedagogical (low power)

Wilcoxon signed-rank test¶

Non-parametric alternative to the one-sample t-test
Uses sign + magnitude
Assumes symmetry

Kolmogorov–Smirnov test (1-sample)¶

Tests full distribution against a known CDF
Sensitive to global differences
Parameters must be known (important caveat)

Anderson–Darling test¶

Goodness-of-fit test with strong tail sensitivity
Strictly better than KS for normality testing

2. Two-sample location tests (independent samples)¶

Mann–Whitney U test (Wilcoxon rank-sum)¶

Alternative to two-sample t-test
Tests stochastic dominance, not equality of means
Assumes identical shapes

Two-sample Kolmogorov–Smirnov test¶

Detects any distributional difference
Low power for pure location shifts

Brunner–Munzel test¶

Robust alternative to Mann–Whitney
Allows heteroscedasticity

3. Paired / repeated-measures tests¶

Wilcoxon signed-rank test (paired)¶

Non-parametric analogue of the paired t-test
Assumes symmetry

Sign test (paired)¶

Extremely robust
Very low power

4. More than two groups (one-way designs)¶

Kruskal–Wallis test¶

Non-parametric one-way ANOVA
Rank-based
Tests equality of distributions (not means)

Post-hoc procedures¶

Dunn test
Pairwise Wilcoxon tests with multiple-testing correction

5. Blocked & repeated-measures designs¶

Friedman test¶

Non-parametric one-way repeated-measures ANOVA
Blocks typically correspond to subjects

Quade test¶

Weighted version of Friedman
More powerful when blocks differ in importance

6. Factorial designs (two or more factors)¶

Aligned Rank Transform (ART) ANOVA¶

Non-parametric alternative to full factorial ANOVA
Correctly tests main effects and interactions
Essential for advanced courses

Permutation-based factorial ANOVA¶

Model-free
Handles interactions naturally
Strong conceptual link to ML validation

7. Scale / variance tests¶

Ansari–Bradley test¶

Rank-based test for equality of scale
Assumes symmetry

Fligner–Killeen test¶

Fully non-parametric
Robust to non-normality
Preferred in practice

Levene / Brown–Forsythe tests¶

Semi-parametric but widely used
Robust and practical

8. Association & dependence¶

Spearman’s rho¶

Rank correlation
Detects monotone relationships

Kendall’s tau¶

Concordance-based measure
Better for small samples and ties

Hoeffding’s D (optional / advanced)¶

Detects general dependence
Computationally heavier

9. Categorical data (non-parametric by nature)¶

Chi-squared tests¶

Goodness-of-fit
Independence
Homogeneity

Fisher’s exact test¶

Exact inference
Suitable for small samples

10. Resampling-based inference¶

Permutation tests¶

Location, scale, association
Distribution-free
Unifying framework for many classical tests

Bootstrap confidence intervals¶

Percentile interval
Bootstrap-t interval
BCa interval (recommended)

Minimal required¶

Wilcoxon signed-rank
Mann–Whitney U
Kruskal–Wallis
Friedman
Spearman / Kendall
Chi-squared + Fisher
Aligned Rank Transform ANOVA
Permutation tests
Bootstrap confidence intervals

We will cover by now:

Sign test
Wilcoxon signed-rank test
Mann–Whitney U test (Wilcoxon rank-sum)
Wilcoxon signed-rank test (paired)
Ansari–Bradley test
Levene / Brown–Forsythe tests

Sign Test

Purpose¶

Tests whether the median of a population equals a given value, or whether the median of paired differences equals zero.

Hypotheses¶

H₀: median = m₀
H₁: median ≠ m₀ (or one-sided)

Exact statistic¶

Let

$$ d_i = x_i - m₀ $$
$$ S = \sum_{i=1}^n \mathbf{1}\{ d_i > 0 \} $$

Zero differences are discarded.

Null distribution¶

Under H₀: $$ S \sim \text{Binomial}(n, 1/2) $$

Exact p-values are computed from the binomial distribution.

Assumptions¶

Independent observations
Continuous distribution
No symmetry required

Why the test works (theory)¶

If m₀ is the true median, then $$ \mathbb{P}(X_i > m₀) = \mathbb{P}(X_i < m₀) = \tfrac{1}{2}. $$

By independence, the indicator variables are i.i.d. Bernoulli(1/2), yielding an exact distribution-free test.

Interpretation¶

Uses only signs → extremely robust
Low power due to loss of magnitude information

Example¶

A study is done to determine the effects of removing a renal blockage in patients whose renal function is impaired because of advanced metatstatic malignancy of nonurologic cause. The arterial blood pressure of a random sample of 10 patients is measured before and after surgery for treatment of the blockage yielded the following data:

before = [150, 132, 130, 116, 107, 100, 101, 96, 90, 78]
after = [90, 102, 80, 82, 90, 94, 84, 93, 90, 80]

Wilcoxon Signed-Rank Test

The Wilcoxon signed-rank test is a non-parametric test for detecting a location shift in one-sample or paired-sample settings.
It is a robust alternative to the one-sample or paired t-test under non-normality.

1. Problem setup¶

One-sample case¶

Let $$ X_1, \dots, X_n \quad \text{i.i.d.} $$ and let $m_0$ be a hypothesized median.

Define differences: $$ d_i = X_i - m_0. $$

Paired two-sample case¶

Given paired observations $(X_i, Y_i)$, define $$ d_i = X_i - Y_i. $$

In both cases, inference is performed on the distribution of the differences $d_i$.

2. Hypotheses¶

Null hypothesis $$ H_0: \text{the distribution of } d_i \text{ is symmetric about } 0 $$
Alternative hypothesis $$ H_1: \text{the distribution of } d_i \text{ is not symmetric about } 0 $$ (or one-sided variants)

⚠️ Note: this is not merely a median test; symmetry is essential.

3. Assumptions¶

Independence of observations (or pairs)
Continuity (no ties in $|d_i|$)
Symmetry of the distribution of $d_i$ under $H_0$

4. Test statistic¶

Remove zero differences $d_i = 0$
Compute absolute differences $|d_i|$
Rank $|d_i|$ to obtain ranks $$ R_i \in \{1,2,\dots,n\} $$
Define signs $$ S_i = \operatorname{sign}(d_i) \in \{-1,+1\} $$

Signed-rank statistic (theoretical form)¶

$$ T_n = \sum_{i=1}^n R_i S_i $$

Positive rank-sum statistic (computational form)¶

$$ W^+ = \sum_{i=1}^n R_i \mathbf{1}\{d_i > 0\} $$

5. Relationship between the statistics¶

The two statistics are affinely equivalent: $$ W^+ = \frac{n(n+1)}{4} + \frac{1}{2} T_n, \quad T_n = 2W^+ - \frac{n(n+1)}{2}. $$

They induce identical tests, p-values, and decisions.

6. Exact null distribution (finite sample)¶

Under $H_0$:

The ranks $R_1,\dots,R_n$ are fixed
The signs $S_i$ are i.i.d. with $$ \mathbb{P}(S_i = +1) = \mathbb{P}(S_i = -1) = \tfrac12 $$

Thus, $$ T_n = \sum_{i=1}^n R_i S_i $$ has an exact permutation distribution.

Equivalently, $$ W^+ = \sum_{i=1}^n R_i B_i, \quad B_i \sim \text{Bernoulli}(1/2). $$

This distribution:

is discrete
depends only on $n$
is symmetric
is distribution-free

7. Support of the distribution¶

Let $$ S = \sum_{k=1}^n k = \frac{n(n+1)}{2}. $$

Then $$ W^+ \in \{0,1,\dots,S\}. $$

Each value corresponds to the sum of a subset of $\{1,\dots,n\}$.

Formally: $$

\mathbb{P}(W^+ = w)¶

\frac{#{\text{subsets of } {1,\dots,n} \text{ with sum } w}}{2^n}. $$

8. Symmetry of the distribution¶

For every subset $A \subseteq \{1,\dots,n\}$, its complement $A^c$ satisfies: $$ \sum_{k \in A^c} k = S - \sum_{k \in A} k. $$

Hence: $$ \mathbb{P}(W^+ = w) = \mathbb{P}(W^+ = S - w), $$ and $$ \mathbb{E}[W^+] = \frac{S}{2} = \frac{n(n+1)}{4}. $$

9. Mean and variance under $H_0$¶

For $W^+$: $$ \mathbb{E}[W^+] = \frac{n(n+1)}{4}, \quad \mathrm{Var}(W^+) = \frac{n(n+1)(2n+1)}{24}. $$

For $T_n$: $$ \mathbb{E}[T_n] = 0, \quad \mathrm{Var}(T_n) = \frac{n(n+1)(2n+1)}{6}. $$

10. Why the test works (core theoretical reason)¶

Under symmetry: $$ d_i \stackrel{d}{=} -d_i. $$

Therefore:

signs $S_i$ are independent of magnitudes $|d_i|$
conditional on the ranks, $S_i$ are i.i.d. Rademacher variables

Thus the statistic reduces to a randomly signed sum of fixed ranks, yielding:

an exact permutation distribution
distribution-free inference

11. Asymptotic null distribution (CLT)¶

Conditionally on the ranks: $$ T_n = \sum_{i=1}^n R_i S_i $$ is a sum of independent, mean-zero random variables.

Let $$ \sigma_n^2 = \sum_{i=1}^n R_i^2 \sim \frac{n^3}{3}. $$

Since $$ \max_i \frac{R_i^2}{\sigma_n^2} \to 0, $$ the Lindeberg condition holds.

Hence: $$ \frac{T_n}{\sqrt{\sigma_n^2}} \xrightarrow{d} N(0,1). $$

Equivalently: $$ \frac{W^+ - \mathbb{E}[W^+]}{\sqrt{\mathrm{Var}(W^+)}} \xrightarrow{d} N(0,1). $$

12. Interpretation¶

Tests for a location shift under symmetry
Uses both direction and magnitude
More powerful than the sign test
Robust to non-normality, but not to skewness

13. When the test fails¶

Strongly skewed distributions
Heavy ties
Discrete data
Dependence between observations

In these cases, the null distribution is distorted.

14. One-sentence summary (exam-perfect)¶

The Wilcoxon signed-rank test is an exact, distribution-free test for symmetry-based location shifts, whose null distribution arises from random sign permutations of fixed ranks and converges asymptotically to a normal distribution.

Seminar 7