A statistic that keeps all parameter information
A statistic $T(X_1,\dots,X_n)$ is sufficient for a parameter $\theta$ if, after we know $T$, the remaining details of the sample no longer tell us anything about $\theta$.
The main theorem
For a random sample with joint density or mass function $f_\theta(x_1,\dots,x_n)$, a statistic $T(X)$ is sufficient for $\theta$ if the joint model can be factored as
The parameter $\theta$ may appear in $g_\theta$, but the leftover factor $h$ cannot involve $\theta$.
Classroom proof sketch
After conditioning on $T=t$, the factor $g_\theta(t)$ is constant over all samples with the same statistic value. It cancels from the conditional distribution, leaving a distribution that depends only on $h$ and the sample space, not on $\theta$.
Interactive factorization checker
Only the number of successes matters
Let $X_1,\dots,X_n\overset{iid}{\sim}\mathrm{Bernoulli}(p)$. The joint PMF is
So $T=\sum_{i=1}^n X_i$ is sufficient for $p$.
Same statistic, same likelihood shape
For Bernoulli data, the order of 0s and 1s does not affect the likelihood. Only $T=\sum X_i$ matters.
The total count is sufficient
Let $X_1,\dots,X_n\overset{iid}{\sim}\mathrm{Poisson}(\lambda)$. Then
Thus $T=\sum X_i$ is sufficient for $\lambda$.
Likelihood as a function of $\lambda$
The likelihood uses the sample through the total count $T$. The MLE is $\hat\lambda=T/n=\bar X$.
Which normal statistic is sufficient?
For $X_i\sim N(\mu,\sigma^2)$, the answer depends on which parameters are unknown.
| Unknown parameter(s) | Sufficient statistic | Reason |
|---|---|---|
| $\mu$ only, $\sigma^2$ known | $\sum X_i$ or $\bar X$ | Likelihood depends on data through $\sum X_i$ |
| $\sigma^2$ only, $\mu$ known | $\sum (X_i-\mu)^2$ | Likelihood depends on squared deviations |
| Both $\mu,\sigma^2$ unknown | $(\sum X_i,\sum X_i^2)$ | Equivalent to $(\bar X,S^2)$ |
Data cloud and summaries
For unknown $\mu$ and $\sigma^2$, two summaries are needed: location and spread.
The smallest useful compression
A sufficient statistic may not be the smallest possible. A statistic $T$ is minimal sufficient if every other sufficient statistic must contain the information in $T$.
This likelihood-ratio criterion is often the easiest way to identify minimal sufficiency.
Likelihood-ratio test for two samples
Choose a model and compare two samples. The ratio is parameter-free exactly when the sufficient statistic agrees.
Self-check questions
Q1
For $X_i\sim\mathrm{Poisson}(\lambda)$, is $\bar X$ sufficient?
Q2
For $X_i\sim N(\mu,\sigma^2)$ with both parameters unknown, is $\bar X$ alone sufficient?
Q3
What does factorization mean intuitively?