Foundations of Probability and Statistical Theory
A Modern Introduction with Computation, Simulation, and Inference
Welcome
Welcome to the online book for MATH 5010: Foundations of Statistical Theory & Probability.
This book is designed for graduate students who want a rigorous but usable foundation in probability, mathematical statistics, and statistical inference. The central goal is to connect three ways of learning statistics:
- Theory: definitions, theorems, derivations, and proofs.
- Computation: Python labs, simulation, Monte Carlo methods, and numerical verification.
- Interactive learning: visual HTML modules that let students explore concepts dynamically.
The book begins with probability theory and random variables, then builds toward convergence theory, sampling distributions, sufficient statistics, point estimation, hypothesis testing, interval estimation, ANOVA, and Bayesian inference.
0.1 How to use this book
Each main chapter is written as a lecture-ready Quarto page. Most chapters are paired with an interactive HTML activity and a Python notebook lab.
A good workflow for students is:
- read the chapter notes before class;
- work through the examples during class;
- use the interactive HTML page to visualize the main ideas;
- open the notebook in Google Colab and run the simulations;
- return to the practice problems and solutions for review.
A good workflow for instructors is:
- use the chapter page as lecture notes;
- use the examples as board work or in-class activities;
- assign selected practice problems as homework or discussion problems;
- use the notebook labs for computational reinforcement;
- use the HTML modules for visual explanation and student exploration.
0.2 Book map
0.2.1 Part I: Probability foundations
| Chapter | Topic | Main purpose |
|---|---|---|
| 1 | Basic Probability | Build the language of probability: sample spaces, events, probability rules, conditioning, independence, Bayes theorem, and paradoxes. |
| 2 | Random Variables and Distributions | Introduce random variables, CDFs, PMFs, PDFs, mixed distributions, and common distribution families. |
| 3 | Joint and Conditional Probability | Study joint, marginal, and conditional distributions; independence; covariance; correlation; and Bayes rules in multivariate settings. |
| 4 | Expectations, Moments, and MGFs | Develop expected value, variance, moments, moment-generating functions, and multivariate moment tools. |
| 5 | Transformations | Learn CDF methods, change of variables, Jacobians, convolution, products, quotients, extrema, and polar transformations. |
| 6 | Conditional Expectations | Use conditional distributions and conditional expectation to compute means, variances, Bayesian updates, and first-step decompositions. |
| 6 Extra | Multinomial and Multinormal Models | Extend probability tools to multinomial, Dirichlet, multivariate normal, conditional Gaussian, and linear Gaussian models. |
| 7 | Inequalities and Identities | Study Markov, Chebyshev, Chernoff, Hoeffding, Jensen, Cauchy-Schwarz, and other core probability inequalities. |
| 8 | Sampling and Order Statistics | Introduce random samples, sample mean and variance, normal sampling theory, chi-square, \(t\), \(F\), and order statistics. |
| 9 | Convergence Theory | Study convergence in distribution, probability, mean, almost surely, LLN, CLT, continuous mapping, and Slutsky theorem. |
| 10 | Monte Carlo Sampling | Use simulation, Monte Carlo integration, inverse transform sampling, rejection sampling, importance sampling, and SIR. |
0.2.2 Part II: Statistical inference
| Chapter | Topic | Main purpose |
|---|---|---|
| 11 | Sufficient Statistics | Understand statistics, estimators, sufficiency, factorization, minimal sufficiency, complete sufficiency, and the likelihood principle. |
| 12 | Point Estimation I | Derive estimators using method of moments, maximum likelihood, Bayesian estimation, conjugacy, and MAP estimation. |
| 13 | Point Estimation II | Evaluate estimators using bias, variance, MSE, Fisher information, Cramér-Rao bounds, Rao-Blackwell, and UMVUE theory. |
| 14 | Hypothesis Tests I | Introduce null and alternative hypotheses, Type I/II errors, \(p\)-values, likelihood ratio tests, Bayesian tests, and Neyman-Pearson ideas. |
| 15 | Hypothesis Tests II | Study power functions, size, unbiased tests, UMP tests, binomial and normal examples, and risk-based testing. |
| 16 | Interval Estimation I | Construct confidence intervals using pivots, test inversion, normal theory, binomial, uniform, exponential, and Bayesian credible intervals. |
| 17 | Interval Estimation II | Study interval optimality, coverage, shortest intervals, UMA confidence sets, HPD intervals, and risk of confidence sets. |
| 18 | ANOVA I | Develop one-way ANOVA, treatment effects, contrasts, sums of squares, ANOVA tables, and the \(F\)-test. |
| 19 | ANOVA II and Applications | Apply ANOVA to real examples, multiple testing, experimental design, randomized block designs, and Python workflows. |
| 20 | Bayesian Inference | Unify Bayesian estimation, testing, credible intervals, HPD intervals, conjugate models, Bayes risk, and Bayesian workflow. |
0.3 Labs and interactive pages
The computational and interactive materials are summarized on a separate page:
Open the labs and interactive modules summary
The labs are stored in the labs/ folder and can be opened directly in Google Colab from the GitHub repository. The interactive pages are stored in the htmls/ folder and are intended for visual exploration before, during, or after lecture.