Photo by Joel Abraham on Unsplash
Likelihood-Based Inference: From Foundations to Research¶
This is a comprehensive guide to likelihood-based statistical inference, designed to take the reader from basic probability through to research-grade methods. Every result is derived from first principles with plain-English explanations alongside the mathematics.
Contents
- Part I: Foundations
- Chapter 1: Probability Basics
- Chapter 2: Random Variables
- 2.1 What Is a Random Variable?
- 2.2 Probability Mass Function (PMF)
- 2.3 Probability Density Function (PDF)
- 2.4 Cumulative Distribution Function (CDF)
- 2.5 Expectation (Expected Value)
- 2.6 Variance
- 2.7 Moment Generating Functions
- 2.8 Joint Distributions
- 2.9 Covariance and Correlation
- 2.10 Conditional Distributions and Conditional Expectation
- 2.11 The Law of Large Numbers
- 2.12 The Central Limit Theorem
- 2.13 Summary
- Chapter 3: Common Distributions
- Chapter 4: The Likelihood Function
- 4.1 Likelihood vs. Probability: The Key Distinction
- 4.2 Formal Definition of the Likelihood Function
- 4.3 The Log-Likelihood
- 4.4 Finding the MLE Analytically
- 4.5 The Score Function
- 4.6 Fisher Information
- 4.7 Sufficient Statistics
- 4.8 The Likelihood Principle
- 4.9 Bringing It All Together: Complete Light Bulb Analysis
- 4.10 Summary
- Part II: The Likelihood Catalogue
- Chapter 5 — Discrete Likelihoods
- Chapter 6 — Continuous Likelihoods
- 6.1 Normal (Gaussian) Distribution
- 6.2 Exponential Distribution
- 6.3 Gamma Distribution
- 6.4 Beta Distribution
- 6.5 Log-Normal Distribution
- 6.6 Weibull Distribution
- 6.7 Pareto Distribution
- 6.8 Student’s t-Distribution
- 6.9 Chi-Squared Distribution
- 6.10 F-Distribution
- 6.11 Uniform Distribution
- 6.12 Cauchy Distribution
- 6.13 Summary Table
- Chapter 7 — Multivariate Likelihoods
- Chapter 8 — Specialized Likelihoods
- Part III: Maximum Likelihood Estimation
- Chapter 9 — MLE Theory
- Chapter 10 — Analytical MLE Solutions
- 10.1 Normal Distribution — \(\mathcal{N}(\mu, \sigma^2)\)
- 10.2 Exponential Distribution — \(\text{Exp}(\lambda)\)
- 10.3 Poisson Distribution — \(\text{Pois}(\lambda)\)
- 10.4 Binomial Distribution — \(\text{Bin}(m, p)\)
- 10.5 Gamma Distribution — \(\text{Gamma}(\alpha, \beta)\)
- 10.6 Beta Distribution — \(\text{Beta}(\alpha, \beta)\)
- 10.7 Uniform Distribution — \(\text{Unif}(0, \theta)\)
- 10.8 Multinomial Distribution
- 10.9 Summary of Analytical MLEs
- Chapter 11 — Confidence Intervals and Hypothesis Testing
- 11.1 The Likelihood Ratio Test
- 11.2 The Wald Test
- 11.3 The Score (Rao) Test
- 11.4 Comparison of the Three Tests
- 11.5 Confidence Intervals from Fisher Information (Wald-Type CIs)
- 11.6 Profile Likelihood Confidence Intervals
- 11.7 Likelihood Ratio Confidence Regions
- 11.8 Complete A/B Test Analysis: All Methods Together
- 11.9 Multiple Testing Corrections
- 11.10 Practical Guidance: Choosing a Method
- 11.11 Summary
- Part IV: Optimization for Likelihood
- Chapter 12: Gradient Methods
- Chapter 13: Newton and Scoring Methods
- 13.1 Newton–Raphson from the Second-Order Taylor Expansion
- 13.2 Application to Maximum Likelihood Estimation
- 13.3 Fisher Scoring
- 13.4 Connection to IRLS for Generalized Linear Models
- 13.5 Modified Newton Methods
- 13.6 Convergence of Newton’s Method
- 13.7 Practical Issues
- 13.8 Worked Example: Logistic Regression
- 13.9 Summary
- Chapter 14: Quasi-Newton Methods
- Chapter 15: The EM Algorithm
- 15.1 The Incomplete-Data Problem
- 15.2 Derivation via Jensen’s Inequality
- 15.3 Monotone Ascent: EM Never Decreases the Likelihood
- 15.4 Example: Gaussian Mixture Model
- 15.5 Example: Simple Missing Data
- 15.6 ECM: Expectation Conditional Maximization
- 15.7 MCEM: Monte Carlo EM
- 15.8 Variational EM
- 15.9 Convergence Rate of EM
- 15.10 Generalizations and Connections
- 15.11 Summary
- Chapter 16: Constrained Optimization
- 16.1 Equality Constraints and Lagrange Multipliers
- 16.2 Inequality Constraints and KKT Conditions
- 16.3 Examples in Maximum Likelihood
- 16.4 Augmented Lagrangian Method
- 16.5 Barrier (Interior-Point) Methods
- 16.6 Penalty Methods
- 16.7 Reparameterization as an Alternative to Constraints
- 16.8 Putting It All Together: Constrained MLE
- 16.9 Summary
- Part V: Advanced and Research Topics
- Chapter 17 – Bayesian Connections
- Chapter 18 – Computational Methods
- Chapter 19 – Model Selection
- 19.1 The Bias–Variance Trade-off
- 19.2 Akaike Information Criterion (AIC)
- 19.3 Bayesian Information Criterion (BIC)
- 19.4 Deviance Information Criterion (DIC)
- 19.5 Widely Applicable Information Criterion (WAIC)
- 19.6 Cross-Validation
- 19.7 Likelihood Ratio Test for Nested Models
- 19.8 Model Averaging
- 19.9 A Practical Decision Guide
- 19.10 Summary
- Chapter 20 – Modern Research Frontiers
- Chapter 21 – Numerical Considerations
- 21.1 Floating-Point Arithmetic
- 21.2 Working in Log-Space
- 21.3 The Log-Sum-Exp Trick
- 21.4 Numerical Differentiation
- 21.5 Numerical Hessians
- 21.6 Condition Numbers
- 21.7 Cholesky vs Inverse for Solving Linear Systems
- 21.8 Parameterization Matters
- 21.9 Sparse and Structured Hessians
- 21.10 Parallel and GPU Computation
- 21.11 Software Libraries
- 21.12 Summary
- Appendices