B.2 Probability and Statistics
The exponential family and Generalized Linear Models (GLiMs)
Change of variables in probability densities
The score function
The score is defined as the gradient of the log-likelihood (with respect to the parameters,
The variance of the score is known as the Fisher information. Because its mean is zero, it is also the expected square of the score.
The Fisher information for exponential-family random variables
This turns out to take a simple form.
For a (vector) random variable
the Fisher information is:
where in the last line we have used the fact that the derivatives of the log-normalizer are the cumulants of the sufficient statistics (
Therefore,
Markov chains
Discrete random variables
[[[table]]]
Useful identities
Expectations of quadratic forms.
Consider a vector random variable
and then employ the cyclic-permutation property of the matrix-trace operator:
Hence, the expected value of the quadratic function of
Simulating Poisson random variates with mean less than 1.
motivation…