General multivariate normal distribution

Dependencies:

Standard multivariate normal distribution
Independence of random variables (incomplete)
Sum of independent normal random variables is normal
/probability/rand-vars/variance/variance-of-affine-form-of-matrix
Linearity of expectation for matrices
/probability/rand-vars/var-matrix-is-psd
/probability/rand-vars/moment-generating-function
/probability/rand-vars/mgf-gives-distr
/probability/normal-distr/mgf
Identity matrix
/linear-algebra/matrices/sqrt-of-symm-matrix
Full-rank square matrix is invertible
/linear-algebra/matrices/psd-full-rank

$\newcommand{\E}{\operatorname{E}}$ $\newcommand{\Var}{\operatorname{Var}}$ $\newcommand{\Cov}{\operatorname{Cov}}$

There is a class of probability distributions called multivariate normal distributions. Let $X = [X_1, X_2, \ldots, X_n]$ be a sequence of random variables. $X$ is said to follow a multivariate normal distribution (or more simply, $X$ is normal) iff \[ \forall a \in \mathbb{R}^n, \exists (\mu, \sigma), a^TX \sim \mathcal{N}(\mu, \sigma^2) \]

For a $k$-by-$n$ matrix $A$, $X$ is said to be $(\mu, A)$-normal iff $\exists Z \sim \mathcal{N}(0, 1)^k$ such that $X = \mu + AZ$.

We can prove that $X$ is normal iff $\exists \mu \in \mathbb{R}^n, \exists A \in \mathbb{R}^{k \times n}$ such that $X$ is $(\mu, A)$-normal. Therefore, $(\mu, A)$-normality can be used as an alternative definition of the class of multivariate normal distributions.

If $X$ is $(\mu, A)$-normal, then $\E(X) = \mu$ and $\Var(X) = AA^T$. We'll also prove that if $X$ is $(\mu, A)$-normal, then the probability distribution of $X$ is uniquely determined by $\mu$ and $AA^T$. Therefore, a multivariate normal distribution is uniquely defined by its mean $\mu$ and covariance matrix $\Sigma$. Denote this distribution by $\mathcal{N}(\mu, \Sigma)$. The standard normal distribution is therefore, $\mathcal{N}(0, I)$.

This finally gives us that $X \sim \mathcal{N}(\mu, \Sigma)$ iff \[ \exists A, (\Sigma = AA^T \textrm{ and } \exists Z \sim \mathcal{N}(0, I), X = \mu + AZ) \]

When $\Sigma$ is positive definite, $X \sim \mathcal{N}(\mu, \Sigma)$ has a density function given by \[ f_X(x) = \frac{1}{\sqrt{(2\pi)^k |\Sigma|}} \exp\left( -\frac{1}{2} (x-\mu)^T \Sigma^{-1} (x-\mu) \right) \]

$\E(X)$ and $\Var(X)$

Suppose $X$ is $(\mu, A)$-normal. \[ \E(X) = \E(\mu + AZ) = \mu + A\E(Z) = \mu \] \[ \Var(X) = \Var(\mu + AZ) = A\Var(Z)A^T = AA^T \]

Proof that $(\mu, A)$-normal iff normal

Suppose $X$ is $(\mu, A)$-normal, where $A \in \mathbb{R}^{n \times k}$. So $X = \mu + AZ$, where $Z$ is standard normal.

Let $b = A^Ta$. Then $a^TX = a^T\mu + b^TZ$ Since $b^TZ$ is a linear combination of standard normal variables, $b^TZ \sim \mathcal{N}(0, b^Tb)$. So, $a^TX \sim \mathcal{N}(a^T\mu, a^TAA^Ta)$.

Since $\forall a \in \mathbb{R}^n, a^TX$ is normal, $X$ is normal.

The proof that normal implies $(\mu, A)$-normal can be found in theorem 1 in http://www2.stat.duke.edu/~st118/sta732/mvnormal.pdf. The proof uses these lemmas/definitions:

moment generating functions for vector-valued random variables.
moment generating function uniquely determines probability distribution.
moment generating function of univariate normal distribution.
covariance matrix is positive semidefinite.
A PSD matrix $B$ can be expressed as $B = CC^T$.
$\Var(PX + Q) = P\Var(X)P^T$.
$(\mu, A)$-normal implies normal.

The theorem above also proves that if $X$ is $(\mu, A)$-normal, the moment generating function of $X$ depends only on $\mu$ and $AA^T$, and that $\E(X) = \mu$ and $\Var(X) = AA^T$.

Dependency for: None

Info:

Depth: 12
Number of transitive dependencies: 78

Transitive dependencies:

/analysis/topological-space
/sets-and-relations/countable-set
/linear-algebra/vector-spaces/condition-for-subspace
/linear-algebra/matrices/gauss-jordan-algo
/sets-and-relations/equivalence-relation
/linear-algebra/matrices/psd-full-rank
/linear-algebra/matrices/sqrt-of-symm-matrix
/probability/normal-distr/mgf
/probability/rand-vars/mgf-gives-distr
/probability/rand-vars/moment-generating-function
/probability/rand-vars/var-matrix-is-psd
/probability/rand-vars/variance/variance-of-affine-form-of-matrix
/analysis/exponential-grows-superpolynomially
/analysis/integration-by-parts
/sets-and-relations/de-morgan-laws
/measure-theory/linearity-of-lebesgue-integral
/measure-theory/lebesgue-integral
σ-algebra
Generated σ-algebra
Borel algebra
Measurable function
Generators of the real Borel algebra (incomplete)
Measure
σ-algebra is closed under countable intersections
Group
Ring
Polynomial
Integral Domain
Comparing coefficients of a polynomial with disjoint variables
Field
Vector Space
Linear independence
Span
Semiring
Matrix
Stacking
System of linear equations
Product of stacked matrices
Matrix multiplication is associative
Reduced Row Echelon Form (RREF)
Matrices over a field form a vector space
Row space
Elementary row operation
Every elementary row operation has a unique inverse
Row equivalence of matrices
Row equivalent matrices have the same row space
RREF is unique
Identity matrix
Inverse of a matrix
Inverse of product
Elementary row operation is matrix pre-multiplication
Row equivalence matrix
Equations with row equivalent matrices have the same solution set
Rank of a matrix
Basis of a vector space
Linearly independent set is not bigger than a span
Homogeneous linear equations with more variables than equations
Rank of a homogenous system of linear equations
Full-rank square matrix in RREF is the identity matrix
Full-rank square matrix is invertible
Probability
Conditional probability (incomplete)
Independence of events
Independence of composite events
Random variable
Expected value of a random variable
Distribution of sum of random variables (incomplete)
Independence of random variables (incomplete)
Linearity of expectation
Variance of a random variable
Covariance of 2 random variables
Linearity of expectation for matrices
Cross-covariance matrix
Covariance matrix
Gaussian integral (incomplete)
Normal distribution
Sum of independent normal random variables is normal
Standard multivariate normal distribution