Linear Discriminant Analysis for p > 1

Assume that X=(X1,X2,…,Xp) is drawn from a multivariate Gaussian (or multivariate normal) distribution, with a class-specific mean vector and a common covariance matrix.

The multivariate Gaussian distribution assumes that each individual predictor follows a one-dimensional normal distribution with some correlation between each pair of predictors.

To indicate that a p-dimensional random variable X has a multivariate Gaussian distribution, we write X∼N(μ,Σ). Here E(X)=μ is the mean of X (a vector with p components), and Cov(X)=Σ is the p×p covariance matrix of X. Formally, the multivariate Gaussian density is defined as:

$$f(x) = \frac{1}{(2\pi)^{p/2} \, |\Sigma|^{1/2}} \exp\!\left( -\frac{1}{2} (x - \mu)^T \Sigma^{-1} (x - \mu) \right)$$
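To make the density concrete, the sketch below evaluates the formula above directly with NumPy and compares it against `scipy.stats.multivariate_normal`; the mean vector, covariance matrix, and evaluation point are made-up example values.

```python
import numpy as np
from scipy.stats import multivariate_normal

# Hypothetical example parameters for p = 2 predictors
mu = np.array([0.0, 1.0])                  # mean vector
Sigma = np.array([[2.0, 0.5],
                  [0.5, 1.0]])             # covariance matrix
x = np.array([1.0, 0.0])                   # point at which to evaluate the density

# Density computed term-by-term from the formula
p = len(mu)
diff = x - mu
manual = (
    1.0 / ((2 * np.pi) ** (p / 2) * np.linalg.det(Sigma) ** 0.5)
    * np.exp(-0.5 * diff @ np.linalg.inv(Sigma) @ diff)
)

# Same density via scipy, as a cross-check
scipy_val = multivariate_normal(mean=mu, cov=Sigma).pdf(x)
print(manual, scipy_val)
```

The two values agree, which confirms the manual formula matches the library implementation.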

In the case of p>1 predictors, the LDA classifier assumes that the observations in the kth class are drawn from a multivariate Gaussian distribution N(μk,Σ), where μk is a class-specific mean vector, and Σ is a covariance matrix that is common to all K classes.

The Bayes classifier assigns an observation X=x to the class for which

$$\delta_k(x) = x^T \Sigma^{-1} \mu_k - \frac{1}{2} \mu_k^T \Sigma^{-1} \mu_k + \log \pi_k$$

is largest.
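A minimal sketch of this classification rule, computing the discriminant score for each class and picking the largest; the two class means, common covariance matrix, priors, and test point are hypothetical values chosen for illustration.

```python
import numpy as np

# Hypothetical parameters: K = 2 classes, p = 2 predictors
Sigma = np.array([[1.0, 0.2],
                  [0.2, 1.0]])             # covariance shared by all classes
mus = [np.array([0.0, 0.0]),
       np.array([2.0, 2.0])]               # class-specific mean vectors mu_k
priors = [0.5, 0.5]                        # class priors pi_k

Sigma_inv = np.linalg.inv(Sigma)

def delta(x, mu, pi):
    """Linear discriminant score delta_k(x) = x^T S^-1 mu - 0.5 mu^T S^-1 mu + log pi."""
    return x @ Sigma_inv @ mu - 0.5 * mu @ Sigma_inv @ mu + np.log(pi)

x = np.array([0.4, 0.3])                   # observation to classify
scores = [delta(x, mu, pi) for mu, pi in zip(mus, priors)]
predicted = int(np.argmax(scores))         # class with the largest score wins
print(scores, predicted)
```

Since x lies much closer to the first class mean, the first discriminant score is larger and class 0 is predicted.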

The Bayes decision boundaries are the values x for which δk(x)=δl(x):

$$x^T \Sigma^{-1} \mu_k - \frac{1}{2} \mu_k^T \Sigma^{-1} \mu_k = x^T \Sigma^{-1} \mu_l - \frac{1}{2} \mu_l^T \Sigma^{-1} \mu_l$$

for k ≠ l, where πk is assumed to be the same for each class (so the log πk terms cancel). The Bayes classifier then assigns an observation to the class whose region contains it.

As in the p = 1 case, the LDA method approximates the Bayes classifier by plugging estimates of μk, πk, and Σ into δk(x).
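A short sketch of this estimation step using scikit-learn's `LinearDiscriminantAnalysis`, which computes exactly these plug-in estimates; the two-class training data here is simulated from Gaussians with a shared (assumed) covariance matrix.

```python
import numpy as np
from sklearn.discriminant_analysis import LinearDiscriminantAnalysis

rng = np.random.default_rng(0)

# Simulated training data: two classes, different means, common covariance
Sigma = np.array([[1.0, 0.3],
                  [0.3, 1.0]])
L = np.linalg.cholesky(Sigma)              # used to impose the covariance structure
X0 = rng.standard_normal((100, 2)) @ L.T + np.array([0.0, 0.0])
X1 = rng.standard_normal((100, 2)) @ L.T + np.array([2.0, 2.0])
X = np.vstack([X0, X1])
y = np.array([0] * 100 + [1] * 100)

lda = LinearDiscriminantAnalysis()
lda.fit(X, y)

print(lda.means_)    # estimated class means mu_k
print(lda.priors_)   # estimated priors pi_k (class proportions)
print(lda.score(X, y))   # training accuracy
```

With equal class sizes the estimated priors come out at 0.5 each, and the estimated means sit near the true means of the simulated classes.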

Model performance can then be assessed with a confusion matrix (to tabulate the types of errors), along with overall accuracy and a ROC curve.
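A sketch of these checks with scikit-learn, fitting LDA on simulated two-class data (hypothetical means and unit covariance) and computing the confusion matrix and the area under the ROC curve.

```python
import numpy as np
from sklearn.discriminant_analysis import LinearDiscriminantAnalysis
from sklearn.metrics import confusion_matrix, roc_auc_score

rng = np.random.default_rng(1)

# Simulated data: two overlapping Gaussian classes
X0 = rng.standard_normal((200, 2))
X1 = rng.standard_normal((200, 2)) + 1.5
X = np.vstack([X0, X1])
y = np.array([0] * 200 + [1] * 200)

lda = LinearDiscriminantAnalysis().fit(X, y)
pred = lda.predict(X)
scores = lda.predict_proba(X)[:, 1]        # posterior probability of class 1

cm = confusion_matrix(y, pred)             # rows: true class, columns: predicted class
auc = roc_auc_score(y, scores)             # area under the ROC curve
print(cm)
print(auc)
```

The off-diagonal entries of the confusion matrix count the two kinds of misclassification, while the AUC summarizes how well the posterior probabilities rank the two classes across all thresholds.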

