Updated 10-14-2002


The Normal distribution


Univariate Normal distribution, $X \sim N(\mu, \sigma^2)$


\begin{displaymath}f(x) = \frac{1}{\sqrt{2 \pi \sigma^2}} e^{- \frac{(x-\mu)^2}{2
\sigma^2}}
\end{displaymath}




Multivariate Normal distribution $\underline{X} \sim
N_p(\underline{\mu}, \underline{\Sigma})$


\begin{displaymath}f(x) = \frac{1}{(2 \pi)^{\frac{p}{2}} \vert\underline{\Sigma}...
...ime
\underline{\Sigma}^{-1} (\underline{x} - \underline{\mu})}
\end{displaymath}


Normal distribution and the likelihood function


Recall that maximum likelihood estimation asks us to find the $\mu$and $\Sigma$ which maximize the likelihood L (or the log likelihood).

Given n i.i.d. random vectors $\underline{X}_i \sim N(\underline{\mu}, \underline{\Sigma})$ the log likelihood is

\begin{displaymath}log L = \frac{pn}{2} \log {2\pi} - \frac{n}{2} \log
\vert\und...
...me
\underline{\Sigma}^{-1} (\underline{x}_i - \underline{\mu})
\end{displaymath}

After a bit of matrix algebra, this can be transformed into


\begin{displaymath}log L = constant + \frac{n}{2} \left[ \log \vert\underline{\S...
...nderline{\Sigma}^{-1} (\bar{\underline{x}} - \underline{\mu})
\end{displaymath}

Since $\underline{\mu}$ only appears in the last term, we can maximize the likelihood with respect to $\underline{\mu}$ by minimizing the last term with respect to $\underline{\mu}$. This is clearly done when $\underline{\hat{\mu}} = \bar{\underline{x}}$

So what we are left with doing is maximizing the remaining part with respect to the parameters in $\underline{\Sigma}$. Recall that in the exploratory factor analysis $\underline{\Sigma} =
\underline{\Lambda} \underline{\Lambda}^{\prime} + \underline{\Psi}$. So we want to maximize the following with respect to $\Lambda$ and $\Psi$.


\begin{displaymath}log L = \frac{n}{2} \left[ \log \vert\underline{\Sigma}^{-1}\vert -
trace({\bf S} \underline{\Sigma}^{-1})\right]
\end{displaymath}


Maximizing the Likelihood


GOODNESS of FIT TEST
BESIDES GIVING ESTIMATES for $\Lambda$ and $\Psi$, Maximum Likelihood Provides a GOODNESS of FIT TEST.


ROTATION
No matter what estimation procedure, for exploratory factor analysis we get estimates that look like:


\begin{displaymath}\hat{\underline{\Sigma}} =
\hat{\underline{\Lambda}} \hat{\underline{\Lambda}}^{\prime} +
\hat{\underline{\Psi}}
\end{displaymath}

We can get the exact same $\hat{\underline{\Sigma}}$ by taking


\begin{displaymath}\hat{\underline{\Sigma}} =
\hat{\underline{\Lambda}} T T^\prime \hat{\underline{\Lambda}}^{\prime} +
\hat{\underline{\Psi}}
\end{displaymath}

where T is a $q \times q$ matrix such that $T T^\prime$ = Identity matrix.




Thus


\begin{displaymath}\hat{\underline{\Sigma}} =
\hat{\underline{\Lambda}}^* {\hat{\underline{\Lambda}}}^{* \; \prime} +
\hat{\underline{\Psi}}
\end{displaymath}

where $\hat{\underline{\Lambda}}^*$ is the orthogonally rotated factor loading matrix, i.e. $\hat{\underline{\Lambda}}^* =
\hat{\underline{\Lambda}} T$ . BUT, There are an infinite number of matrices T that satisfy $T T^\prime$ = I.


ROTATION


Determining which variables measure which factors


Degrees of freedom in the exploratory factor analysis model


SAS Proc Factor Example using the visual perception/ reading skills data - Handouts in class.

1. Output from Proc Factor (handout)

2. Picturing rotation (handout)

Degrees of freedom for the visual perception/ reading skills data using exploratory factor analysis with 2 factors:

p = 6 (observed variables)

q = 2 (latent factors)

d.f. = 6*7/2 - (6*2+6) + 2*1/2 = 21 - 18 + 1 = 4