Updated 10-14-2002
The Normal distribution
Univariate Normal distribution,
Multivariate Normal distribution
Normal distribution and the likelihood function
Recall that maximum likelihood estimation asks us to find the
and
which maximize the likelihood L (or the log likelihood).
Given n i.i.d. random vectors
the log
likelihood is
After a bit of matrix algebra, this can be transformed into
Since
only appears in the last term, we can maximize the
likelihood with respect to
by minimizing the last
term with respect to
. This
is clearly done when
So what we are left with doing is maximizing the remaining
part with respect to the parameters in
.
Recall that in the exploratory factor analysis
.
So we want to maximize the following with respect to
and
.
Maximizing the Likelihood
- The maximization is done by iteration.
- Details of it are probably not that
interesting for most in the class.
- Needs starting values the same
way that Principal Factor Analysis did but uses these starting
values differently
- For those who are interested: The iteration starts by taking the eigenvalues of the following:
This is implicit in the discussion of p. 47 of BK. It is the
eigenvalues of this that appear on the first page of the PROC FACTOR output.
- I don't think I can provide any simple intuition about this iterative
technique. Just trust that it converges to the Maximum Likelihood
Estimators. Call them
and
,
so
.
GOODNESS of FIT TEST
BESIDES GIVING ESTIMATES for
and
, Maximum Likelihood
Provides a GOODNESS of FIT TEST.
- Use Likelihood Ratio Test, i.e.,
- IF the model fits the data well, this statistic should be
small!
- Its distribution is asymptotically distributed
with
degrees of freedom = (
- number of free parameters in model).
- We can determine if the the statistic is ``small'' enough by
comparing to the
distribution and obtaining a p-value.
- The Hypothesis being tested
-
- H0: The model is correct (i.e. q factors sufficiently
describe the p
dimensional vector)
-
- HA: More factors are needed
- Thus we are looking for the model where we DO NOT REJECT the
H0 (i.e. find a big p-value)
ROTATION
No matter what estimation procedure, for exploratory
factor analysis we get estimates that
look like:
We can get the exact same
by taking
where T is a
matrix such that
= Identity
matrix.
Thus
where
is the orthogonally
rotated factor loading matrix, i.e.
. BUT, There are an infinite number of matrices T that satisfy
= I.
- Why do we bother to rotate? - We want to find out which
variables ``stick together''
- How to choose a T? - Find one that gives the ``simplest''
structure to the factor loadings
ROTATION
Determining which variables measure which factors
- Examine the rotated factor loadings
- Rule of thumb, if the absolute value of the standardized loading is >.3, the variable is relevant for the particular factor.
- Do the variables that load on a given factor share some conceptual meaning?
- Examine the communality. Rule of thumb, if the communality is less than .10 then the observed variable can be deleted. A communality < .10 means that less than 10% of the variability in the observed variable is expained by all of the common factors.
- Cudeck, R. and O'Dell, L. (1994) "Applications of standard error estimates in unrestricted factor analysis: Significance Tests for factor loadings and correlations" Psychological Bulletin This paper argues that the >.3 cutoff is too simplistic and provides a method for significance testing of rotated loadings.
Degrees of freedom in the exploratory factor analysis model
-
has p*q parameters
-
has p parameters
- Recall that the degrees of freedom equal p*(p+1)/2 minus the number of parameters that are being estimated freely.
- Because of rotation, a set of restrictions needs to placed on the parameters so that the software knows which version of
to estimate.
SAS Proc Factor assumes that
is diagonal. (i.e. it places q*(q-1)/2 restrictions on the parameters in
.
- So d.f. = p*(p+1)/2 - (p*q + p) + q*(q-1)/2
SAS Proc Factor Example using the visual perception/ reading skills data - Handouts in class.
1. Output from Proc Factor (handout)
2. Picturing rotation (handout)
Degrees of freedom for the visual perception/ reading skills data using exploratory factor analysis with 2 factors:
p = 6 (observed variables)
q = 2 (latent factors)
d.f. = 6*7/2 - (6*2+6) + 2*1/2 = 21 - 18 + 1 = 4