Updated 9-24-02
FACTOR ANALYSIS MODEL
A bunch of Factor Analysis exmples can be found at http://www.psych.yorku.ca/lab/psy6140/ex/factor.htm
-
:
- p - dimensional observed vector
-
:
- q - dimensional underlying
factors.
are often called ``common factors''. Assume
for now
and
-
:
- p - dimensional random
error.
are often called ``unique factors'' or
``specific factors''. Assume
and
where
is a diagonal matrix.
-
:
-
matrix of scalars called ``factor loadings''
-
:
-
vector of scalar intercepts.
Often ignored if we are only interested in interrelations. Most
software assume
by default and
center the
variables, i.e. analyze

KEY ASSUMPTIONS
- 1.
-
.
- factors are uncorrelated with the error terms
- 2.
-
is a diagonal matrix.
- Recall
- Each individual
error term is uncorrelated with all the others.
- Thus, given that we know
,
the observed variables are
uncorrelated, i.e. the partial correlations of any of the observed variables
on the others is zero once we have adjusted for the factors.
- Basic idea underlying most latent variable models called the AXIOM of CONDITIONAL
INDEPENDENCE
Focus on Covariance structure
-
is called the model covariance matrix
-
is a function of all the model parameter
that make up the covariance structure
- We will estimate
using
,
the
sample covariance matrix
OR Standardize
to get
,
i.e.
-
are the ``standardized
factor loadings''
- We will estimate
using
,
the
sample correlation matrix
NOTE: There are several different estimation procedures (e.g. 1. principal
factor method, 2. normal theory maximum likelihood method, and others).
For the
maximum likelihood method,
obtained by
analyzing the correlation matrix is the same as rescaling the
obtained by analyzing the covariance matrix by
the observed standard deviations,
i.e.
.
(For a discussion, see section
3.17 in Bartholomew and Knott 1998) THIS IS
NOT TRUE WHEN THE PRINCIPAL FACTOR METHOD IS USED.
COMMUNALITIES
- Communalities are the diagonal elements of
or
- Communalities are the part of the variance of each observed
variable which is due to the q underlying factors
EXAMPLE: The communality for x2 is
- Discuss difference between PCA and FA from the Hatcher Handout.
PARAMETER ESTIMATION
We want to analyze the covariance structure of the factors, i.e.
. We want to estimate
.
TWO estimation procedures will be discussed (Only one will be
discussed in 2002 school year)
- 1.
- Principal factor method (won't discuss details of this method)
- Default method in SAS Proc FACTOR (probably SPSS too)
- Computationally simpler, historically this method came first
- Discussed in sections 3.10-3.15 of BK
- 2.
- Normal theory Maximum likelihood method
- Discussed in sections 3.5-3.9 of BK
- Prefered method in general
SKIP in 2002.....PARAMETER ESTIMATION - VIA Principal
factor method
Some NOTES
- The rank of
is equal to q (i.e. equal to the number of underlying factors).
This implies that (p-q) eigenvalues of
are zero.
- PCA (or eigenvalue/eigenvector decomposition) allowed us to
take a matrix and factor it into
,
well this can be
written as
- Thus we will use this idea to find
by using PCA on
or

Since we don't know
, we will start off with a guess
and then iterate.....
SKIP in 2002......Need a first guess for 
- call it
- Recall that there are p nonzero elements in
,
i.e. a
uniqueness for each error term
- Basically can use anything bigger than 0 and less than
for our first guess.
- By default SAS PROC FACTOR analyzes the correlation matrix so
actually we are finding a first guess for
- A common choice is to takes
as
.
If you are
analyzing the covariance matrix,
use
as
.
- SMC's are the squared multiple correlations. That is the
R2 resulting from a regression of each variable on all the rest.
- SAS calls these the prior communalities
SKIP in 2002..... ITERATIVE
PROCEDURE
- 1.
- Extract q eigenvectors from

- 2.
- Call these q eigenvectors

- 3.
- Re estimate
by taking
- 4.
- Label this new estimate of
as

- 5.
- Go back to step 1 and continue iterating until convergence
SKIP in 2002......INTERESTING NOTE about Principal
factor method
BK (sections 3.10, 3.11) show that the principal factor method is identical to solving for
and
such that
is as small as possible.
This is the same thing as solving for
and
such that
is as small as possible. This is the Ordinary Least
Squares discrepancy function.
Note that this minimization criterion does not assume
anything about the correlations between the elements of
. Hence we are ignoring information.