Jim Hodges

Associate Professor, Division of Biostatistics, University of Minnesota

Statistician, Minneapolis Heart Institute Foundation

I'm the one on the right. The one on the left is Li Chi-ping, who became my bride on 13 February 2007. (Photo taken March 2006, Taipei).

My locations

Main location: U of Minnesota Division of Biostat

  • Division of Biostatistics
  • School of Public Health
  • University of Minnesota
  • 2221 University Ave SE, Suite 200
  • Minneapolis, Minnesota 55414
  • Phone (612) 626-9626, Fax (612) 626-9054
  • e-mail: hodges@ccbr.umn.edu, hodge003@umn.edu (they go to the same inbox)

    At the Minneapolis Heart Institute Foundation

  • Minneapolis Heart Institute Foundation
  • 920 E 28th St
  • Minneapolis, MN 55407
  • phone (612) 863-1626
  • e-mail: JimHodgesMHIF@gmail.com
  • [NOTE: I am at MHIF only on Tuesday and Friday afternoons and generally don't check this e-mail account on other days.]

    Curriculum Vitae (or whatever "CV" stands for)

    Having trouble sleeping? Take a look at Jim's vita, current as of 7 May 2009.

    Materials for PubH8400/02 Spring 2008 "Richly Parameterized Models".

    Official syllabus

    Detailed syllabus, revised 2/18/08.

    Suggestions for class projects updated version 3/10/08.

    Papers assigned as reading

  • Cui, Hodges, Carlin (2007)
  • Hodges (1998)
  • Hodges, Cui, Sargent, Carlin 2007 published version;
  • Peterson et al 2001
  • Reich and Hodges 2007
  • Reich et al 2007 JASA version, page proofs; Research report version, rr2004-004
  • Reich et al 2006.
  • Reich & H, Spatially-adaptive CAR paper, Feb 2007 version

    Transparencies used in lectures; labels for files refer to the detailed syllabus.

  • Part I, Section A, 1, 2a (through conventional analyses) here; error on page IA1/13
  • Part I, Section A, 2b, 3, 4 (through the end of IA) here
  • Part I, Section B (alternative formulation; measures of complexity) here updated 1/30/08, replaced version posted 1/28/08
  • Part I, Section B thesis topics, Section C here
  • Part I, Section D except discrete-by-discrete interactions here
  • Part I, Section D, discrete-by-discrete interactions (smoothed ANOVA) here
  • Part I, Section E, spatial smoothing 1 (CAR smoothing on a lattice) here
  • Part I, Section E, spatial smoothing 2 (2D penalized splines) here
  • Part I, Section F, time series (dynamic linear models, Kalman filter-style models) here
  • Part II, Section A, Simple extensions of linear-model diagnostics here
  • Part II, Section B, Collinearity/Confounding and Smoothing/Shrinkage (beginning) here
  • Part II, Section B, Collinearity & smoothing (CAR smoothing) here, revised 3/7/08
  • Part II, Section B, concluded; Section C (identification of parameters in the variance structure) teaser here
  • Part II, Section C, identification of parameters in the variance structure, oddities here, sorry about the poor quality of the copies; all of the pictures that copied poorly are in the research-report version of Reich et al 2007.
  • Part II, Section C, identification of the parameters in the variance structure, Reich & Hodges 2007 here
  • Part II, Section C, previous continued; R&H 2007 for CAR model; 3-variance models here
  • Part II, Section C concluded, CAR models with 2 classes of neighbor pairs here
  • Part II, SPECIAL EXTRA, Random effects in the analysis vs random effects in data generation: lecture transparencies; there's also a draft paper covering the same ground more explicitly (or at least, using more words).
  • Part II, Section D, Influence of a few "observations" on variance-structure unknowns here
  • Part II, Section D, Generalizing last class's r-hat_i so z_i; applying the result to a spatio-temporal model for attachment loss here
  • Part II, Section D, a better spatio-temporal model, z_i still applies here
  • Part II, Section E, two last oddities from real datasets (oddities I wish someone had shown me) here

    Homework assignments

  • #1, 1/29/08 error in question 2
  • #2, 2/5/08
  • #3, 2/12/08
  • #4, 2/19/08

    Datasets

  • Molecular structure of a virus
  • Vocal folds
  • Global mean surface temperature deviations (in 0.01 degrees C) Text file, Excel file.
  • Physical properties of pig jawbone Excel file
  • Soft material polishability data -- as in Appendix B of Hodges et al 2007 -- columns are not scaled! Excel file

    Materials from Summer 2008 VA Methodology Group series: "Everything is a Mixed Linear Model".

    The CCDOR Methodology Group will present a series of 5 sessions led by Jim Hodges, "Everything is a mixed linear model". Topics, times, and locations (all rooms at the Minneapolis VA Medical Center) are given below; all presentations are on Thursdays at noon or 3:30. Each Topic has a reading. Presentations will be fairly informal but mostly aimed at people with at least a linear models course in grad school. However, Topic #3 should be of interest to a broader audience and will be less technical.

    Each topic's readings are listed below under the respective topics. Two of the readings are from the excellent book "Semiparametric Regression" (the book's website contains lots of useful things), by David Ruppert, Matt P. Wand, and Ray J. Carroll (2003, Cambridge U Press), which I recommend strongly if you want to learn more about penalized splines. (I got my soft-cover copy for about $35 on Amazon.com -- cheap!)

    Topic #1: Penalized splines as mixed linear models (MLMs)

    10 July, noon, 3B-137

    17 July, 3:30, 3E-136

    Penalized splines are a way to fit smooth curves to data. They were developed in their own theoretical universe but can be expressed as MLMs and thus combined with other effects (fixed and random) and estimated using software like PROC MIXED. These two sessions develop the idea of penalized splines and fit them into the MLM framework.

    Reading: Ruppert, Wand, and Carroll Chapter 3, Section 4.9, Chapter 6 sections 1-4. On a first reading, you can skip sections 3.4, 3.7, 3.8, 3.11, 3.15-3.18, and 6.3.

    Transparencies: here

    Topic #2: Spatial smoothing using mixed linear models.

    14 August, noon, 3E-136

    This class of analyses also grew up in its own universe but can be expressed as MLMs. First, I'll discuss conditional autoregressive models for areal data (where the dependent variable is the total or average over an area, e.g., county or VISN), and then present 2- dimensional penalized splines for point-referenced data or, in some cases, areal data.

    Readings: The first reading is some transparencies by me about fitting the conditional autoregressive (CAR) model into the MLM framework; the second reading Ruppert et al's Chapter 13, sections 13.1 to 13.4. On a first reading, you can skip section 13.3.

    Transparencies, in three parts: CAR models, 2-D splines, and example.

    Topic #3: Random effects can confound the fixed effects you care about

    28 August, 3:30, 3E-136

    Adding spatially correlated errors or a clustering effect to a model doesn't just inflate standard errors, it also in effect adds new implicit predictors that may be collinear with the predictor you care about. This will be obvious as soon as I write down the models, but the spatial people I know find this bizarre and unsettling, and nobody seems to know that simple clustering can have this effect.

    Reading: The reading is a paper by Reich, Hodges, and Zadnik (Biometrics 2006) about how adding CAR-distributed errors to an analysis confounds a fixed effect of interest. You can skip section 4 on a first reading.

    Transparencies, here.

    Topic #4: Random effects are not necessarily random.

    11 September, noon, 3E-136

    A huge range of models can be expressed and analyzed as MLMs. However, the models in Topics 1-3 would not have been recognized as random-effect models by, say, Scheffe. This conceptual quibble has real practical consequences, which this session will explore.

    Reading: This reading is a little polemic I wrote which, like all polemics, is too strong but I hope it has some entertainment value and gets you thinking about our headlong rush to compute things we don't understand.

    Transparencies here.

    Futher materials

    You might also be interested in the materials from a course I taught recently at the Division of Biostat, from which this series is mostly drawn. The stuff is just up above on this web page, above this VA series and below the addresses, under the heading "Materials for PubH8400/02 Spring 2008 'Richly Parameterized Models'". This has links to a detailed syllabus for the course, all the transparencies I used as overheads during the lectures, five datasets used as examples, and so on.


    Papers you can download

  • A postscript version of Sargent DJ, Hodges JS, Smoothed ANOVA with application to subgroup analysis, submitted to JASA some years ago and rejected with enthusiasm. A completely reworked and much better version has been accepted by Technometrics; the Research Report version is rr2005-018 on the U of MN Biostat web site.

  • A postscript version of Hodges JS, Sargent DJ, "Counting degrees of freedom in hierarchical and other richly parameterized models". The original version has some interesting stuff that's not in the Biometrika version (Hodges JS, Sargent DJ. Counting degrees of freedom in hierarchical and other richly-parameterised models. Biometrika, 88:367-379, 2001).

    Items associated with "Some algebra and geometry for hierarchical models, applied to diagnostics" (JRSSB 1998)

    Dataset: This dataset is in ASCII format. The file containing the dataset has two matrices -- one for plan-level data and one for state-level data -- and some introductory material.

    S+ functions: Peiming Ma has written S+ functions to execute the analyses in this paper. You can get separate postscript files for: documentation 1 and documentation 2, and ASCII files containing the functions for: gibbs sampler, trace plots, added-variable plot, collinearity check, case influence, residuals, and transformations. Although we have tested these functions and they work as far as we know, USE THEM AT YOUR OWN RISK! Also, we make no claims to efficiency or exemplary programming style, but they do appear to work.


    Last updated: April 2008.


    Return to U of M Biostat home page.


    Official Disclaimer: The views and opinions expressed in this page are strictly those of the page author. The contents of this page have not been approved by the University of Minnesota.

    Unofficial Disclaimer: This is all my fault. The U is blameless. They're such nice people, how could you even think of blaming them! Shame on you!