PubH 5421 Biostatistical Computing II - Fall 2003


School of Public Health


Division of Biostatistics


If you're right 90% of the time, why worry about the remaining 3% ? - Unknown.


The intent in this course is to gain some skill in statistical computing using SAS, Splus, and possibly FORTRAN or C. The recurring theme is solving problems by turning an algorithm into a program that provides relevant answers. Some background in probability and statistics, including an introduction to likelihood, is needed.

Topics covered will include use of pseudo-random number generators, distribution functions (pdfs and cdfs), matrix manipulations with applications to regression and estimation of variance, simulation studies, minimization of functions using various algorithms, nonlinear regression, manipulation and combination of datasets, techniques of macro programming, and methods of integration using quadrature.

The course is intended primarily for MS and MPH students in Biostatistics and Statistics. At least one previous semester of courses in biostatistics and statistics and concurrent or previous enrollment in a course involving linear regression are strongly recommended.

The main content of the course is a series of computing projects (see Syllabus). Datasets based on previous studies will be available for providing examples of computational methods.

This course is not intended as a continuation of SPH 5420. In general the level of statistical knowledge and mathematical skill is somewhat advanced over that required in SPH 5420 or the SPH 5450-5452 sequence.


Notes for this course:

  • notes.001 : Distributions
  • notes.002 : Randomization schedules
  • notes.003 : Pseudo-random numbers
  • notes.004 : Sample size, power, simulations
  • notes.005 : Binomial outcomes
  • notes.006 : Some program examples
  • notes.007 : More on simulations
  • notes.008 : Computation of summary stats
  • notes.009 : More on summary stats
  • notes.010 : More on summary stats
  • notes.011 : Mysteries of the SAS data step
  • notes.012 : Simulation from additional distribs
  • notes.013 : Linear transformations, matrices 1
  • notes.014 : Linear transformations, matrices 2
  • notes.015 : Linear transformations, matrices 3
  • notes.016 : SAS proc iml
  • notes.016a : SAS proc iml basics ...
  • notes.017 : Proc iml: solving nonlinear eqns
  • notes.018 : Proc iml: finding maxima, minima
  • notes.019 : Proc iml: maximum likelihood estn
  • notes.020 : Proc iml: max like estn, contin.
  • notes.021 : Proc iml: simulating multivar normals
  • notes.022 : SAS macros: macro variables
  • notes.023 : SAS macros: basics plus ...
  • notes.023a : SAS macros: Some generalities & examples
  • notes.024 : SAS macros: SAS/GRAPH ...
  • notes.025 : SAS macros: More SAS/GRAPH ...
  • notes.026 : SAS PROC NLIN: Introduction.
  • notes.027 : SAS PROC NLIN and amoeba ...
  • notes.028 : Variance Estimation, I
  • notes.029 : Variance Estimation, II : Delta Method
  • notes.030 : Numerical Integration
  • notes.031 : Numerical Integration, Contin.: Romberg Integration
  • notes.032 : Bayes Computations and Numerical Integration


  • Biostatistics Home Page

    Web address of this page: http://www.biostat.umn.edu/~john-c/ph5421.s2003.html

    Most recent update: September 1, 2003.


  • Mt. Etna, Sicily (eruption of November, 2002)

    Web counter provided by WebCounter(TM), http://www.digits.com