CS 281A / Stat 241A, Spring 2007:

Statistical Learning Theory


People




Office hours
Professor Peter Bartlett bartlett@cs Tue 3-4, 399 Evans. Wed 3-4, 723 Soda.
TAs
Alexandre Bouchard
bouchard@cs Tuesday 1-2, Thursday 1-2, 751 Soda.

Mikhail Traskin
mtraskin@stat Wednesday 10-12, 387 Evans.

Lectures:  Soda 306. Tuesday/Thursday 11-12:30.

Discussion section:  Evans 334. Thursday 5-6.

Course description

This course will provide an introduction to probabilistic and computational methods for the statistical modeling of complex, multivariate data. It will concentrate on graphical models, and in particular issues of representation, estimation, and inference in parametric models. [More details: html pdf]

Prerequisites:

The prerequisites are previous coursework in linear algebra, multivariate calculus, and basic probability and statistics. Previous coursework in graph theory, information theory, and optimization theory would be helpful but is not required.  Familiarity with Matlab, Splus or a related matrix-oriented programming language will be necessary.

Text book:

The course will follow the unpublished manuscript An Introduction to Probabilistic Graphical Models, by Michael I. Jordan, which is available from Copy Central at 2483 Hearst Avenue. It is reader number 54 (ask for CS C281A) and costs $23.
For an introduction, see the review paper Michael I. Jordan, Graphical models. .

Assignments:

There will be a substantial project (40% of the grade), plus regular homework assignments (60% of the grade; approximately one every two weeks).
It is appropriate to discuss homework assignments with other students, but homeworks must be written up individually. If you discuss the assignment with other students, please list their names on your homework. See the policy on academic dishonesty.

Lectures:

The chapter numbers under 'Reading' refer to the text An Introduction to Probabilistic Graphical Models, Michael I. Jordan.
Topic Reading
Jan 16 Introduction; directed graphical models 1, 2
Jan 18 directed graphical models 2.1
Jan 23 undirected graphical models 2.2, 2.3
Jan 25 elimination algorithm (guest lecturer: Martin Wainwright) 3
Jan 30 graph elimination, sum-product algorithm 3,4
Feb 1 factor graphs, poly trees 4
Feb 6 max-product algorithm, parameter estimation 4,5
Feb 8 parameter estimation 5
Feb 13 linear regression 6
Feb 15 linear classification 7
Feb 20 exponential family 8 (+ notes on exponential family
Feb 22 exponential family 8 (+ notes on exponential family
Feb 27 estimation with complete observations 9 (+ notes on chordal graphs)
Mar 1 estimation with complete observations: IPF 9
Mar 6 estimation with hidden variables: EM 10
Mar 8 estimation with hidden variables: EM 11 (+ notes on EM implementation)
Mar 13 inference in HMMs 12
Mar 15 estimation in HMMs 12
Mar 20 factor analysis 14
Mar 22 state space models 15
Mar 27 Spring Break
Mar 29 Spring Break
Apr 3 inference and estimation in state space models 15 (+ An approach to time series smoothing and forecasting using the EM algorithm.
R. Shumway and D. Stoffer. J. Time Series Analysis 3(4):253-264, 1982.
)
(+ notes on ML Gaussian estimates)
Apr 5 junction tree 17
Apr 10 junction tree, HMMs 17,18
Apr 12 junction tree and HMMs 18
Apr 17 junction tree and Kalman filter; maximum entropy 18,19
Apr 19 iterative scaling 20
Apr 24 iterative scaling 20
Apr 26 approximate inference: variational methods (guest lecturer: Alex Bouchard) A variational principle for graphical models. M. Wainwright and M. Jordan. 2005.
Tutorial on variational approximation methods. T. Jaakkola. 2000.
Alex's slides
May 1 approximate inference: sampling methods 21
May 3 CS281A project poster session
May 8 Stat241A project poster session

Announcements: