CS 294 / Stat 260, Fall 2014:

Learning in Sequential Decision Problems


This page contains pointers to a collection of resources on the topics covered in lectures.

Tutorials/Courses/Survey Papers/Texts

Stochastic Bandits

Regret bounds and UCB strategies
Gittins Index
Thompson Sampling

Adversarial Bandits

Partial monitoring

Contextual bandits

Linear bandits

Back to course home page