CS 294 / Stat 260, Fall 2014:

Learning in Sequential Decision Problems

Readings


This page contains pointers to a collection of resources on the topics covered in lectures.

Tutorials/Courses/Survey Papers/Texts

Stochastic Bandits

Classics
Regret bounds and UCB strategies
Gittins Index
Thompson Sampling

Adversarial Bandits

Partial monitoring

Contextual bandits

Linear bandits



Back to course home page