Lectures and Video from DeepLearn 2022
Link to Slides: |
||
Link to Video: |
||
Part 1, Goals and Challenges and solutions!
Introduction to the ODE method in a simple deterministic setting, with applications to extremum seeking control (a class of algorithms for online optimization, with applications to reinforcement learning). Much is taken from Chapter 4, and 2022 publications available on arXiv, such as |
Part 2, Variance Matters
The theoretical side of reinforcement learning has focused almost entirely on stochastic models for algorithm design and analysis. This talk surveys techniques for algorithm design and testing, building on part 1. The material is taken from Chapter 8, and recent tutorials and articles including |
Part 3, TD and Q-Learning
Covers final two chapters: All about algorithm design for TD- and Q-learning in a stochastic environment. |