Resources

Link to Slides:

Link to Video:
Part 1, Goals and Challenges and solutions! Introduction to the ODE method in a simple deterministic setting, with applications to extremum seeking control (a class of algorithms for online optimization, with applications to reinforcement learning). Much is taken from Chapter 4 and from 2022 publications available on arXiv, such as "Markovian Foundations for Quasi-Stochastic Approximation with Applications to Extremum Seeking Control".

Part 2, Variance Matters. The theoretical side of reinforcement learning has focused almost entirely on stochastic models for algorithm design and analysis. This talk surveys techniques for algorithm design and testing, building on Part 1. The material is taken from Chapter 8 and from recent tutorials and articles, including "The ODE Method for Asymptotic Statistics in Stochastic Approximation and Reinforcement Learning".

Part 3, TD and Q-Learning. Covers the final two chapters, which are all about algorithm design for TD- and Q-learning in a stochastic environment. Much of Part II of CS&RL is based on handouts created over the years, some of which evolved to become "Fundamental Design Principles for Reinforcement Learning Algorithms".

Dr. Sean Meyn