This short RL course introduces the fundamentals of reinforcement learning. The slides are in English, and the lectures are given in Chinese by Bolei Zhou. The course is for personal entertainment only.
The course schedule is as follows. There are 10 lectures in total: the first lecture premiered on 16 March 2020 and the last lecture finished on 25 May 2020. Thanks for watching and may ReinForce be with you!
Lecture | Topic | Resources |
---|---|---|
Lecture 1 | Overview (课程概括与RL基础) | slide, YouTube (part 1, part 2), Bilibili (part 1, part 2) |
Lecture 2 | Markov Decision Process (马尔科夫决策过程) | slide, YouTube (part 1, part 2), Bilibili (part 1, part 2) |
Lecture 3 | Model-free Prediction and Control (无模型的预测和控制) | slide, YouTube (part 1, part 2), Bilibili (part 1, part 2) |
Lecture 4 | Value Function Approximation (价值函数近似) | slide, YouTube (part 1, part 2), Bilibili (part 1, part 2) |
Lecture 5 | Policy Optimization: Foundation (策略优化基础篇) | slide, YouTube (part 1, part 2), Bilibili (part 1, part 2) |
Lecture 6 | Policy Optimization: State of the Art (策略优化进阶篇) | slide, YouTube (part 1, part 2), Bilibili (part 1, part 2) |
Lecture 7 | Model-based RL (基于环境模型的RL) | slide, YouTube, Bilibili |
Lecture 8 | Imitation Learning (模仿学习) | slide, YouTube, Bilibili |
Lecture 9 | Distributed Systems for RL (分布式系统) | slide, YouTube, Bilibili |
Lecture 10 | RL in a Nutshell (课程结局篇) | slide, YouTube, Bilibili |