Commit

update lecture 6 SOTA Policy optimization
It's gonna be a brutal lecture...
zhoubolei authored Apr 23, 2020
1 parent dd78358 commit bb599a2
Showing 1 changed file with 1 addition and 1 deletion.
2 changes: 1 addition & 1 deletion README.md
@@ -12,7 +12,7 @@ The short course is scheduled as follows. Lectures 1-7 will be the foundation, t
| Lecture 3 | Model-free Prediction and Control | [slide](lecture3.pdf), Youtube([part1](https://www.youtube.com/watch?v=Duj1U73yHik), [part2](https://www.youtube.com/watch?v=sfkhinBjGGY)), B站([上集](https://www.bilibili.com/video/BV1N7411Q7aJ/), [下集](https://www.bilibili.com/video/BV1N7411Q7M6/)) |
| Lecture 4 | Value Function Approximation |[slide](lecture4.pdf), Youtube([part1](https://www.youtube.com/watch?v=YdWsnB-u8PQ), [part2](https://www.youtube.com/watch?v=fGIaFlbBFxk)), B站([上集](https://www.bilibili.com/video/BV11V411f7bi/), [下集](https://www.bilibili.com/video/BV1w54y1d7se/)) |
| Lecture 5 | Policy Optimization: Foundation |[slide](lecture5.pdf), Youtube([part1](https://www.youtube.com/watch?v=ProKaoyduFY), [part2](https://www.youtube.com/watch?v=MWXazkQkTlk)), B站([上集](https://www.bilibili.com/video/BV1fZ4y1x7mp/), [下集](https://www.bilibili.com/video/BV1ia4y1x7Va/)) |
-| Lecture 6 | Policy Optimization: State of the art | |
+| Lecture 6 | Policy Optimization: State of the art |[slide](lecture6.pdf), Youtube([part1](https://youtu.be/4YIdjLh-MJs), [part2](https://youtu.be/HOpiQWM0PCA)), B站([上集](https://www.bilibili.com/video/BV1s64y1M7AW/), [下集](https://www.bilibili.com/video/BV1EK41157fD/)) |
| Lecture 7 | Model-based RL | |
| Lecture 8 | Imitation Learning | |
| Lecture 9 | Distributed computing and RL system design | |