Skip to content

Commit

Permalink
lecture 3 youtube videos are uploade. B zhan is waiting to be approve…
Browse files Browse the repository at this point in the history
…d by the platform
  • Loading branch information
zhoubolei committed Mar 29, 2020
1 parent f6dd573 commit ff60dca
Show file tree
Hide file tree
Showing 2 changed files with 8 additions and 13 deletions.
21 changes: 8 additions & 13 deletions README.md
Original file line number Diff line number Diff line change
Expand Up @@ -10,17 +10,12 @@ The short course will be arranged as follows. Lectures 1-8 will be the foundatio
|------------ |-------------------------------------------- |----------- |
| Lecture 1 | Overview |[slide](lecture1.pdf), [youtube1](https://www.youtube.com/watch?v=IkEF4LpH5Ys), [youtube2](https://www.youtube.com/watch?v=Qu8CPnnwplM), [B站1](https://www.bilibili.com/video/av96436833/), [B站2](https://www.bilibili.com/video/av96834288/) |
| Lecture 2 | Markov Decision Process | [slide](lecture2.pdf), [youtube1](https://www.youtube.com/watch?v=6yE9XiIB3hQ), [youtube2](https://www.youtube.com/watch?v=MIZbocCu7Sk), [B站1](https://www.bilibili.com/video/av98583371/), [B站2](https://www.bilibili.com/video/av98583913/) |
| Lecture 3 | Model-free Prediction and Control | |
| Lecture 4 | On-policy and Off-policy Learning | |
| Lecture 5 | Value Function Approximation | |
| Lecture 6 | Policy Optimization: Basics | |
| Lecture 7 | Policy Optimization: State of the art | |
| Lecture 8 | Model-based RL | |
| Lecture 9 | Imitation Learning | |
| Lecture 10 | Distributed computing and RL system design | |
| Lecture 11 | Case Study on AlphaGo Series | |
| Lecture 12 | Case Study on AlphaStar and OpenAI Five | |
| Lecture 13 | Unsupervised Learning | |
| Lecture 14 | Generative Modeling | |
| TBD | ... | |
| Lecture 3 | Model-free Prediction and Control | [slide](lecture2.pdf), [youtube1](https://www.youtube.com/watch?v=Duj1U73yHik), [youtube2](https://www.youtube.com/watch?v=sfkhinBjGGY), [B站1](), [B站2]() |
| Lecture 4 | Value Function Approximation | |
| Lecture 5 | Policy Optimization: Basics | |
| Lecture 6 | Policy Optimization: State of the art | |
| Lecture 7 | Model-based RL | |
| Lecture 8 | Imitation Learning | |
| Lecture 9 | Distributed computing and RL system design | |
| Lecture 10 | Case Study on AlphaGo Series | |

Binary file added lecture3.pdf
Binary file not shown.

0 comments on commit ff60dca

Please sign in to comment.