Slides

Outline:

  1. RL Concepts

  2. Policy gradients

  3. Dynamic programming

  4. Deep Q-networks

  5. Distributional RL

  6. D4PG

  7. PPO and MPO

  8. R2D3

  9. Applications of RL

    • AlphaX

    • Batch RL