Slides

Plan for Today:

  1. Formalizing RL

  2. Value Functions

  3. Exploration

  4. Policy Gradient and Actor-Critic Approaches

  5. Generalization

  6. Structure

  7. Models

  8. New Challenges