
Reinforcement Learning

TODO

  • Decision Making Under Uncertainty
  • Markov Decision Processes
  • Value Functions & Bellman Equations
  • Dynamic Programming
  • Monte Carlo Methods
  • Temporal Difference Methods
  • Planning, Learning & Acting
  • On-Policy Prediction with Approximation
  • Constructing Features for Prediction
  • Control with Approximation
  • Policy Gradient Methods
    • REINFORCE
    • Actor-Critic
    • Off-Policy Policy Gradient
    • A2C
    • A3C
    • DDPG
    • PPO
    • SAC
    • TD3