Skip to content

Latest commit

 

History

History
43 lines (34 loc) · 2.88 KB

reinforcement_learning.md

File metadata and controls

43 lines (34 loc) · 2.88 KB

Reinforcement Learning

Sample Efficiency

Offline RL

  • ICLR 2023, X-QL: Extreme Q-Learning: MaxEnt RL without Entropy, Website
  • ICLR 2023, Diffusion-QL: Diffusion Policies as an Expressive Policy Class for Offline Reinforcement Learning, OpenReview
  • ICLR 2022, IQL: Offline Reinforcement Learning with Implicit Q-Learning, arXiv
  • NIPS 2021, Decision Transformer: Reinforcement Learning via Sequence Modeling, Website
  • NIPS 2020, CQL: Conservative Q-Learning for Offline Reinforcement Learning, Website
  • ICLR 2021 rejection, D4RL: Datasets for Deep Data-Driven Reinforcement Learning

Imitation Learning

Model-based RL

Multi-Task RL

Language