Skip to content

Latest commit

 

History

History
67 lines (35 loc) · 6.6 KB

reinforcement_learning.md

File metadata and controls

67 lines (35 loc) · 6.6 KB

Reinforcement Learning

A curated list of awesome deep reinforcement learning research in search and recommendation

Tutorials

Papers

  1. [AAAI'20] Deep Reinforcement Learning for Online Advertising in Recommender Systems, by Xiangyu Zhao, Changsheng Gu, Haoshenglun Zhang, Xiaobing Liu, Xiwang Yang, Jiliang Tang.

  2. [AAAI'20] Simulating User Feedback for Reinforcement Learning Based Recommendations, by Xiangyu Zhao, Long Xia, Lixin Zou, Dawei Yin, Jiliang Tang. [JD]

  3. [AAAI'20] Model-based Reinforcement Learning for Predictions and Control for Limit Order Books, by Haoran Wei, Yuanbo Wang, Lidia Mangu, Keith Decker. [J.P. Morgan]

  4. [WSDM'20] Deep Reinforcement Learning for Whole-Chain Recommendations, by Xiangyu Zhao, Long Xia, Dawei Yin, Jiliang Tang. [JD]

  5. [KDD'19] Off-policy Learning for Multiple Loggers, by Li He, Long Xia, Wei Zeng, Zhi-Ming Ma, Yihong Zhao, Dawei Yin. [JD]

  6. [ACL'19] Neural Keyphrase Generation via Reinforcement Learning with Adaptive Rewards, by Hou Pong Chan, Wang Chen, Lu Wang, Irwin King.

  7. [CIKM'19] Generative Question Refinement with Deep Reinforcement Learning in Retrieval-based QA System, by Ye Liu, Chenwei Zhang, Xiaohui Yan, Yi Chang, Philip S. Yu. [Amazon, Huawei]

  8. [RecSys'19] PyRecGym: a Reinforcement Learning Gym for Recommender Systems, by Bichen Shi, Makbule Gulcin Ozsoy, Neil Hurley, Barry Smyth, Elias Z. Tragos, James Geraci, Aonghus Lawlor. [Samsung]

  9. [Arxiv'19] RecSim: A Configurable Simulation Platform for Recommender Systems, by Eugene Ie, Chih-wei Hsu, Martin Mladenov, Vihan Jain, Sanmit Narvekar, Jing Wang, Rui Wu, Craig Boutilier. [Google]

  10. [CoRR'19] Reinforcement Learning to Optimize Long-term User Engagement in Recommender Systems, by Lixin Zou, Long Xia, Zhuoye Ding, Jiaxing Song, Weidong Liu, Dawei Yin. [JD]

  11. [CoRR'19] Diversity-Promoting Deep Reinforcement Learning for Interactive Recommendation, by Yong Liu, Yinan Zhang, Qiong Wu, Chunyan Miao, Lizhen Cui, Binqiang Zhao, Yin Zhao, Lu Guan. [Alibaba]

  12. [CoRR'19] Toward Simulating Environments in Reinforcement Learning Based Recommendations, by Xiangyu Zhao, Long Xia, Zhuoye Ding, Dawei Yin, Jiliang Tang. [JD]

  13. [CoRR'19] Model-Based Reinforcement Learning for Whole-Chain Recommendations, by Xiangyu Zhao, Long Xia, Yihong Zhao, Dawei Yin, Jiliang Tang. [JD]

  14. [ICML'19] Generative Adversarial User Model for Reinforcement Learning Based Recommendation System, by Xinshi Chen, Shuang Li, Hui Li, Shaohua Jiang, Yuan Qi, Le Song. [Ant Financial]

  15. [IJCAI'19] Reinforcement Learning for Slate-based Recommender Systems: A Tractable Decomposition and Practical Methodology, by Eugene Ie, Vihan Jain, Jing Wang, Sanmit Narvekar, Ritesh Agarwal, Rui Wu, Heng-Tze Cheng, Morgane Lustman, Vince Gatto, Paul Covington, Jim McFadden, Tushar Chandra, Craig Boutilier. [Google]

  16. [AAAI'19] Large-scale Interactive Recommendation with Tree-structured Policy Gradient, by Haokun Chen, Xinyi Dai, Han Cai, Weinan Zhang, Xuejian Wang, Ruiming Tang, Yuzhou Zhang, Yong Yu. [Huawei]

  17. [AAAI'19] [Hierarchical Reinforcement Learning for Course Recommendation in MOOCs], by Jing Zhang, Bowen Hao, Bo Chen, Cuiping Li, Hong Chen, Jimeng Sun.

  18. [AAAI'19] Large-scale Interactive Recommendation with Tree-structured Policy Gradient, by . [Huawei]

  19. [AAAI'19] Virtual-Taobao: Virtualizing Real-world Online Retail Environment for Reinforcement Learning, by Jing-Cheng Shi, Yang Yu, Qing Da, Shi-Yong Chen, An-Xiang Zeng. [Alibaba]

  20. [WSDM'19] Top-K Off-Policy Correction for a REINFORCE Recommender System, by Minmin Chen, Alex Beutel, Paul Covington, Sagar Jain, Francois Belletti, Ed Chi. [Google]

  21. [Arxiv'18] Deep Reinforcement Learning based Recommendation with Explicit User-Item Interactions Modeling, by Feng Liu, Ruiming Tang, Xutao Li, Weinan Zhang, Yunming Ye, Haokun Chen, Huifeng Guo, Yuzhou Zhang. [Huawei]

  22. [KDD'18] Reinforcement Learning to Rank in E-Commerce Search Engine: Formalization, Analysis, and Application, by Yujing Hu, Qing Da, Anxiang Zeng, Yang Yu, Yinghui Xu. [Alibaba]

  23. [RecSys'18] Deep Reinforcement Learning for Page-wise Recommendations, by Xiangyu Zhao, Long Xia, Liang Zhang, Zhuoye Ding, Dawei Yin, Jiliang Tang. [JD]

  24. [ICDM'18] A Reinforcement Learning Framework for Explainable Recommendation, by Xiting Wang, Yiru Chen, Jie Yang, Le Wu, Zhengtao Wu, Xing Xie. [Microsoft]

  25. [EMNLP'18] Paraphrase Generation with Deep Reinforcement Learning, by Zichao Li, Xin Jiang, Lifeng Shang, Hang Li. [Huawei]

  26. [Arxiv'18] Deep Reinforcement Learning based Recommendation with Explicit User-Item Interactions Modeling, by Feng Liu, Ruiming Tang, Xutao Li, Weinan Zhang, Yunming Ye, Haokun Chen, Huifeng Guo, Yuzhou Zhang. [Huawei]

  27. [SIGIR'17] Reinforcement Learning to Rank with Markov Decision Process, by Zeng Wei, Jun Xu, Yanyan Lan, Jiafeng Guo, Xueqi Cheng.

Industrial Reports

  1. [201x] 强化学习在阿里的技术演进与业务创新, Alibaba.