I'm a graduate from the School of Artificial Intelligence at Nanjing University. I've completed both my undergraduate and master's degrees in this field. My research interests lie in Reinforcement Learning and Large Language Models. Currently, I'm working at Moonshot.ai, focusing on Alignment research.
♾️
Pursuit Infinity
Lamda-RL, Nanjing University
-
Nanjing University
- Nanjing, China
-
15:18
(UTC +08:00) - https://orcid.org/0009-0001-4907-0304
- in/welt-ding
Highlights
- Pro
Pinned Loading
-
-
OpenRLHF
OpenRLHF PublicForked from OpenRLHF/OpenRLHF
An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & Mixtral)
Python
-
BibTeX-Formatter
BibTeX-Formatter PublicFormat your bibtex (.bib) file to help standardize citations for conference and journal submissions
Python 13
Something went wrong, please refresh the page to try again.
If the problem persists, check the GitHub status page or contact support.
If the problem persists, check the GitHub status page or contact support.