🎯
Learning
NUS PhD student working on RL @sail-sg
-
Sea AI Lab @sail-sg
- Singapore
- https://lkevinzc.github.io/
- @zzlccc
Pinned Loading
-
mosecorg/mosec
mosecorg/mosec PublicA high-performance ML model serving framework, offers dynamic batching and CPU/GPU pipelines to fully exploit your compute machine
-
sail-sg/oat
sail-sg/oat Public🌾 OAT: A research-friendly framework for LLM online alignment, including preference learning, reinforcement learning, etc.
-
sail-sg/oat-zero
sail-sg/oat-zero PublicA lightweight reproduction of DeepSeek-R1-Zero with indepth analysis of self-reflection behavior.
-
sail-sg/dice
sail-sg/dice PublicOfficial implementation of Bootstrapping Language Models via DPO Implicit Rewards
-
sail-sg/rosmo
sail-sg/rosmo PublicCodes for "Efficient Offline Policy Optimization with a Learned Model", ICLR2023
Python 28
Something went wrong, please refresh the page to try again.
If the problem persists, check the GitHub status page or contact support.
If the problem persists, check the GitHub status page or contact support.