lkevinzc

Follow

🎯

Learning

zclzc lkevinzc

🎯

Learning

Follow

NUS PhD student working on RL @sail-sg

62 followers · 161 following

Achievements

Achievements

Organizations

Pinned Loading

mosecorg/mosec mosecorg/mosec Public

A high-performance ML model serving framework, offers dynamic batching and CPU/GPU pipelines to fully exploit your compute machine

Python 821 61
sail-sg/oat sail-sg/oat Public

🌾 OAT: A research-friendly framework for LLM online alignment, including preference learning, reinforcement learning, etc.

Python 194 12
sail-sg/oat-zero sail-sg/oat-zero Public

A lightweight reproduction of DeepSeek-R1-Zero with indepth analysis of self-reflection behavior.

Python 160 9
sail-sg/dice sail-sg/dice Public

Official implementation of Bootstrapping Language Models via DPO Implicit Rewards

Python 42 3
sail-sg/rosmo sail-sg/rosmo Public

Codes for "Efficient Offline Policy Optimization with a Learned Model", ICLR2023

Python 28
dance dance Public

Codes for "DANCE: A Deep Attentive Contour Model for Efficient Instance Segmentation", WACV2021

Python 67 13