Skip to content

[Example] Policy model as its own reward model#270

Merged
pan-x-c merged 9 commits intoagentscope-ai:mainfrom
hiyuchang:feat/trainable_ruler
Sep 12, 2025
Merged

[Example] Policy model as its own reward model#270
pan-x-c merged 9 commits intoagentscope-ai:mainfrom
hiyuchang:feat/trainable_ruler

Commits

Commits on Sep 5, 2025

Commits on Sep 12, 2025

Comments