Skip to content

Conversation

@Harryllh
Copy link
Contributor

This is a DSPy integration that allows targeted training on a modular level within a compound AI system. Rollouts and trace collection are both done using DSPy. This also supports custom reward design, for both local and final reward.

Harryllh and others added 30 commits October 4, 2025 23:58
* feat: `run_lcb.sh` runs without error until data loading

* fix: run_lcb runs until trace collection

* feat: run_lcb.py runs

---------

Co-authored-by: J-Ch-n <jiashuchen@berkeley.edu>
Harryllh and others added 24 commits December 30, 2025 05:17
* feat: preliminary Hover to be tested

* feat: Hover ready for testing

* feat: add run script for Hover; TODO: final reward func

* feat: add hover/data.py

* feat: add `hover_final_reward_fn`

* working hover with final reward only

* Change of logging and litellm version

* slight change

* sig change

---------

Co-authored-by: J-Ch-n <jiashu.chen@berkeley.edu>
* feat: add banking77 folder

* feat: ready to test banking77

* fix: banking77 runs on lambda

* feat: banking77 works

* fix: minor changes

* fix: change reward function

* fix: update start script and reward functions

---------

Co-authored-by: J-Ch-n <jiashuchen@berkeley.edu>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants