DSPy integration #921

Harryllh · 2026-01-22T22:25:55Z

This is a DSPy integration that allows targeted training on a modular level within a compound AI system. Rollouts and trace collection are both done using DSPy. This also supports custom reward design, for both local and final reward.

…tron-batch-validation

Assertion dataset

…sertion

* feat: `run_lcb.sh` runs without error until data loading * fix: run_lcb runs until trace collection * feat: run_lcb.py runs --------- Co-authored-by: J-Ch-n <jiashuchen@berkeley.edu>

* feat: preliminary Hover to be tested * feat: Hover ready for testing * feat: add run script for Hover; TODO: final reward func * feat: add hover/data.py * feat: add `hover_final_reward_fn` * working hover with final reward only * Change of logging and litellm version * slight change * sig change --------- Co-authored-by: J-Ch-n <jiashu.chen@berkeley.edu>

* feat: add banking77 folder * feat: ready to test banking77 * fix: banking77 runs on lambda * feat: banking77 works * fix: minor changes * fix: change reward function * fix: update start script and reward functions --------- Co-authored-by: J-Ch-n <jiashuchen@berkeley.edu>

Harryllh and others added 30 commits October 4, 2025 23:58

fix megatron batch validation

d636214

change divisible check

45706a5

check strategy flag existence

c0c1303

Merge branch 'main' of https://github.com/erictang000/SkyRL into mega…

6cd33bb

…tron-batch-validation

fix tests

86c6345

Merge branch 'NovaSky-AI:main' into main

1b3cbf0

Merge branch 'NovaSky-AI:main' into main

557a275

Merge branch 'NovaSky-AI:main' into main

b0103e7

Merge branch 'NovaSky-AI:main' into main

8c3f93e

Merge branch 'NovaSky-AI:main' into main

7192106

Merge branch 'NovaSky-AI:main' into main

790e2fd

launch code

fc657ec

updated bash script

0890817

dspy integration started

2f8000b

simplified code

6ee5085

lcb code

cb51d72

clean up

6b077f8

feat: finish dataset

4b6405c

refactor: change argument passing

40b9e26

feat: add example file

c76d7f5

refactor: modify data paths in example file

aa38573

refactor: move example file

f675720

feat: move is_stdin from test to example level

db01413

Merge pull request #1 from Harryllh/assertion-dataset

9c0d45a

Assertion dataset

push before sleep

765bb48

trace collection

01b3950

bash script

fbcc2c0

merged from assertion-dataset

409c716

Merge branch 'assertion' of https://github.com/Harryllh/SkyRL into as…

9973f59

…sertion

run_lcb.sh runs (#2)

201195b

* feat: `run_lcb.sh` runs without error until data loading * fix: run_lcb runs until trace collection * feat: run_lcb.py runs --------- Co-authored-by: J-Ch-n <jiashuchen@berkeley.edu>

Harryllh and others added 24 commits December 30, 2025 05:17

merged from assertion-dataset + clean-up

2469ca3

extra dspy programs

5a12b75

working training

5a8f1ee

epoch and final reward function

ea50aef

error handling and timeouts

99c757b

max tokens

ed0b0ed

papillon and hover start code

44984a4

new apis and papillon code

fe3f8c7

debug

e3c8681

async papillon supported. async lcb still needs testing

1368923

trace collection change

d9ad220

added chat template

1e33f96

robust trace collection. reasoning trace included

f1e7879

ready for 7b training

5d32c7d

lcb data pipeline

bb5046c

path issues

5a3ec2e

updated 8b training script. vllm engine update. concurrency update.

44ba540

working lcb and LM infra change

9f11a23

working papillon

fc4f7e3

efficient lcb

a20dc62

router script

fc46729

push before sleep

5116e10

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

DSPy integration #921

DSPy integration #921

Uh oh!

Harryllh commented Jan 22, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

DSPy integration #921

Are you sure you want to change the base?

DSPy integration #921

Uh oh!

Conversation

Harryllh commented Jan 22, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants