You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardexpand all lines: README.md
+2-2
Original file line number
Diff line number
Diff line change
@@ -135,8 +135,8 @@ cd src/data_generation && bash scripts/run.sh
135
135
136
136
## Step 3: Response Synthesis
137
137
138
-
1. Generating responses (code in the `src/data_generation` folder) and applying a reward filtering strategy (code in the `src/data_generation/reward_filtering` folder).
139
-
2.For instruction-tuning and evaluation, we directly use the DART-Math framework.
138
+
1. Generating responses (code in the `src/data_generation` folder)
139
+
2.applying a reward filtering strategy (code in the `src/data_generation/reward_filtering` folder).
0 commit comments