Hyperparameter of the number of distractor documents and the ratio of golden documents in training RAFT #325

yifan1130 · 2024-04-07T20:48:06Z

Hi, I'd like to know the number of distractor documents and the ratio of golden documents used when training RAFT to produce the results in the paper. I saw that the default ratio in the script is 1.0, which means no distractor documents are used. Do the default values in the scripts correspond to the hyperparameters used to produce the results in the paper?

kaiwen129 · 2024-04-08T03:24:19Z

Hi, the optimal value for the P% hyperparameter may vary from dataset to dataset. To answer your question, the paper used various values of P%, not just the default value of 1.0 in the script.

Change `p`, which dictates the fraction of the dataset with golden documents in them (vs. no golden documents). So p = 0.8 means that for 80% of the training set, `A* = Q + D* + D1 .. Dn`, and for the remaining 20% of the training set, `A* = Q + D1 .. Dn`, where `D*` is the golden document with the answer `A*`. Close #325
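To make the role of `p` concrete, here is a minimal sketch of how the context for one training example could be assembled. It is not the code from `raft.py`: the function name `build_context`, its parameters, and the choice to keep `num_distract` distractors in both branches (matching the `D1 .. Dn` notation above) are assumptions made for illustration.

```python
import random


def build_context(golden, distractors, p, num_distract, rng):
    """Assemble the documents for one training example (hypothetical helper).

    With probability p the golden document D* is included alongside
    num_distract distractors; otherwise only distractors are used.
    Returns (docs, has_golden).
    """
    # Sample distractors without replacement from the candidate pool.
    docs = rng.sample(distractors, num_distract)
    # rng.random() is in [0.0, 1.0), so p = 1.0 always includes D*
    # and p = 0.0 never does.
    has_golden = rng.random() < p
    if has_golden:
        docs.append(golden)
    # Shuffle so the golden document has no fixed position.
    rng.shuffle(docs)
    return docs, has_golden


rng = random.Random(0)
pool = [f"distractor-{i}" for i in range(10)]
docs, has_golden = build_context("golden", pool, p=0.8, num_distract=3, rng=rng)
print(has_golden, docs)
```

Over many examples, roughly a fraction `p` of the contexts contain the golden document. Distractors are sampled in both branches, so `p = 1.0` still produces contexts with distractor documents in this sketch.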

ShishirPatil mentioned this issue Apr 14, 2024

Update raft.py with default p to match paper #353

Merged

ShishirPatil closed this as completed in #353 Apr 14, 2024
