Why utilizing the 'question' column of the 'ArmelR/stack-exchange-instruction' dataset for gradient backpropagation?

Thank you for your valuable open-source contribution!

In instruction tuning stage, it seems that only the answer aligned with the instruction participates in the backpropagation process.  And [this code](https://github.com/bigcode-project/starcoder/blob/main/finetune/finetune.py#L190) seems to imply that the question part of the dataset is also involved in backpropagation of the gradient. Will this lead to better training results?

![image](https://github.com/bigcode-project/starcoder/assets/41630003/4569446f-c4f1-47e6-abe0-90f90766bda3)


Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Why utilizing the 'question' column of the 'ArmelR/stack-exchange-instruction' dataset for gradient backpropagation? #135

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Why utilizing the 'question' column of the 'ArmelR/stack-exchange-instruction' dataset for gradient backpropagation? #135

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions