You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
It seems that Subset class in PyTorch simply shallow-copies the original dataset. In PR #210, I made it deep-copy before the for-loop to fix the issue, instead of when substituting to dataset_dict. Otherwise, other splits might be affected by the change made before dataset_dict[sub_dataset_id] = copy.deepcopy(sub_dataset) you showed above.
When building dataset dict, transformers would be overloaded in
train
andval
split dataset in split_datasetI fixed such bug using following codes:
The text was updated successfully, but these errors were encountered: