You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
I have already begun the fine tuning. However, some thing wrong:
***** Running training *****
Num examples = 2830
Num Epochs = 6
Instantaneous batch size per device = 8
Total train batch size (w. parallel, distributed & accumulation) = 64
Gradient Accumulation steps = 8
Total optimization steps = 264
0%| | 0/264 [00:00<?, ?it/s]Traceback (most recent call last):
File "workspace/train_code5.py", line 268, in <module>
train()
File "workspace/train_code5.py", line 262, in train
trainer.train(resume_from_checkpoint=training_args.resume_from_checkpoint)
File "/root/miniconda3/envs/seedpicker/lib/python3.9/site-packages/transformers/trainer.py", line 1498, in train
return inner_training_loop(
File "/root/miniconda3/envs/seedpicker/lib/python3.9/site-packages/transformers/trainer.py", line 1740, in _inner_training_loop
tr_loss_step = self.training_step(model, inputs)
File "/root/miniconda3/envs/seedpicker/lib/python3.9/site-packages/transformers/trainer.py", line 2470, in training_step
loss = self.compute_loss(model, inputs)
File "/root/miniconda3/envs/seedpicker/lib/python3.9/site-packages/transformers/trainer.py", line 2502, in compute_loss
outputs = model(**inputs)
File "/root/miniconda3/envs/seedpicker/lib/python3.9/site-packages/torch/nn/modules/module.py", line 1501, in _call_impl
return forward_call(*args, **kwargs)
File "/root/miniconda3/envs/seedpicker/lib/python3.9/site-packages/deepspeed/utils/nvtx.py", line 15, in wrapped_fn
ret_val = func(*args, **kwargs)
File "/root/miniconda3/envs/seedpicker/lib/python3.9/site-packages/deepspeed/runtime/engine.py", line 1768, in forward
loss = self.module(*inputs, **kwargs)
File "/root/miniconda3/envs/seedpicker/lib/python3.9/site-packages/torch/nn/modules/module.py", line 1538, in _call_impl
result = forward_call(*args, **kwargs)
File "/root/miniconda3/envs/seedpicker/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1621, in forward
torch.cuda.set_device(self.decoder.first_device)
File "/root/miniconda3/envs/seedpicker/lib/python3.9/site-packages/torch/nn/modules/module.py", line 1614, in __getattr__
raise AttributeError("'{}' object has no attribute '{}'".format(
I have already begun the fine tuning. However, some thing wrong:
I check the issue repo and install the transformers with version 4.21.3 as suggestion https://github.com/salesforce/CodeT5/issues/113.
What should I do? Thanks.
The text was updated successfully, but these errors were encountered: