-
Notifications
You must be signed in to change notification settings - Fork 1k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Token indices sequence length is longer than the specified maximum sequence length for this model (4158 > 2048) #102
Comments
This problem also occurred when I reproduced the 20B model |
Hi , nvcc -V When I run Any help would be really appreciated. I tried different versions of cuda , same error every time.. |
also same error... |
Describe the bug
Running the Pythia-7B fine-tune script on 4 x A10 (24GB each).
Seems like issue with seq len:
_```
Token indices sequence length is longer than the specified maximum sequence length for this model (4158 > 2048). Running this sequence through the model will result in indexing errors
Traceback (most recent call last):
File "/home/ec2-user/OpenChatKit/training/dist_clm_train.py", line 358, in
main()
File "/home/ec2-user/OpenChatKit/training/dist_clm_train.py", line 332, in main
train_loop(args, pipe, device, train_data_loader, test_data_loader)
File "/home/ec2-user/OpenChatKit/training/dist_clm_train.py", line 151, in train_loop
get_data_parallel_comm().recv(
File "/home/ec2-user/OpenChatKit/training/comm/nccl_backend.py", line 79, in recv
self.comm.recv(
File "cupy_backends/cuda/libs/nccl.pyx", line 477, in cupy_backends.cuda.libs.nccl.NcclCommunicator.recv
File "cupy_backends/cuda/libs/nccl.pyx", line 129, in cupy_backends.cuda.libs.nccl.check_status
cupy_backends.cuda.libs.nccl.NcclError: NCCL_ERROR_UNHANDLED_CUDA_ERROR: unhandled cuda error
Traceback (most recent call last):
File "/home/ec2-user/OpenChatKit/training/dist_clm_train.py", line 358, in
main()
File "/home/ec2-user/OpenChatKit/training/dist_clm_train.py", line 332, in main
train_loop(args, pipe, device, train_data_loader, test_data_loader)
File "/home/ec2-user/OpenChatKit/training/dist_clm_train.py", line 117, in train_loop
get_data_parallel_comm().send(
File "/home/ec2-user/OpenChatKit/training/comm/nccl_backend.py", line 65, in send
self.comm.send(
File "cupy_backends/cuda/libs/nccl.pyx", line 468, in cupy_backends.cuda.libs.nccl.NcclCommunicator.send
File "cupy_backends/cuda/libs/nccl.pyx", line 129, in cupy_backends.cuda.libs.nccl.check_status
cupy_backends.cuda.libs.nccl.NcclError: NCCL_ERROR_UNHANDLED_CUDA_ERROR: unhandled cuda error
The text was updated successfully, but these errors were encountered: