Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

NPU基于Qwen2-VL-7B执行多卡QLoRA时卡住 #6659

Open
1 task done
tbozhong opened this issue Jan 15, 2025 · 1 comment
Open
1 task done

NPU基于Qwen2-VL-7B执行多卡QLoRA时卡住 #6659

tbozhong opened this issue Jan 15, 2025 · 1 comment
Labels
bug Something isn't working npu This problem is related to NPU devices pending This problem is yet to be addressed

Comments

@tbozhong
Copy link

Reminder

  • I have read the above rules and searched the existing issues.

System Info

  • llamafactory version: 0.9.1.dev0
  • Platform: Linux-5.4.241-1-tlinux4-0017.7-x86_64-with-glibc2.28
  • Python version: 3.9.16
  • PyTorch version: 2.1.0+cpu (NPU)
  • Transformers version: 4.45.2
  • Datasets version: 2.21.0
  • Accelerate version: 0.34.2
  • PEFT version: 0.12.0
  • TRL version: 0.9.6
  • NPU type: Ascend910B2C
  • CANN version: 8.0.RC3
  • DeepSpeed version: 0.13.1+1d35db76
  • Bitsandbytes version: 0.45.0.dev+7e6f865

Reproduction

单卡QLoRA运行成功,只有多卡时遇到问题。单、多卡LoRA同样可以运行成功。
卡住界面如下,随后会爆出通信超时的错误:
image

Others

期待您的回复~

@tbozhong tbozhong added bug Something isn't working pending This problem is yet to be addressed labels Jan 15, 2025
@github-actions github-actions bot added the npu This problem is related to NPU devices label Jan 15, 2025
@hiyouga
Copy link
Owner

hiyouga commented Jan 15, 2025

cannot reproduce

@hiyouga hiyouga closed this as completed Jan 15, 2025
@hiyouga hiyouga reopened this Jan 15, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Something isn't working npu This problem is related to NPU devices pending This problem is yet to be addressed
Projects
None yet
Development

No branches or pull requests

2 participants