-
Notifications
You must be signed in to change notification settings - Fork 1.4k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[BUG] <title>72B-int4 提示 probability tensor contains either inf
, nan
or element < 0
#857
Comments
同样的问题,你试过单卡推理吗?int4单卡48G可以带起来。我是只要多卡推理就会出现这个问题,单卡就正常。 |
没试过单卡 |
同样的问题 |
@taochangda 请先检查下autogptq安装是否正确哈,cu118的不能 |
@onionknightdd @taochangda 两位可以报下卡型、卡数量吗?如果方便的话,可以说明是自己的服务器、还是哪个平台租的服务器吗? |
似乎报一样的问题, 目前可以单卡推理, 但是多卡的话, 无论是docker环境,还是native环境, 加载模型都可以, 一旦实际chat, 就会出问题 4x3090 |
同样问题,自己服务器运行72B,用6张v100S报和楼主一样的错误,乱码报错;相同的软件环境用单张4090+大内存,chat一句话成功了。魔塔还是huggingface的都下载试过了,而且文件做了校验是对的,都是一样的问题,是不是对老显卡的多卡支持有问题? 现在我又在6*v100s试过了千问14B,device_map=cuda:0 可以正常运行,设置为auto就会复现错误。 |
可以试试把温度调到0,只调整top p即可。温度高了就会有这个问题。 |
This issue has been automatically marked as inactive due to lack of recent activity. Should you believe it remains unresolved and warrants attention, kindly leave a comment on this thread. |
是否已有关于该错误的issue或讨论? | Is there an existing issue / discussion for this?
该问题是否在FAQ中有解答? | Is there an existing answer for this in FAQ?
当前行为 | Current Behavior
72B-int4 提示下面的问题,怎么解决?
期望行为 | Expected Behavior
No response
复现方法 | Steps To Reproduce
No response
运行环境 | Environment
备注 | Anything else?
No response
The text was updated successfully, but these errors were encountered: