Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Qwen-14B-Chat-Int4运行后预测结果不对 #68

Closed
takemars opened this issue Jan 25, 2024 · 4 comments
Closed

Qwen-14B-Chat-Int4运行后预测结果不对 #68

takemars opened this issue Jan 25, 2024 · 4 comments
Labels
bug Something isn't working

Comments

@takemars
Copy link

在A100 80G上执行,按照要求,安装了相关的包后,按照“运行指南(int4-gptq篇)”执行第一步和第三步后,查看日志为:
image,生成的文件如图:
image
执行第三步后,正常生成的文件是这几个吗?
最后执行python3 run.py --tokenizer_dir=Qwen-14B-Chat-Int4,发现预测结果不对,本次预测结果为:
image
,请问如何解决这个问题?

@takemars takemars changed the title Qwen-7B-Chat-Int4运行后预测结果不对 Qwen-14B-Chat-Int4运行后预测结果不对 Jan 25, 2024
@takemars
Copy link
Author

补充执行第三步的指令为:python build.py --use_weight_only
--weight_only_precision int4_gptq
--per_group
--hf_model_dir Qwen-14B-Chat-Int4
--quant_ckpt_path Qwen-14B-Chat-Int4

@Tlntin
Copy link
Owner

Tlntin commented Jan 25, 2024

可以升级一下transformers版本试试。
顺便问问你用的哪个版本,是当前项目的main分支吗

@takemars
Copy link
Author

image
用的是当前项目的main分支

@Tlntin Tlntin added the bug Something isn't working label Jan 26, 2024
@Tlntin
Copy link
Owner

Tlntin commented Jan 26, 2024

升级transformers版本后就可以了,该问题是optimum和transformers版本不匹配导致的,
两者都用最新版就可以解决了。

@Tlntin Tlntin closed this as completed Jan 26, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Something isn't working
Projects
None yet
Development

No branches or pull requests

2 participants