Running on an A100 80GB. After installing the required packages as instructed, I followed steps 1 and 3 of the "Run Guide (int4-gptq edition)". The log output was: , and the generated files are as shown in the screenshot: . Are these the files that step 3 should normally produce? Finally, I ran python3 run.py --tokenizer_dir=Qwen-14B-Chat-Int4 and found that the prediction results were wrong; this run's output was: . How can I fix this?
To add: the command used for step 3 was python build.py --use_weight_only --weight_only_precision int4_gptq --per_group --hf_model_dir Qwen-14B-Chat-Int4 --quant_ckpt_path Qwen-14B-Chat-Int4
Try upgrading your transformers version. Also, which version of the project are you using — the current main branch?
I'm using the current main branch of the project.
After upgrading transformers it works. The problem was caused by a version mismatch between optimum and transformers; installing the latest version of both resolves it.
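The fix above amounts to keeping transformers and optimum in step with each other. As a quick sanity check before rebuilding, you can verify the locally installed versions meet a minimum floor — note the floor values below are illustrative assumptions, not the project's documented requirements:

```python
# Minimal sketch: check that transformers/optimum are installed and not
# obviously stale. The version floors here are assumptions for illustration,
# not official requirements of this project.
from importlib.metadata import version, PackageNotFoundError


def parse_version(v: str) -> tuple:
    """Convert 'X.Y.Z' (ignoring suffixes like 'rc1') into a comparable int tuple."""
    parts = []
    for piece in v.split(".")[:3]:
        digits = "".join(ch for ch in piece if ch.isdigit())
        parts.append(int(digits) if digits else 0)
    return tuple(parts)


def check(pkg: str, floor: str) -> None:
    """Print whether the installed version of pkg is at least floor."""
    try:
        installed = version(pkg)
    except PackageNotFoundError:
        print(f"{pkg}: not installed")
        return
    status = "OK" if parse_version(installed) >= parse_version(floor) else "too old, upgrade"
    print(f"{pkg} {installed}: {status}")


check("transformers", "4.32.0")  # assumed floor, adjust to your setup
check("optimum", "1.13.0")       # assumed floor, adjust to your setup
```

If either package reports as too old, `pip install -U transformers optimum` brings both to their latest releases, which is what resolved the mismatch here.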