-
运行代码为 export CUDA_VISIBLE_DEVICES=0,1,2,3 && python run.py --models vllm_qwen1_5_32b_chat_awq --datasets ceval_ppl --debug 配置文件如下: _meta_template = dict( models = [ 请问大家是如何解决的,谢谢! |
Beta Was this translation helpful? Give feedback.
Replies: 2 comments 2 replies
-
Make sure you have about 100 GB GPU memory and try to add |
Beta Was this translation helpful? Give feedback.
-
请教一下,model_kwargs=dict(tensor_parallel_size=2, gpu_memory_utilization=0.7)是添加在哪里啊,我这样添加的 报错 |
Beta Was this translation helpful? Give feedback.
Make sure you have about 100 GB GPU memory and try to add
gpu_memory_utilization
to the model config, like:model_kwargs=dict(tensor_parallel_size=2, gpu_memory_utilization=0.7),