
Why does an OOM occur partway through a run? #1261

Answered by liushz
AriesHaa asked this question in Q&A
Make sure you have about 100 GB of GPU memory available, and try adding gpu_memory_utilization to the model config, for example:
model_kwargs=dict(tensor_parallel_size=2, gpu_memory_utilization=0.7),
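To see how these two settings relate to the memory figure, here is a rough sketch. It assumes the kwargs are forwarded to a vLLM-style engine, where gpu_memory_utilization is the fraction of each GPU's memory the engine may reserve and tensor_parallel_size is the number of GPUs the model is sharded across. The 80 GB per-GPU size and the helper name memory_budget_gb are assumptions for illustration, not something stated in the thread.

```python
# model_kwargs is taken from the answer above; everything else is illustrative.
model_kwargs = dict(tensor_parallel_size=2, gpu_memory_utilization=0.7)


def memory_budget_gb(per_gpu_gb: float, kwargs: dict) -> float:
    """Approximate total GPU memory the engine may reserve across all shards.

    Hypothetical helper: per-GPU size times the utilization fraction,
    multiplied by the number of tensor-parallel GPUs.
    """
    return (
        per_gpu_gb
        * kwargs["gpu_memory_utilization"]
        * kwargs["tensor_parallel_size"]
    )


# Assumption: two 80 GB cards. Adjust per_gpu_gb to match your hardware.
budget = memory_budget_gb(80.0, model_kwargs)
print(f"Approximate reservable GPU memory: {budget:.0f} GB")
```

Lowering gpu_memory_utilization leaves more headroom on each card for anything else that shares the GPU, at the cost of a smaller budget for the model and its cache. If the OOM persists, trying a lower value or a larger tensor_parallel_size is a reasonable next step.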

Answer selected by AriesHaa