Skip to content

Commit

Permalink
adjust concurrency for 100-160gb
Browse files Browse the repository at this point in the history
  • Loading branch information
louisgv committed Jan 3, 2024
1 parent 6fab93e commit b1ddf4f
Showing 1 changed file with 1 addition and 1 deletion.
2 changes: 1 addition & 1 deletion modal/runner/containers/vllm_unified.py
Original file line number Diff line number Diff line change
Expand Up @@ -64,5 +64,5 @@ def __init__(
"VllmContainerA100_80G", num_gpus=1, memory=80
)
VllmContainerA100_160G = _make_container(
"VllmContainerA100_160G", num_gpus=2, memory=80, concurrent_inputs=4
"VllmContainerA100_160G", num_gpus=2, memory=80, concurrent_inputs=2
)

0 comments on commit b1ddf4f

Please sign in to comment.