add info for clarity
dsikka committed Dec 21, 2024
1 parent 9d49435 commit d31b31b
Showing 1 changed file with 4 additions and 2 deletions.
@@ -54,6 +54,8 @@
 warmup_ratio=warmup_ratio,
 )
 logger.info(
-"Note: vLLM requires the dtype=torch.float16 when running the ",
-"compressed marlin-24 model",
+"Note: llcompressor does not currently support running ",
+"compressed models in the marlin-24 format. The model ",
+"produced from this example can be run on vLLM with ",
+"dtype=torch.float16",
 )
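
Review note: in both the old and new versions of this hunk, the string literals are separated by commas, so `logger.info` receives the first literal as the message and the remaining literals as %-format arguments. Because the message contains no format placeholders, formatting the record raises `TypeError` and the note is likely not emitted as intended. The sketch below is one possible fix, not the author's change: it drops the commas so the adjacent literals are concatenated into a single message. The logger name and the `build_record` helper are hypothetical and exist only for illustration; the message text is copied verbatim from the diff.

```python
import logging

# Hypothetical logger name, used only for this sketch.
logger = logging.getLogger("marlin24_note_sketch")

# Adjacent string literals with no commas between them are joined at
# compile time, so this is a single message string.
NOTE = (
    "Note: llcompressor does not currently support running "
    "compressed models in the marlin-24 format. The model "
    "produced from this example can be run on vLLM with "
    "dtype=torch.float16"
)


def build_record(msg, *args):
    """Hypothetical helper: build the LogRecord that logger.info(msg, *args) would create."""
    return logger.makeRecord(
        logger.name, logging.INFO, "sketch.py", 0, msg, args, None
    )


if __name__ == "__main__":
    logging.basicConfig(level=logging.INFO)
    # Single concatenated message: formats and logs cleanly.
    logger.info(NOTE)

    # Comma-separated literals: the second literal becomes a %-format
    # argument, and formatting the record fails.
    bad = build_record("part one ", "part two")
    try:
        bad.getMessage()
    except TypeError as exc:
        print(f"formatting failed: {exc}")
```

Parenthesized implicit concatenation keeps the line-wrapped layout from the diff while passing exactly one argument to `logger.info`.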