trtllm-build with --fast-build ignore transformer layers #2135
Labels
not a bug
Some known limitation, but not a bug.
stale
triaged
Issue has been triaged by maintainers
waiting for feedback
System Info
DGX H100
Who can help?
when build engine with :
trtllm-build --fast-build --model_config $model_cfg
and then benchmark with gptMangerBenchmark, it reports:
is it expected behavior with fast-build ?
btw, wo
--fast-build
, the engine build and benchmark looks all right.Thanks
Information
Tasks
examples
folder (such as GLUE/SQuAD, ...)Reproduction
trtllm-build --fast-build --model_config $model_cfg
Expected behavior
fast-build flag should also build workable engines,
actual behavior
with fast-build flag, transformer layers are ignored somehow
additional notes
no more
The text was updated successfully, but these errors were encountered: