trtllm-build with --fast-build ignore transformer layers #2135

ZJLi2013 · 2024-08-21T13:56:51Z

System Info

DGX H100

Who can help?

when build engine with :

  trtllm-build --fast-build --model_config $model_cfg

and then benchmark with gptMangerBenchmark, it reports:

[08/21/2024-12:18:48] [TRT-LLM] [I] Total time of building Unnamed Network 0: 00:01:28
[08/21/2024-12:18:48] [TRT-LLM] [E] Failed to get weight: transformer.layers.0.attention.qkv.weight
[08/21/2024-12:18:48] [TRT-LLM] [E] Failed to get weight: transformer.layers.0.attention.dense.weight
[08/21/2024-12:18:48] [TRT-LLM] [E] Failed to get weight: transformer.layers.0.mlp.router.weight
[08/21/2024-12:18:48] [TRT-LLM] [E] Failed to get weight: transformer.layers.0.mlp.fc.weight
[08/21/2024-12:18:48] [TRT-LLM] [E] Failed to get weight: transformer.layers.0.mlp.proj.weight
# and further in runtime : 
[TensorRT-LLM][ERROR] Encountered an error in forwardAsync function: Input tensor 'transformer.layers.0.attention.qkv.weight' not found; expected shape: (8192, 1280) (/src/tensorrt_llm/cpp/tensorrt_llm/runtime/tllmRuntime.cpp:202)

is it expected behavior with fast-build ?

btw, wo --fast-build, the engine build and benchmark looks all right.

Thanks

Information

The official example scripts
My own modified scripts

Tasks

An officially supported task in the examples folder (such as GLUE/SQuAD, ...)
My own task or dataset (give details below)

Reproduction

  trtllm-build --fast-build --model_config $model_cfg

Expected behavior

fast-build flag should also build workable engines,

actual behavior

with fast-build flag, transformer layers are ignored somehow

additional notes

no more

The text was updated successfully, but these errors were encountered:

VALLIS-NERIA · 2024-09-04T06:33:58Z

Hi, please check:

Do you find a file named rank0_managed_weights.safetensors or so inside the engine dir?

Is there a field named manage_weights in config.json, plugin_config part?

VALLIS-NERIA · 2024-09-05T13:37:29Z

It seems that you are building from a model config without weights, not a checkpoint. In such cases TRT-LLM generates random weights, but is not supported by fast_build yet.

github-actions · 2024-10-09T02:02:16Z

This issue is stale because it has been open 30 days with no activity. Remove stale label or comment or this will be closed in 15 days."

github-actions · 2024-12-04T16:07:27Z

This issue was closed because it has been 14 days without activity since it has been marked as stale.

ZJLi2013 added the bug Something isn't working label Aug 21, 2024

lfr-0531 assigned VALLIS-NERIA Sep 7, 2024

lfr-0531 added triaged Issue has been triaged by maintainers not a bug Some known limitation, but not a bug. waiting for feedback and removed bug Something isn't working labels Sep 7, 2024

kaiyux mentioned this issue Oct 8, 2024

Update TensorRT-LLM #2297

Merged

github-actions bot added the stale label Oct 9, 2024

kaiyux mentioned this issue Nov 1, 2024

Update TensorRT-LLM v0.14.0 #2401

Merged

github-actions bot closed this as not planned Won't fix, can't repro, duplicate, stale Dec 4, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

trtllm-build with --fast-build ignore transformer layers #2135

trtllm-build with --fast-build ignore transformer layers #2135

ZJLi2013 commented Aug 21, 2024

VALLIS-NERIA commented Sep 4, 2024

VALLIS-NERIA commented Sep 5, 2024

github-actions bot commented Oct 9, 2024

github-actions bot commented Dec 4, 2024

trtllm-build with --fast-build ignore transformer layers #2135

trtllm-build with --fast-build ignore transformer layers #2135

Comments

ZJLi2013 commented Aug 21, 2024

System Info

Who can help?

Information

Tasks

Reproduction

Expected behavior

actual behavior

additional notes

VALLIS-NERIA commented Sep 4, 2024

VALLIS-NERIA commented Sep 5, 2024

github-actions bot commented Oct 9, 2024

github-actions bot commented Dec 4, 2024