Adding debug options to trtllm-build to visualize the TRT Network before Engine build #1238
Conversation
tensorrt_llm/commands/build.py (Outdated)
```diff
@@ -139,7 +153,13 @@ def parse_arguments():
     return args


-def build_model(model: PretrainedModel, build_config: BuildConfig) -> Engine:
+def build_model(rank: int,
```
The `build_model` function is moved to `builder.py` and renamed to `build`. We want to keep the interface of the `build` function stable:
```python
def build(model: PretrainedModel, build_config: BuildConfig) -> Engine:
```
So, I suggest we add `dry_run` and `visualize_network` as fields of `BuildConfig`.
Updated the PR to add `dry_run` and `visualize_network` to `BuildConfig` as suggested.
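A minimal sketch of that shape, assuming a simplified config class (this is not the real TensorRT-LLM `BuildConfig`, just an illustration of carrying the two debug switches as fields so the `build(model, build_config)` signature stays unchanged):

```python
from dataclasses import dataclass

# Simplified stand-in for BuildConfig; only the idea is shown, not the real class.
@dataclass
class BuildConfig:
    max_batch_size: int = 1          # existing fields elided for brevity
    dry_run: bool = False            # stop before the engine build/serialization
    visualize_network: bool = False  # dump the finalized TRT network for inspection

config = BuildConfig(dry_run=True, visualize_network=True)
print(config)
```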
tensorrt_llm/commands/build.py (Outdated)
```diff
@@ -529,7 +564,8 @@ def main():
         'weight_only_precision': args.weight_only_precision,
     }
     parallel_build(source, build_config, args.output_dir, workers,
-                   args.log_level, model_cls, **kwargs)
+                   args.log_level, model_cls, args.dry_run,
+                   args.visualize_network, **kwargs)
```
There is a `**kwargs` field, so we can reuse it to avoid too many changes across different functions:
```python
kwargs = {
    'logits_dtype': args.logits_dtype,
    'use_fused_mlp': args.use_fused_mlp,
    'weight_only_precision': args.weight_only_precision,
    'dry_run': args.dry_run,
    'visualize_network': args.visualize_network,
}
```
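For illustration, a toy version of that pass-through (the function names and signatures here are stand-ins, not the actual ones in `build.py`): flags threaded through `**kwargs` reach the inner function without touching the signatures of anything in between.

```python
# Toy example of the **kwargs pass-through; illustrative names only.
def build_model(model, dry_run=False, visualize_network=False, **extra):
    print(f"building {model}: dry_run={dry_run}, visualize_network={visualize_network}")

def parallel_build(model, **kwargs):
    # No signature change needed here: the new flags ride along in kwargs.
    build_model(model, **kwargs)

parallel_build("llama2-70b", dry_run=True, visualize_network=True)
```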
This is no longer necessary since we added `dry_run` and `visualize_network` to `BuildConfig` as suggested.
Hi @Lokiiiiii, sorry for the late response. Thanks for submitting the MR, and we really appreciate your contributions to TensorRT-LLM. Could you please rebase the MR onto the latest main branch?
@QiJune Could you please review this again?
LGTM now. We plan to integrate your contributions as part of our refinement work, and when that work lands on GitHub we will add you as a co-author and acknowledge your efforts.
@QiJune I noticed that this change did not land in the TRT-LLM 0.9.0 release tag. Can you provide an ETA?
Hi @Lokiiiiii, thanks a lot for your contribution and support! We've merged your changes into the internal codebase; they will be included in the update to the GitHub main branch this week and land in the next stable release.
Closing this since we've merged the changes.
Overview
This PR adds two new flags to `trtllm-build` to support debugging:

- `--visualize-network` dumps the finalized TRT Network as SVG files for visual analysis.
- `--dry-run` runs through all the steps except the engine build and serialization, which are typically the operations with the most overhead.

When used together, the TRT Network is dumped in ~10 seconds for llama2-70B.
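For context, a rough sketch of the control flow these flags imply (the helpers below are stand-ins for the real network tracing, SVG export, and engine build, which this PR does not reproduce here):

```python
# Illustrative only: stub helpers stand in for the real builder internals.
def trace_network(model):
    return f"<network for {model}>"         # cheap: construct the TRT network

def dump_svg(network, path):
    with open(path, "w") as f:              # real code would render a graph; this writes a placeholder
        f.write(f"<svg><!-- {network} --></svg>")

def build_engine(network):
    return f"<engine for {network}>"        # expensive step that --dry-run skips

def build(model, dry_run=False, visualize_network=False):
    network = trace_network(model)
    if visualize_network:
        dump_svg(network, "network.svg")    # SVG dump for visual analysis
    if dry_run:
        return None                         # skip engine build and serialization
    return build_engine(network)

build("llama2-70b", dry_run=True, visualize_network=True)  # returns quickly; only network.svg is produced
```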
Testing
These changes have been manually tested for a few configurations of llama2.
Unit Tests
I can add more unit tests to this PR and fix existing unit tests if this basic design is acceptable.