feat: Implement FP8 functionality #2763

peri044 · 2024-04-18T23:14:58Z

Description

This PR adds FP8 & BF16 datatype support. It also implements converter for FP8 quantized ops.

Type of change

Please delete options that are not relevant and/or add your own.

Bug fix (non-breaking change which fixes an issue)
New feature (non-breaking change which adds functionality)
Breaking change (fix or feature that would cause existing functionality to not work as expected)
This change requires a documentation update

Checklist:

My code follows the style guidelines of this project (You can use the linters)
I have performed a self-review of my own code
I have commented my code, particularly in hard-to-understand areas and hacks
I have made corresponding changes to the documentation
I have added tests to verify my fix or my feature
New and existing unit tests pass locally with my changes
I have added the relevant labels to my PR in so that relevant reviewers are notified

chore: updates to trt api chore: trt 10 fixes chore: more fixes

author Dheeraj Peri <peri.dheeraj@gmail.com> 1711393059 -0700 committer Dheeraj Peri <peri.dheeraj@gmail.com> 1711393072 -0700 chore: minor updates chore: Fix save failures chore: minor fixes chore: remove duplicate bert test case chore: remove comments chore: add load api chore: minor updates chore: minor updates chore: minor updates chore: more updates

zewenli98 · 2024-06-01T00:39:28Z

py/torch_tensorrt/dynamo/lowering/passes/_aten_lowering_pass.py

+def pre_export_lowering(
+    ep: torch.export.ExportedProgram, sample_inputs: Sequence[torch.Tensor]
+) -> torch.fx.GraphModule:
+    """Applies the lowering passes to a graph module after torch.export/ torch.compile and their decompositions, returns the modified GraphModule"""


after -> before?
I'm wondering what belong to pre_lowering and what belong to post_lowering?

remove_detach belongs to pre_lowering (which happens before decompositions and other lowering passes)

peri044 added 30 commits March 12, 2024 02:11

chore: Upgrade to TRT 10.0

9ad87ac

chore: updates to trt api

a655c9a

feat: Add save API for torch-trt compiled models

cd86660

feat: Add FP8 support including dtype and converters

31285e5

chore: minor fixes

7c9c646

Merge branch 'main' into trt_10

4eabeb0

Merge branch 'trt_10' into fp8_trt10

a320e56

chore: resolve merge conflicts

3ece71b

chore: Fix save failures

eab0dba

chore: update to 2.3 rc build

b191d62

chore: rebase with release/2.3 branch

ce606fe

chore: minor fixes

8674a3c

chore: remove duplicate bert test case

f4e8fe9

chore: remove comments

4ae6ab9

chore: Upgrade to TRT 10.0

fff1b80

chore: updates to trt api chore: trt 10 fixes chore: more fixes

chore: more fixes

39ca77d

chore: update trt version

5431ee3

chore: more updates

0c03de5

chore: more updates

1ae46e9

chore: rebase with save

ae87fba

chore: Update versions

beb5920

chore: update tensorrt version in CI

f0068c6

chore: more updates

39261b9

chore: more fixes

3753150

Merge branch 'release/2.3' into trt_10

16a191c

chore: remove NvUtils.h

c355766

chore: more updates

2d237dc

chore: change lib64 to lib in rhel BUILD file

e4b4429

chore: more updates

fa4fb9c

peri044 and others added 23 commits May 23, 2024 23:51

chore: updates

4030344

chore: updates

ad9d825

Update build-test-windows.yml

0059c1c

Update build-test-linux.yml

f98abd6

chore: updates

0d2021d

chore: updates

1940267

chore: disable all lower_linear tests

5814402

chore: updates

338a92b

chore: fixes

59d0bd0

chore: updates

020fe63

chore: updates

3f8297e

chore: updates

5ce0ee1

chore: updates

65c5c3e

chore: updates

d99989d

chore: updates

99dfbdc

chore: updates

ad996a5

chore: updates

6ada351

chore: fixes

88fd7ee

chore: updates

2511095

chore: updates

5346a45

chore: updates

d284b8f

chore: updates

c71c017

chore: updates

a983064

github-actions bot added component: core Issues re: The core compiler component: build system Issues re: Build system labels May 29, 2024

zewenli98 force-pushed the fp8_trt10 branch from 9460715 to a983064 Compare May 29, 2024 21:13

github-actions bot removed component: core Issues re: The core compiler component: build system Issues re: Build system labels May 29, 2024

peri044 merged commit fe7fc94 into release/2.3 May 30, 2024
69 of 71 checks passed

zewenli98 reviewed Jun 1, 2024

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

feat: Implement FP8 functionality #2763

feat: Implement FP8 functionality #2763

Uh oh!

peri044 commented Apr 18, 2024

Uh oh!

Uh oh!

zewenli98 Jun 1, 2024

Uh oh!

peri044 Jun 3, 2024

Uh oh!

Uh oh!

feat: Implement FP8 functionality #2763

feat: Implement FP8 functionality #2763

Uh oh!

Conversation

peri044 commented Apr 18, 2024

Description

Type of change

Checklist:

Uh oh!

Uh oh!

zewenli98 Jun 1, 2024

Choose a reason for hiding this comment

Uh oh!

peri044 Jun 3, 2024

Choose a reason for hiding this comment

Uh oh!

Uh oh!