[FT] FasterTransformer 3.0 Release #696

byshiue · 2020-09-23T01:58:25Z

No description provided.

Update the cloned project

1. Add the FasterTransformer v3.0 2. Modify the README.md of FasterTransformer project 3. Deprecate the FasterTransformer v1

1. Because the apis of cublaslt are different in cuda 11 and previous cuda version, we add the supporting on different cuda version in FT v3.0.

[FT] feat: Add FasterTransformer v3.0 1. Add supporting of INT8 quantization of cpp and TensorFlow op. 2. Provide the tools to quantize the model. 3. Fix the bugs that cmake 3.15 and 3.16 cannot build this project. 4. Deprecate the FasterTransformer v1

byshiue added 5 commits September 20, 2020 13:45

Merge pull request #2 from NVIDIA/master

696cedf

Update the cloned project

feat: Add FasterTransformer v3.0

b15e4b8

1. Add the FasterTransformer v3.0 2. Modify the README.md of FasterTransformer project 3. Deprecate the FasterTransformer v1

feat: [FT] Add supporting of cublaslt on cuda 11

b262710

1. Because the apis of cublaslt are different in cuda 11 and previous cuda version, we add the supporting on different cuda version in FT v3.0.

fix: [FT] Fix the bug that cmake 3.15 or 3.16 cannot build the project

d21e2a3

docs: [FT] Update README

f73a531

byshiue self-assigned this Sep 23, 2020

byshiue merged commit b2e89e6 into NVIDIA:master Sep 23, 2020

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[FT] FasterTransformer 3.0 Release #696

[FT] FasterTransformer 3.0 Release #696

byshiue commented Sep 23, 2020

[FT] FasterTransformer 3.0 Release #696

[FT] FasterTransformer 3.0 Release #696

Conversation

byshiue commented Sep 23, 2020