Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[FT] FasterTransformer 3.0 Release #696

Merged
merged 5 commits into from
Sep 23, 2020
Merged

[FT] FasterTransformer 3.0 Release #696

merged 5 commits into from
Sep 23, 2020

Conversation

byshiue
Copy link
Collaborator

@byshiue byshiue commented Sep 23, 2020

No description provided.

Update the cloned project
1. Add the FasterTransformer v3.0
2. Modify the README.md of FasterTransformer project
3. Deprecate the FasterTransformer v1
1. Because the apis of cublaslt are different in cuda 11 and previous cuda
   version, we add the supporting on different cuda version in FT v3.0.
@byshiue byshiue self-assigned this Sep 23, 2020
@byshiue byshiue merged commit b2e89e6 into NVIDIA:master Sep 23, 2020
changlan pushed a commit to changlan/DeepLearningExamples that referenced this pull request Apr 5, 2021
[FT] feat: Add FasterTransformer v3.0

1. Add supporting of INT8 quantization of cpp and TensorFlow op.
2. Provide the tools to quantize the model.
3. Fix the bugs that cmake 3.15 and 3.16 cannot build this project. 
4. Deprecate the FasterTransformer v1
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant