Explore NVIDIA/TransformerEngine for speed/efficiency #1288

0xdevalias · 2022-11-14T23:06:44Z

Is your feature request related to a problem? Please describe.
A clear and concise description of what the problem is. Ex. I'm always frustrated when [...]

Describe the solution you'd like

https://github.com/NVIDIA/TransformerEngine
- A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit floating point (FP8) precision on Hopper GPUs, to provide better performance with lower memory utilization in both training and inference.

Describe alternatives you've considered

It may turn out that this lib isn't useful in this repo, or that existing optimisations already do things as well as it could (or better).

Additional context

Crossposted on:
- [Feature Request]: Explore NVIDIA/TransformerEngine for speed/efficiency AUTOMATIC1111/stable-diffusion-webui#4721

Other potential speed gain/improvement issues:
- Explore potential speed benefits from implementing kernl (Up to 12X faster GPU inference) #1094
- Community Integration: Making AIGC cheaper, faster, and more efficient. #1212
  - Community Integration: Making AIGC cheaper, faster, and more efficient. #1212 (comment)

patrickvonplaten · 2022-11-18T12:01:04Z

Hey @0xdevalias,

Thanks a lot for opening the issue! Just to better understand, what benefits does https://github.com/NVIDIA/TransformerEngine give, besides 8-bit quantization, that we don't currently have with xformers (https://github.com/facebookresearch/xformers) or other optimization libraries? :-)

github-actions · 2022-12-15T15:04:12Z

This issue has been automatically marked as stale because it has not had recent activity. If you think this still needs to be addressed please comment on this thread.

Please note that issues that do not follow the contributing guidelines are likely to be ignored.

0xdevalias mentioned this issue Nov 14, 2022

[Feature Request]: Explore NVIDIA/TransformerEngine for speed/efficiency AUTOMATIC1111/stable-diffusion-webui#4721

github-actions bot added the stale Issues that haven't received updates label Dec 15, 2022

github-actions bot closed this as completed Dec 24, 2022
