Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Request support for Sage Attention #188

Open
doogyhatts opened this issue Jan 26, 2025 · 0 comments
Open

Request support for Sage Attention #188

doogyhatts opened this issue Jan 26, 2025 · 0 comments

Comments

@doogyhatts
Copy link

doogyhatts commented Jan 26, 2025

First, thanks for making v5.1!

I noticed my generation time on the RTX6000 Ada for 1344x768 resolution, 49 frames with bf16 model precision, to be around 866-876 seconds.
I did notice that teacache is enabled as well.
And system ram peaked at 81gb. (might be because I switched between bf16 and fp8)

Just wondering if there is a possibility of adding Sage Attention to speed up the generation process, other than having teacache?
Thanks!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant