You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
I have searched the existing issues and checked the recent builds/commits
What would your feature do ?
Assuming its possible - an implementation of https://github.com/dbolya/tomesd would be great!
They claim significant memory savings and inference speed improvements with very little quality loss.
However, it requires pytorch >= 1.12.1 (for scatter_reduce) so I'm not sure this can be ported to DirectML though.
Proposed workflow
.
Additional information
No response
The text was updated successfully, but these errors were encountered:
DirectML seems to be weak on handling partially modified dimensions. (dim < 0)
I added some fallback (to cpu) for it. Then, it was slower and generated some broken images.
Thank you for taking a look into it.
Since its not something solvable on your end, should I close this issue or keep it open in hopes the DirectML team fixes this someday?
I think they'll fix it someday, but I don't know when it will be. I don't think we need to maintain this issue because an extension for Token Merging is already out.
Is there an existing issue for this?
What would your feature do ?
Assuming its possible - an implementation of https://github.com/dbolya/tomesd would be great!
They claim significant memory savings and inference speed improvements with very little quality loss.
However, it requires
pytorch >= 1.12.1
(forscatter_reduce
) so I'm not sure this can be ported to DirectML though.Proposed workflow
.
Additional information
No response
The text was updated successfully, but these errors were encountered: