Flash Attention Support #83

amrrs · 2024-05-05T19:28:35Z

Hey! Great work.

I think the latest code change for the SpeechToTextPipeline expects all GPUs to be Flash Attention 2compatible.

I'm not sure if there's anyway to override the kwargs.

I used it on P100 from Kaggle and got the error about Flash Attention

kadirnar · 2024-05-05T21:11:20Z

Can you share the error message?

kadirnar · 2024-05-05T21:40:35Z

I added flash-attention2 as a parameter for you to turn off. You can look at Readme.

kadirnar self-assigned this May 5, 2024

kadirnar added bug Something isn't working good first issue Good for newcomers labels May 5, 2024

kadirnar linked a pull request May 5, 2024 that will close this issue

Add optional parameter and wer metric code #84

Merged

kadirnar closed this as completed in #84 May 5, 2024

Provide feedback