Update llama_flash_attn_monkey_patch.py for flash attention 2 #2059
Upgrading from FlashAttention (1.x) to FlashAttention-2
These functions have been renamed:
flash_attn_unpadded_qkvpacked_func -> flash_attn_varlen_qkvpacked_func
You can see how the upstream library made this change here: https://github.com/Dao-AILab/flash-attention
Why are these changes needed?
The function flash_attn_unpadded_qkvpacked_func is no longer included in flash-attention, which causes fine-tuning to fail. This PR fixes the problem.
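Below is a minimal sketch of how the monkey patch can stay compatible with both releases. It is not the literal diff in this PR; the alias name flash_attn_qkvpacked_func and the commented call-site arguments are illustrative assumptions based on the flash-attn packed-QKV varlen API.

```python
# Sketch: prefer the FlashAttention-2 name, fall back to the
# FlashAttention 1.x name so older installs keep working.
try:
    from flash_attn.flash_attn_interface import (
        flash_attn_varlen_qkvpacked_func as flash_attn_qkvpacked_func,  # FlashAttention-2
    )
except ImportError:
    from flash_attn.flash_attn_interface import (
        flash_attn_unpadded_qkvpacked_func as flash_attn_qkvpacked_func,  # FlashAttention 1.x
    )

# Inside the patched LlamaAttention.forward the call site can stay the same,
# since both functions take packed QKV plus varlen metadata, e.g.:
# attn_output = flash_attn_qkvpacked_func(
#     qkv,            # (total_tokens, 3, num_heads, head_dim)
#     cu_seqlens,     # cumulative sequence lengths, shape (batch_size + 1,)
#     max_seqlen,     # longest sequence length in the batch
#     dropout_p=0.0,
#     softmax_scale=None,
#     causal=True,
# )
```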
Related issue number (if applicable)
Checks
I've run format.sh to lint the changes in this PR.