Add attention_bias argument in transformer block and transformer layer modules, addressing change in MCore #6344

L2_Megatron_GPT_PEFT_Lora_PP2_O2  /  main

succeeded Nov 15, 2024 in 2m 43s