@libinta libinta commented Apr 10, 2025

`make_attn_bias` in `hpu_model_runner` doesn't set the non-causal mask that embedding models need, and the vertical mask off is also not set when merged prefill is enabled.
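
For readers unfamiliar with the issue, here is a minimal sketch of the general idea. It is not the code from this PR. The name `build_merged_prefill_bias` and its parameters are hypothetical, and it uses plain Python lists rather than tensors. It assumes merged prefill packs several sequences into one row, so each token may attend only to tokens of its own sequence. A causal model additionally restricts attention to earlier positions, while a non-causal embedding model allows the whole block.

```python
import math


def build_merged_prefill_bias(seq_lens, causal, padded_len=None):
    """Build an additive attention bias for sequences packed into one row.

    Hypothetical helper for illustration only.
    0.0 means "may attend", -inf means "masked".
    """
    total = sum(seq_lens)
    size = total if padded_len is None else padded_len
    if size < total:
        raise ValueError("padded_len must be >= sum(seq_lens)")

    # Map every packed position to the index of its sequence; padding gets -1.
    seq_id = [-1] * size
    pos = 0
    for idx, length in enumerate(seq_lens):
        for _ in range(length):
            seq_id[pos] = idx
            pos += 1

    bias = [[-math.inf] * size for _ in range(size)]
    for q in range(size):
        if seq_id[q] < 0:
            # Padding rows stay fully masked in this sketch; a real kernel
            # would need to handle such rows explicitly.
            continue
        for k in range(size):
            same_seq = seq_id[k] == seq_id[q]
            # Causal: only keys at or before the query position.
            # Non-causal (embedding models): every key in the same sequence.
            if same_seq and (not causal or k <= q):
                bias[q][k] = 0.0
    return bias


causal_bias = build_merged_prefill_bias([2, 1], causal=True)
full_bias = build_merged_prefill_bias([2, 1], causal=False)
```

With two packed sequences of lengths 2 and 1, the first token cannot see the second one in `causal_bias` but can in `full_bias`. In both cases the third token, which belongs to the other sequence, stays masked.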

@michalkuligowski

/run-gaudi-tests

@michalkuligowski

/run-gaudi-tests

@michalkuligowski michalkuligowski merged commit b3c3a2f into v1.21.0_next Apr 16, 2025
42 checks passed
@michalkuligowski michalkuligowski deleted the libint/fix_embedding_merged_prefill_121 branch April 16, 2025 09:16
4 participants