Skip to content

Commit

Permalink
Revert "temporary decompose for decode (#353)"
Browse files Browse the repository at this point in the history
This reverts commit 0e93b64.
  • Loading branch information
dan-garvey authored Nov 1, 2024
1 parent 614dafa commit 0930126
Showing 1 changed file with 0 additions and 1 deletion.
1 change: 0 additions & 1 deletion sharktank/sharktank/models/llama/llama.py
Original file line number Diff line number Diff line change
Expand Up @@ -269,7 +269,6 @@ def decode(
for block_idx, block in enumerate(self.attn_blocks):
if block_idx == 0:
self.trace_tensor(f"llama.attn_block.{block_idx}.input", h)
block.attn.attention_kernel = "decomposed"
h = block(
h,
start_positions=start_positions,
Expand Down

0 comments on commit 0930126

Please sign in to comment.