Skip to content

Commit

Permalink
Revert "temporary decompose for decode" (nod-ai#409)
Browse files Browse the repository at this point in the history
Reverts nod-ai#353
  • Loading branch information
dan-garvey authored Nov 2, 2024
1 parent ad48f9d commit ce8b77f
Showing 1 changed file with 0 additions and 1 deletion.
1 change: 0 additions & 1 deletion sharktank/sharktank/models/llama/llama.py
Original file line number Diff line number Diff line change
Expand Up @@ -269,7 +269,6 @@ def decode(
for block_idx, block in enumerate(self.attn_blocks):
if block_idx == 0:
self.trace_tensor(f"llama.attn_block.{block_idx}.input", h)
block.attn.attention_kernel = "decomposed"
h = block(
h,
start_positions=start_positions,
Expand Down

0 comments on commit ce8b77f

Please sign in to comment.