Skip to content

Commit b8e4387

Browse files
author
Varun Sundar Rabindranath
committed
fixes
Signed-off-by: Varun Sundar Rabindranath <varun@neuralmagic.com>
1 parent 940afec commit b8e4387

File tree

1 file changed

+2
-2
lines changed

1 file changed

+2
-2
lines changed

vllm/lora/layers.py

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -242,12 +242,12 @@ def forward(self, x: torch.Tensor) -> torch.Tensor:
242242
embeddings_indices = torch.narrow(
243243
self.punica_wrapper._embeddings_indices, 1, 0, x.size(0))
244244

245-
indices = embeddings_indices[1].view_as(x)
245+
indices = embeddings_indices[1]
246246
full_lora_a_embeddings = F.embedding(
247247
x + indices,
248248
self.lora_a_stacked_2d,
249249
)
250-
indices = embeddings_indices[0].view_as(x)
250+
indices = embeddings_indices[0]
251251
full_output = self.base_layer.forward(x +
252252
(indices * added_tokens_mask))
253253

0 commit comments

Comments
 (0)