Fix Lora Rebase #290
Conversation
@@ -131,9 +132,9 @@ def convert_mapping(
     if long_lora_context:
         assert long_lora_offsets is not None
         indices_list.append(long_lora_offsets)
-    indices = torch.tensor(indices_list, dtype=torch.long, device="cuda")
+    indices = torch.tensor(indices_list, dtype=torch.long, device=get_device())
Suggested change:
-indices = torch.tensor(indices_list, dtype=torch.long, device=get_device())
+device = 'hpu' if current_platform.is_hpu() else 'cuda'
+indices = torch.tensor(indices_list, dtype=torch.long, device=device)
Then use device instead of get_device() in the remainder of the file. To get current_platform, add from vllm.platforms import current_platform at the top of the file.
We are using get_device() to localize this if condition, since it is called in many places in lora/models.py and lora/punica.py. This way the code stays clean.
@michalkuligowski I have changed the label from Intel to Habana. Both Himangshu and I are from the Habana team :-)
Fixes LoRA-related issues in the vLLM rebase.