While implementing disaggregated prefill, we found an error when loading weights from safetensors files. We have filed a JIRA ticket (HS-3164), as we believe this is a SynapseAI bug.
```python
import torch
import habana_frameworks.torch as htorch
from safetensors import safe_open
from safetensors.torch import save_file

if __name__ == "__main__":
    safetensor_file_path = "tmp.safetensors"
    # create safetensors file
    save_file({"foo": torch.randn(10)}, safetensor_file_path)
    # load safetensors file
    with torch.device("hpu"):
        f = safe_open(safetensor_file_path, framework="pt")
        foo = f.get_tensor("foo")
        # the following line yields an error in HPU lazy mode:
        print(foo)  # RuntimeError: Reshape doesnt support change in number of elements: [40] Size of output: [10]
```
This is a minimal reproducer for the `torch.device` context + `safetensors.safe_open` runtime error, in case you don't have access to the JIRA ticket.
Anything you want to discuss about vllm.
However, we found that the code in vllm-fork currently does the same thing: it loads safetensors files under a `torch.device("hpu")` context, without hitting any errors.
We would be very glad to know what makes this possible; please let us know if we are missing something.
FYI, we are using an IDC node with SynapseAI version 1.17.0.