Set upper Numpy Version #540

stbaione · 2024-11-15T16:39:01Z

Description

Found a bug when walking through the shortfin llm docs using latest nightly sharktank. gguf is currently incompatible with numpy > 2. This breaks sharktank.examples.export_paged_llm_v1 on linux.

The gguf issue is filed here. It was closed from inactivity, but isn't actually solved and has a PR open for the fix.

Repro Steps

On linux:

Before re-pinning

Create a virtual environment:

python -m venv --prompt sharktank .venv
souce .venv/bin/activate

Install depencies and sharktank:

pip install -r pytorch-cpu-requirements.txt
pip install -r requirements.txt -e sharktank/

Show numpy version (before re-pinning):

pip show numpy | grep Version
Version: 2.1.3

Try running export_paged_llm_v1:

python -m sharktank.examples.export_paged_llm_v1 --gguf-file=$PATH_TO_GGUF --output-mlir=./temp/model.mlir --output-config=./temp/config.json --bs=1,4

You'll see this error:

Traceback (most recent call last):
  File "<frozen runpy>", line 198, in _run_module_as_main
  File "<frozen runpy>", line 88, in _run_code
  File "/home/stbaione/repos/SHARK-Platform/sharktank/sharktank/examples/export_paged_llm_v1.py", line 336, in <module>
    main()
  File "/home/stbaione/repos/SHARK-Platform/sharktank/sharktank/examples/export_paged_llm_v1.py", line 67, in main
    dataset = cli.get_input_dataset(args)
              ^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/home/stbaione/repos/SHARK-Platform/sharktank/sharktank/utils/cli.py", line 104, in get_input_dataset
    return Dataset.load(data_files["gguf"], file_type="gguf")
           ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/home/stbaione/repos/SHARK-Platform/sharktank/sharktank/types/theta.py", line 347, in load
    ds = _dataset_load_helper(path, file_type=file_type, mmap=mmap)
         ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/home/stbaione/repos/SHARK-Platform/sharktank/sharktank/types/theta.py", line 536, in _dataset_load_helper
    return gguf_interop.load_file(path)
           ^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/home/stbaione/repos/SHARK-Platform/sharktank/sharktank/types/gguf_interop/base.py", line 117, in load_file
    reader = GGUFReader(gguf_path)
             ^^^^^^^^^^^^^^^^^^^^^
  File "/home/stbaione/repos/SHARK-Platform/.venv_2/lib/python3.12/site-packages/gguf/gguf_reader.py", line 87, in __init__
    if self._get(offs, np.uint32, override_order = '<')[0] != GGUF_MAGIC:
       ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/home/stbaione/repos/SHARK-Platform/.venv_2/lib/python3.12/site-packages/gguf/gguf_reader.py", line 137, in _get
    .newbyteorder(override_order or self.byte_order)
     ^^^^^^^^^^^^
AttributeError: `newbyteorder` was removed from the ndarray class in NumPy 2.0. Use `arr.view(arr.dtype.newbyteorder(order))` instead.

After re-pinning

Create a virtual environment:

python -m venv --prompt sharktank .venv
souce .venv/bin/activate

Install depencies and sharktank:

pip install -r pytorch-cpu-requirements.txt
pip install -r requirements.txt -e sharktank/

Show numpy version:

pip show numpy | grep Version
Version: 1.26.3

Run export_paged_llm_v1:

python -m sharktank.examples.export_paged_llm_v1 --gguf-file=$PATH_TO_GGUF --output-mlir=./temp/model.mlir --output-config=./temp/config.json --bs=1,4

With re-pinning we get desired output:

Exporting decode_bs1
Exporting prefill_bs4
Exporting decode_bs4
GENERATED!
Exporting
Saving to './temp/model.mlir'

sharktank/requirements.txt

marbre · 2024-11-15T18:05:35Z

Maybe rename to Set upper Numpy version as we don't strictly pin?

stbaione · 2024-11-15T18:20:29Z

Maybe rename to Set upper Numpy version as we don't strictly pin?

@marbre Updated!

Repin numpy in sharktank for linux environment

7ae3077

stbaione requested review from ScottTodd and marbre November 15, 2024 16:39

ScottTodd reviewed Nov 15, 2024

View reviewed changes

sharktank/requirements.txt Show resolved Hide resolved

marbre reviewed Nov 15, 2024

View reviewed changes

sharktank/requirements.txt Outdated Show resolved Hide resolved

Set numpy version range for windows and non-windows

9a49ac2

stbaione requested review from marbre and ScottTodd November 15, 2024 17:26

marbre reviewed Nov 15, 2024

View reviewed changes

sharktank/requirements.txt Outdated Show resolved Hide resolved

Simplify numpy requirement

59eff94

stbaione requested a review from marbre November 15, 2024 18:00

stbaione changed the title ~~Repin numpy in sharktank for linux environment~~ Set upper Numpy Version Nov 15, 2024

Merge branch 'main' into repin-sharktank-numpy

72603b5

marbre approved these changes Nov 15, 2024

View reviewed changes

Merge branch 'main' into repin-sharktank-numpy

6156a51

marbre merged commit 5ccfc87 into nod-ai:main Nov 15, 2024
4 of 5 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Set upper Numpy Version #540

Set upper Numpy Version #540

stbaione commented Nov 15, 2024

marbre commented Nov 15, 2024

stbaione commented Nov 15, 2024

Set upper Numpy Version #540

Set upper Numpy Version #540

Conversation

stbaione commented Nov 15, 2024

Description

Repro Steps

Before re-pinning

After re-pinning

marbre commented Nov 15, 2024

stbaione commented Nov 15, 2024