Skip to content

Latest commit

 

History

History
116 lines (76 loc) · 4.37 KB

README.md

File metadata and controls

116 lines (76 loc) · 4.37 KB

SHARK Modeling and Serving Libraries

WARNING: This is an early preview that is in progress. It is not ready for general use.

GitHub License pre-commit

Sub-projects

PyPI version CI - sharktank

The SHARK Tank sub-project contains a collection of model recipes and conversion tools to produce inference-optimized programs.

PyPI version CI - shortfin

The shortfin sub-project is SHARK's high performance inference library and serving engine.

  • API documentation for shortfin is available on readthedocs.

CI - Tuner

The Tuner sub-project assists with tuning program performance by searching for optimal parameter configurations to use during model compilation.

Support matrix

Models

Model name Model recipes Serving apps
SDXL sharktank/sharktank/models/punet/ shortfin/python/shortfin_apps/sd/
llama sharktank/sharktank/models/llama/ shortfin/python/shortfin_apps/llm/

Development getting started

Use this as a guide to get started developing the project using pinned, pre-release dependencies. You are welcome to deviate as you see fit, but these canonical directions mirror what the CI does.

Setup a venv

We recommend setting up a virtual environment (venv). The project is configured to ignore .venv directories, and editors like VSCode pick them up by default.

python -m venv --prompt sharktank .venv
source .venv/bin/activate

Install PyTorch for your system

If no explicit action is taken, the default PyTorch version will be installed. This will give you a current CUDA-based version. Install a different variant by doing so explicitly first:

CPU:

pip install -r pytorch-cpu-requirements.txt

ROCM:

pip install -r pytorch-rocm-requirements.txt

Install development packages

# Clone and install editable iree-turbine dep in deps/
pip install -f https://iree.dev/pip-release-links.html --src deps \
  -e "git+https://github.com/iree-org/iree-turbine.git#egg=iree-turbine"

# Install editable local projects.
pip install -r requirements.txt -e sharktank/ shortfin/

Running tests

pytest sharktank
pytest shortfin

Optional: pre-commits and developer settings

This project is set up to use the pre-commit tooling. To install it in your local repo, run: pre-commit install. After this point, when making commits locally, hooks will run. See https://pre-commit.com/