Brevitas

Brevitas is a PyTorch library for neural network quantization, with support for both post-training quantization (PTQ) and quantization-aware training (QAT).

Please note that Brevitas is a research project and not an official Xilinx product.

If you like this project please consider ⭐ this repo, as it is the simplest and best way to support it.

Requirements

Python >= 3.7 .
Pytorch >= 1.5.1, <= 1.13.1 (PyTorch 2.0.0 is currently untested).
Windows, Linux or macOS.
GPU training-time acceleration (Optional but recommended).

Installation

You can install the latest release from PyPI:

pip install brevitas

Getting Started

Brevitas currently offers quantized implementations of the most common PyTorch layers used in DNN under brevitas.nn, such as QuantConv1d, QuantConv2d, QuantConvTranspose1d, QuantConvTranspose2d, QuantMultiheadAttention, QuantRNN, QuantLSTM etc., for adoption within PTQ and/or QAT. For each one of these layers, quantization of different tensors (inputs, weights, bias, outputs, etc) can be individually tuned according to a wide range of quantization settings.

As a reference for PTQ, Brevitas provides an example user flow for ImageNet classification models under brevitas_examples.imagenet_classification.ptq that quantizes an input torchvision model using PTQ under different quantization configurations (e.g. bit-width, granularity of scale, etc). Sample accuracy results are available here for a selection of three reference topologies (ResNet18, MobileNet V2, ViT), under a variety of different quantization settings.

For more info, checkout https://xilinx.github.io/brevitas/getting_started .

Cite as

If you adopt Brevitas in your work, please cite it as:

@software{brevitas,
  author       = {Alessandro Pappalardo},
  title        = {Xilinx/brevitas},
  year         = {2023},
  publisher    = {Zenodo},
  doi          = {10.5281/zenodo.3333552},
  url          = {https://doi.org/10.5281/zenodo.3333552}
}

History

2023/04/28 - Minor release version 0.9.1, see the release notes.
2023/04/21 - Release version 0.9.0, see the release notes.
2023/01/10 - Release version 0.8.0, see the release notes.
2021/12/13 - Release version 0.7.1, fix a bunch of issues. Added TVMCon 2021 tutorial notebook.
2021/11/03 - Re-release version 0.7.0 (build 1) on PyPI to fix a packaging issue.
2021/10/29 - Release version 0.7.0, see the release notes.
2021/06/04 - Release version 0.6.0, see the release notes.
2021/05/24 - Release version 0.5.1, fix a bunch of minor issues. See release notes.
2021/05/06 - Release version 0.5.0, see the release notes.
2021/03/15 - Release version 0.4.0, add support for __torch_function__ to QuantTensor.
2021/03/04 - Release version 0.3.1, fix bug w/ act initialization from statistics w/ IGNORE_MISSING_KEYS=1.
2021/03/01 - Release version 0.3.0, implements enum and shape solvers within extended dependency injectors. This allows declarative quantizers to be self-contained.
2021/02/04 - Release version 0.2.1, includes various bugfixes of QuantTensor w/ zero-point.
2021/01/30 - First release version 0.2.0 on PyPI.

Name		Name	Last commit message	Last commit date
Latest commit History 1,068 Commits
.github/workflows		.github/workflows
docs		docs
docsrc		docsrc
notebooks		notebooks
requirements		requirements
src		src
tests		tests
.gitignore		.gitignore
.pre-commit-config.yaml		.pre-commit-config.yaml
LICENSE		LICENSE
MANIFEST.in		MANIFEST.in
README.md		README.md
noxfile.py		noxfile.py
pytest.ini		pytest.ini
setup.cfg		setup.cfg
setup.py		setup.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Brevitas

Requirements

Installation

Getting Started

Cite as

History

About

Releases

Packages

Languages

License

bwintermann/brevitas

Folders and files

Latest commit

History

Repository files navigation

Brevitas

Requirements

Installation

Getting Started

Cite as

History

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages