Add `scores_precision` parameter to `FeatureAttributionOutput.save` #202
Labels
enhancement, good first issue, help wanted
Description
This issue addresses the high space requirements of large attribution score tensors by adding a `scores_precision` parameter to the `FeatureAttributionOutput.save` method.

Proposed by: @g8a9
Motivation
Currently, tensors in `FeatureAttributionOutput` objects (attributions and step scores) are serialized in `float32` precision by default when using `out.save()`. While it is possible to compress the representation of these values with `ndarray_compact=True`, the resulting JSON files are usually quite large. Using more parsimonious data types could reduce the size of saved objects and facilitate systematic analyses leveraging large amounts of data.

Proposal
`float32` precision should probably remain the default behavior, as we do not want to cause any information loss by default. `float16` and `float8` should also be considered, both in signed and unsigned variants, since leveraging the strictly positive nature of some score types would allow supporting greater precision while halving space requirements. Unsigned values will be used as defaults if no negative scores are present in a tensor.

`float16` can easily be used by casting tensors to the native `torch.float16` data type, which preserves precision up to 4 decimal places for scores normalized in the [-1, 1] interval (8 for unsigned tensors). This corresponds to 2 or 4 decimal places for `float8`. However, this data type is not supported natively in PyTorch, so tensors should be converted to `torch.int8` and `torch.uint8` instead and transformed back into floats when reloading the object.