
MPS CI runs #162

Merged
merged 17 commits into from
Apr 15, 2024

Conversation

mikekgfb
Contributor

MPS quantization

@facebook-github-bot facebook-github-bot added the CLA Signed This label is managed by the Meta Open Source bot. label Apr 13, 2024
@mikekgfb mikekgfb changed the title MPS quantization MPS CI runs Apr 13, 2024
@mikekgfb mikekgfb merged commit 9c8bf7b into main Apr 15, 2024
1 of 13 checks passed
@mikekgfb mikekgfb deleted the mps_quantization branch April 15, 2024 15:12
malfet pushed a commit that referenced this pull request Jul 17, 2024
* MPS quantization

* mps dtypes

* updates

* fix names

* typo

* no bfloat16 for older macOS

* fix typo

* remove failing embedding quantization from MPS runs

* bfloat -> current model precision

* typo

* missed a bfloat16 instance; switch to default precision

* remove int8 quantization on mps

* enable cpu fallback for mps on int4

* hack int4pack_mm for torch.float

* typo

* disable int4 because int4pack_mm is not working for float16
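Several of the commits above gate bfloat16 off on older macOS and fall back to the model's current precision instead. A minimal sketch of that gate, with the macOS 14 cutoff and the float16 fallback both assumed for illustration (the PR does not state either number), kept framework-agnostic by checking the version string directly:

```python
import platform

def macos_supports_mps_bfloat16(mac_ver: str) -> bool:
    """Hypothetical gate: treat bfloat16 on the MPS backend as available
    only on newer macOS releases (macOS 14+ is an assumed cutoff)."""
    if not mac_ver:
        return False  # empty string: not running on macOS at all
    return int(mac_ver.split(".")[0]) >= 14

def pick_dtype(requested: str, mac_ver: str = platform.mac_ver()[0]) -> str:
    # Fall back to a default precision when bfloat16 is unsupported,
    # mirroring the "bfloat -> current model precision" commit above.
    if requested == "bfloat16" and not macos_supports_mps_bfloat16(mac_ver):
        return "float16"  # assumed default precision for illustration
    return requested
```

On an older macOS version string the requested bfloat16 is silently downgraded, which matches the spirit of the commits even if the real change picks the model's own precision rather than a hard-coded one.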
Labels
CLA Signed This label is managed by the Meta Open Source bot.
3 participants