
Update llama7b_sparse_quantized example #2322

Merged: 6 commits into main from compression-example-update on Jun 13, 2024
Conversation

@bfineran (Contributor) commented Jun 7, 2024

No description provided.

@bfineran bfineran requested review from Satrat and markurtz June 7, 2024 16:49

Review thread on the example script:

```python
output_dir = "output_llama7b_2:4_w4a16_channel"

apply(
```

Comment (Contributor):
The number of arguments here is very confusing, especially since most of these are related to training...

Reply:

Talked to Ben and he is going to write up a README covering just quantization, without the training. This one is intended to be a more advanced README showing how to do the full sparsity -> finetuning -> quantization flow.
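
For reference, the flow under discussion drives sparsification, finetuning, and quantization through a single `apply` call, which is why so many training arguments appear at the call site truncated in the excerpt above. The following is only a minimal sketch of that flow: the `sparseml.transformers.apply` import matches the example family this PR touches, but every keyword argument below (model stub, dataset, recipe name, and hyperparameter values) is an illustrative assumption, not the merged example's actual configuration.

```python
# Hedged sketch of the one-call sparsity -> finetuning -> quantization flow.
# Only output_dir comes from the excerpt above; all other arguments are
# illustrative placeholders, not the values merged in this PR.
from sparseml.transformers import apply

output_dir = "output_llama7b_2:4_w4a16_channel"

apply(
    model="zoo:llama2-7b-ultrachat200k_llama2_pretrain-base",  # hypothetical model stub
    dataset="ultrachat-200k",          # hypothetical calibration/finetuning dataset
    recipe="2of4_w4a16_recipe.yaml",   # hypothetical recipe: 2:4 sparsity + W4A16 quantization
    output_dir=output_dir,
    # Training-related knobs: the source of the "confusing arguments" comment above.
    num_train_epochs=0.5,
    learning_rate=1e-4,
    bf16=False,
    max_seq_length=512,
    num_calibration_samples=512,
)
```

Splitting the one-shot quantization path into its own README, as proposed above, would let that simpler walkthrough drop the training arguments entirely.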

dbogunowicz previously approved these changes Jun 10, 2024
Resolved review threads:
examples/llama7b_one_shot_quantization.md (resolved)
examples/llama7b_sparse_quantized/README.md (outdated, resolved)
Sara Adkins added 2 commits June 12, 2024 11:19
@bfineran merged commit 5c1de1c into main Jun 13, 2024
18 checks passed
@bfineran deleted the compression-example-update branch June 13, 2024 20:04