feat: add GGMLFileQuantizationType and apply to test #806

snowyu · 2024-07-17T00:15:02Z

@mishig25 that's it for #794

…ace#794

julien-c · 2024-07-17T07:32:06Z

cc @ngxson too

ngxson

LGTM. Thanks 👍

julien-c

will let @mishig25 do a final review and merge 👌

Thanks a lot @snowyu!

ngxson · 2024-08-16T10:36:42Z

FYI, I added the MOSTLY_ prefix in the last commit, to better reflect the type name from ggml (see here)

The reason is because many operations in ggml only support F32 for 1d tensors. So in fact, gguf file is never "purely" quantized, but rather being a mix between quantized type and F32.

julien-c · 2024-08-16T16:37:26Z

BTW, i also propose to display the enum's key name in a tooltip inside the GGUF file viewer, like this:

(internal PR)

julien-c · 2024-08-16T17:02:38Z

i'll let you merge @ngxson!

snowyu · 2024-08-17T08:50:29Z

@ngxson be careful, the const is not in ggml.h, it's in llama.h.

ngxson · 2024-08-17T08:54:23Z

Yeah I linked to the incorrect file, but the content is not changed anyway because I only added MOSTLY_ on top of your commit. (So everything is still correct)

feat: add GGMLFileQuantizationType and apply to test - close huggingf…

d275c9d

…ace#794

snowyu requested review from mishig25 and julien-c as code owners July 17, 2024 00:15

ngxson approved these changes Jul 17, 2024

View reviewed changes

julien-c approved these changes Jul 18, 2024

View reviewed changes

add MOSTLY_ prefix

d300964

julien-c approved these changes Aug 16, 2024

View reviewed changes

Merge branch 'main' into feat/GGMLFileQuantizationType

547894b

ngxson merged commit 1140e0c into huggingface:main Aug 16, 2024
4 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: add GGMLFileQuantizationType and apply to test #806

feat: add GGMLFileQuantizationType and apply to test #806

snowyu commented Jul 17, 2024

julien-c commented Jul 17, 2024

ngxson left a comment

julien-c left a comment

ngxson commented Aug 16, 2024

julien-c commented Aug 16, 2024

julien-c commented Aug 16, 2024

snowyu commented Aug 17, 2024

ngxson commented Aug 17, 2024 •

edited

Loading

feat: add GGMLFileQuantizationType and apply to test #806

feat: add GGMLFileQuantizationType and apply to test #806

Conversation

snowyu commented Jul 17, 2024

julien-c commented Jul 17, 2024

ngxson left a comment

Choose a reason for hiding this comment

julien-c left a comment

Choose a reason for hiding this comment

ngxson commented Aug 16, 2024

julien-c commented Aug 16, 2024

julien-c commented Aug 16, 2024

snowyu commented Aug 17, 2024

ngxson commented Aug 17, 2024 • edited Loading

ngxson commented Aug 17, 2024 •

edited

Loading