Support QAT in QCOM qnn backend #6212

cccclai · 2024-10-15T00:48:23Z

🚀 The feature, motivation and pitch

Currently qnn quantizer only supports PTQ (post training quantization), and we'd like to enable QAT (quantization aware trainning) for better quantization support

Alternatives

Use PTQ

Additional context

No response

RFC (Optional)

No response

cccclai · 2024-10-15T01:08:52Z

Hi @chiwwang, @navsud is our quantization expert and is also looking into QAT for qnn, maybe we can coordinate and enable QAT together.

chiwwang · 2024-10-15T03:56:25Z

Nice! ++ @chunit-quic , who is prototyping QAT in qnn quantizer.

chiwwang · 2024-10-15T07:26:03Z

We have a prototype #6222, which is more like kickoff for our discussions.
It might be incorrect.... QAT is really a new thing for us. So please feel free to advise and give directions! (And...what model is suitable for the 1st QAT target? We need some E2E verifications... 🤔 )

cccclai added partner: qualcomm For backend delegation, kernels, demo, etc. from the 3rd-party partner, Qualcomm module: qnn Related to Qualcomm's QNN delegate labels Oct 15, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Support QAT in QCOM qnn backend #6212

Support QAT in QCOM qnn backend #6212

cccclai commented Oct 15, 2024

cccclai commented Oct 15, 2024

chiwwang commented Oct 15, 2024

chiwwang commented Oct 15, 2024

Support QAT in QCOM qnn backend #6212

Support QAT in QCOM qnn backend #6212

Comments

cccclai commented Oct 15, 2024

🚀 The feature, motivation and pitch

Alternatives

Additional context

RFC (Optional)

cccclai commented Oct 15, 2024

chiwwang commented Oct 15, 2024

chiwwang commented Oct 15, 2024