
Will there be precision errors after switching to QNN after the AIMET pipeline? #3268

Open
shifeiwen opened this issue Aug 26, 2024 · 1 comment

Comments

@shifeiwen

Hi AIMET team,
I would like to know: when I use AIMET to generate the encoding file and then pass it via the --quantization_overrides flag when converting to QNN, can the resulting NPU-backend model lose accuracy again? For example, AIMET reports a loss of 1.1, but the QNN HTP backend may give 1.3~1.4.
I am asking because I was using QC's GenAI notebook to run AIMET quantization on a model, and I found that the loss of the actually generated model was higher than the loss reported inside AIMET.
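For reference, here is a minimal sketch of the workflow I am describing, assuming aimet_torch's QuantizationSimModel API and a placeholder PyTorch model; the converter invocation in the trailing comment uses the --quantization_overrides flag mentioned above, and the other flag names may vary by QNN SDK version:

```python
import os
import torch
from aimet_torch.quantsim import QuantizationSimModel

# Placeholder model and input for illustration; in practice this is the real network.
model = torch.nn.Linear(16, 4).eval()
dummy_input = torch.randn(1, 16)

# Build the quantization simulation (8-bit weights/activations as an example).
sim = QuantizationSimModel(model,
                           dummy_input=dummy_input,
                           default_param_bw=8,
                           default_output_bw=8)

# Compute encodings with a calibration forward pass (here just the dummy input).
def calibrate(m, _):
    with torch.no_grad():
        m(dummy_input)

sim.compute_encodings(forward_pass_callback=calibrate,
                      forward_pass_callback_args=None)

# Export the ONNX model plus the *.encodings JSON that is later handed to QNN.
os.makedirs('./export', exist_ok=True)
sim.export(path='./export', filename_prefix='model', dummy_input=dummy_input)

# The exported encodings are then passed to the QNN converter, e.g.:
#   qnn-onnx-converter --input_network export/model.onnx \
#                      --quantization_overrides export/model.encodings \
#                      -o export/model.cpp
```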
