
Add Intel AMX/AVX512 support to accelerate inference #2247

Merged · 1 commit · Aug 21, 2023

Conversation

@LeiZhou-97 (Contributor) commented Aug 17, 2023

Why are these changes needed?

Currently, CPU-only inference mode does not use any accelerator. This change uses Intel Extension for PyTorch (IPEX) to accelerate inference when the CPU provides Intel AI acceleration (AMX/AVX-512).

RFC: https://intel.github.io/intel-extension-for-pytorch/cpu/latest/tutorials/examples.html#bfloat16
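For context, the bfloat16 path described in the linked tutorial boils down to roughly the pattern below. This is only a hedged sketch under the assumption that `intel_extension_for_pytorch` is installed; the toy model and tensor shapes are placeholders for illustration, not code from this PR:

```python
import torch
import torch.nn as nn
import intel_extension_for_pytorch as ipex  # assumes IPEX is installed

# Toy model standing in for the real chat model; eval mode is required by ipex.optimize.
model = nn.Sequential(nn.Linear(1024, 1024), nn.ReLU(), nn.Linear(1024, 1024)).eval()
example_input = torch.randn(1, 1024)

# ipex.optimize applies operator fusion and, on CPUs with AMX/AVX-512 BF16 support,
# selects bfloat16-capable kernels for the cast weights.
model = ipex.optimize(model, dtype=torch.bfloat16)

# Run inference under CPU autocast so activations are also computed in bfloat16.
with torch.no_grad(), torch.cpu.amp.autocast(dtype=torch.bfloat16):
    output = model(example_input)
```

On hardware without AMX/AVX-512 BF16 the same code still runs, just without the speedup, which is why gating the optimization on detected CPU capabilities is a reasonable design choice here.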

Related issue number (if applicable)

Checks

  • I've run format.sh to lint the changes in this PR.
  • I've included any doc changes needed.
  • I've made sure the relevant tests are passing (if applicable).

Currently, CPU-only inference mode does not support an accelerator.
Now use Intel Extension for PyTorch to accelerate inference when
the CPU has an AI accelerator.

Signed-off-by: LeiZhou-97 <lei.zhou@intel.com>