Allow serving llama models with tensor parallel #592
Draft
Jackmin801 wants to merge 1 commit into bigscience-workshop:main