Add chatbot for fastertransformer #2

lvhan028 · 2023-06-18T07:59:07Z

No description provided.

add internlm2-chat-7b chat template

* support ascend using infer_ext
* fix(ascend): make infer_ext using TND format q,k,v in paged_token_attention
* support ascend using infer_ext
* feat: support ascend moe_gating_topk_softmax
* feat: change infer_ext ops function param order (#2)
* ascend: align attention mask to 32bytes (#7)
* fix attn args (#9)
* fix: expand shape of attn_mask (#10)
* feat: update infer_ext ops interface (#13)
* rename infer_ext to dlinfer
* format code
* Support internlm 2.5 (#14)
* refactor ascend pagedattention
* fix ascend apply_rotary_pos_emb
* fix import dlinfer (#16)
* fix: fix rms_norm params (#18)
* fix sync on ascend

---------

Co-authored-by: chenchiyu <chenchiyu@pjlab.org.cn>
Co-authored-by: CyCle1024 <ccy_justin@163.com>
Co-authored-by: Wei Tao <1136862851@qq.com>
Co-authored-by: jinminxi104 <jinminxi104@hotmail.com>
Co-authored-by: pdx1989 <pdx1989@gmail.com>

add chatbot

dd7a61b

lvhan028 merged commit ef2adb0 into InternLM:main Jun 18, 2023

grimoire referenced this pull request in grimoire/lmdeploy Jan 3, 2024

Merge pull request #2 from irexyc/support-internlm2

408b553

add internlm2-chat-7b chat template

jiabao-wang mentioned this pull request Nov 19, 2024

[Bug] Cannot install torch-npu==2.3.1, torch==2.3.1 and torchvision==0.18.1 because these package versions have conflicting dependencies. #2745

Open

3 tasks

Sunxiaohu0406 mentioned this pull request Dec 4, 2024

[Bug] glm-4v-9b multi-GPU error #2855

Closed

3 tasks
