[WIP] Support InternLM on 3rd-party inference toolboxes

This issue is to track progress on 3rd party toolboxes which is related to InternLM.

## VLLM

https://github.com/wangruohui/vllm/tree/internlm
- [x] Inference with single GPU
  - [ ]  There seems some bug, not sure from my implementation or from upstream
- [ ] Tensor parallel

## DeepSpeed

**InternLM-7B is supported in Deepspeed inference and merged to main branch**: https://github.com/microsoft/DeepSpeed/pull/4137

- [x] Single GPU with kernel infection policy
- [x] Tensor parallel

Meta tensor for faster model loading: watching https://github.com/microsoft/DeepSpeed/pull/3608

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[WIP] Support InternLM on 3rd-party inference toolboxes #136

VLLM

DeepSpeed

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

[WIP] Support InternLM on 3rd-party inference toolboxes #136

Description

VLLM

DeepSpeed

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions