Check inference input_id tokens length by mrwyattii · Pull Request #4349 · deepspeedai/DeepSpeed

mrwyattii · 2023-09-15T23:24:40Z

When the inputs to the inference engine have a length greater than max_tokens we will run into segfaults, garbage output, and generally bad behavior. Adding a check to avoid this.

…reater than max_tokens

* origin/master: Allow multiple inference engines in single script (deepspeedai#4384) adds triton flash attention2 kernel (deepspeedai#4337) Fix llama meta tensor loading in AutoTP and kernel injected inference (deepspeedai#3608) Fix min torch version (deepspeedai#4375) Fix multinode runner to properly append to PDSH_SSH_ARGS_APPEND (deepspeedai#4373) add the missing method (deepspeedai#4363) Openfold fix (deepspeedai#4368) deepspeed4science japanese blog (deepspeedai#4369) deepspeed4science chinese blog (deepspeedai#4366) Enable workflow dispatch on Torch 1.10 CI tests (deepspeedai#4361) Update conda env to have max pydantic version (deepspeedai#4362) add deepspeed4science blog link (deepspeedai#4364) added check to avoid undefined behavior when the input_id length is greater than max_tokens (deepspeedai#4349) Add the policy to run llama model from the official repo (deepspeedai#4313) fix deepspeed4science links (deepspeedai#4358) DeepSpeed4Science (deepspeedai#4357) Support InternLM (deepspeedai#4137) Pass base_dir to model files can be loaded for auto-tp/meta-tensor. (deepspeedai#4348)

added check to avoid undefined behavior when the input_id length is g…

b278719

…reater than max_tokens

mrwyattii marked this pull request as ready for review September 18, 2023 16:47

mrwyattii requested review from RezaYazdaniAminabadi, arashb, awan-10, cmikeh2 and jeffra as code owners September 18, 2023 16:47

awan-10 approved these changes Sep 19, 2023

View reviewed changes

awan-10 added this pull request to the merge queue Sep 19, 2023

Merged via the queue into master with commit 8533423 Sep 19, 2023

mrwyattii deleted the mrwyattii/add-input-length-check branch September 19, 2023 22:15

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Check inference input_id tokens length#4349

Check inference input_id tokens length#4349
awan-10 merged 1 commit intomasterfrom
mrwyattii/add-input-length-check

mrwyattii commented Sep 15, 2023

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

mrwyattii commented Sep 15, 2023

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants