Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
Benchmark Fix : Remove special tokens from warmup prompts (vllm-proje…
…ct#140) Summary: When sampling words at random for prompt generation, we sometimes pick up the `<pad>` token. The Tokenizer doesn't recognize this as a special token and leaves it in the prompt as-is. This causes the backend to fail with, ``` ../aten/src/ATen/native/cuda/Indexing.cu:1292: indexSelectLargeIndex: block: [312,0,0], thread: [64,0,0] Assertion `srcIndex < srcSelectDimSize` failed ``` like errors. Test: Manual tests Co-authored-by: Varun Sundar Rabindranath <varun@neuralmagic.com>
- Loading branch information