Skip to content

Commit 6d70198

Browse files
authored
[Doc] Fix typo (#11666)
Signed-off-by: Kazuhiro Serizawa <nserihiro@gmail.com>
1 parent f962f42 commit 6d70198

File tree

2 files changed

+2
-2
lines changed

2 files changed

+2
-2
lines changed

vllm/model_executor/layers/rejection_sampler.py

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -39,7 +39,7 @@ def __init__(self,
3939
strict_mode: Whether or not to perform shape/device/dtype checks
4040
during sampling. This catches correctness issues but adds
4141
nontrivial latency.
42-
use_falshinfer: We will use this parameter to determine whether
42+
use_flashinfer: We will use this parameter to determine whether
4343
to use the FlashInfer rejection sampling kernel or not. If it's
4444
None, we will use the default value from the environment variable.
4545
This parameter is only used for testing purposes.

vllm/v1/sample/ops/topk_topp_sampler.py

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -44,7 +44,7 @@ def __init__(self):
4444
logger.warning(
4545
"FlashInfer is not available. Falling back to the PyTorch-"
4646
"native implementation of top-p & top-k sampling. For the "
47-
"best performance, please install FalshInfer.")
47+
"best performance, please install FlashInfer.")
4848
self.forward = self.forward_native
4949
else:
5050
self.forward = self.forward_native

0 commit comments

Comments
 (0)