[TPU][V1] Implicitly adjust page size when there's SMEM OOM #16871

yaochengji · 2025-04-19T05:54:16Z

No description provided.

Signed-off-by: Chengji Yao <chengjiyao@google.com>

github-actions · 2025-04-19T05:54:25Z

👋 Hi! Thank you for contributing to the vLLM project.

💬 Join our developer Slack at https://slack.vllm.ai to discuss your PR in #pr-reviews, coordinate on features in #feat- channels, or join special interest groups in #sig- channels.

Just a reminder: PRs do not trigger a full CI run by default. Instead, only the fastcheck CI runs, which covers a small, essential subset of CI tests to quickly catch errors. You can run other CI tests on top of those by going to your fastcheck build on the Buildkite UI (linked in the PR checks section) and unblocking them. If you do not have permission to unblock, ping simon-mo or khluu to add you to our Buildkite org.

Once the PR is approved and ready to go, your PR reviewer(s) can run CI to test the changes comprehensively before merging.

To run CI, PR reviewers can either add the ready label to the PR or enable auto-merge.

🚀

mgoin

LGTM

vanbasten23 · 2025-04-21T23:21:56Z

vllm/v1/attention/backends/pallas.py

+    # we simply make sure that the size is smaller than half of SMEM capacity.
+    @staticmethod
+    def get_min_page_size(vllm_config: VllmConfig) -> int:
+        max_num_page_per_req = (1024 * 1024 // 2 //


What's the 2 here?

yaochengji · 2025-04-23

Here we simply make sure that the size is smaller than half of the SMEM capacity.
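To make the arithmetic behind this exchange concrete, here is a minimal sketch of how a minimum page size could be derived from a half-of-SMEM budget. Only the `1024 * 1024 // 2` prefix and the "half of SMEM capacity" comment come from the diff above; the function name `min_page_size_sketch`, its parameters, the 4-byte entry size, the division by the number of sequences, and the power-of-two rounding are all assumptions for illustration, not the merged implementation.

```python
def min_page_size_sketch(
    max_model_len: int,
    max_num_seqs: int,
    smem_bytes: int = 1024 * 1024,  # taken from the `1024 * 1024` in the diff
    bytes_per_entry: int = 4,       # assumption: one 32-bit index per page
) -> int:
    """Smallest power-of-two page size whose page table fits the budget."""
    # The budget is half of SMEM, matching the `// 2` asked about in review.
    budget = smem_bytes // 2
    # Assumption: the table holds one row of page indices per sequence.
    max_pages_per_req = budget // max_num_seqs // bytes_per_entry
    if max_pages_per_req == 0:
        raise ValueError("max_num_seqs is too large for the assumed budget")
    # Ceiling division: tokens per page needed to cover the longest sequence.
    min_page_size = -(-max_model_len // max_pages_per_req)
    # Assumption: round up to the next power of two.
    return 1 << (min_page_size - 1).bit_length()
```

Under these assumptions, `max_model_len=131072` with `max_num_seqs=256` leaves room for 512 pages per request, so the page size would need to be at least 256 tokens.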

[TPU][V1] Implicitly adjust page size when there's SMEM OOM

99fd5cc

Signed-off-by: Chengji Yao <chengjiyao@google.com>

yaochengji requested review from WoosukKwon, alexm-redhat, comaniac, njhill, robertgshaw2-redhat and ywang96 as code owners April 19, 2025 05:54

mergify bot added the v1 and tpu labels Apr 19, 2025

yaochengji requested a review from vanbasten23 April 19, 2025 05:55

fix test

dae2302

Signed-off-by: Chengji Yao <chengjiyao@google.com>

yaochengji added the ready label Apr 19, 2025

mgoin approved these changes Apr 21, 2025

mgoin merged commit 471fe65 into vllm-project:main Apr 21, 2025
56 of 58 checks passed

vanbasten23 reviewed Apr 21, 2025

jikunshang pushed a commit to jikunshang/vllm that referenced this pull request Apr 29, 2025

[TPU][V1] Implicitly adjust page size when there's SMEM OOM (vllm-pro…

22deb23

…ject#16871) Signed-off-by: Chengji Yao <chengjiyao@google.com>

lk-chen pushed a commit to lk-chen/vllm that referenced this pull request Apr 29, 2025

[TPU][V1] Implicitly adjust page size when there's SMEM OOM (vllm-pro…

6400204

…ject#16871) Signed-off-by: Chengji Yao <chengjiyao@google.com>

ckhordiasma mentioned this pull request May 14, 2025

nm vllm ent 0.8.5 sync red-hat-data-services/vllm#139

Merged
