Report usage for beam search #6404

Merged: 3 commits into vllm-project:main on Jul 15, 2024

Conversation

simon-mo (Collaborator):

So we can be informed when fully removing beam search.

@simon-mo simon-mo requested a review from WoosukKwon July 13, 2024 03:21

👋 Hi! Thank you for contributing to the vLLM project.
Just a reminder: PRs do not trigger a full CI run by default. Instead, they only trigger the fastcheck CI, which consists of a small, essential subset of tests to catch errors quickly, with the flexibility to run extra individual tests on top (you can do this by unblocking test steps in the Buildkite run).

A full CI run is still required to merge this PR, so once the PR is ready to go, please make sure to run it. If you need all test signals between PR commits, you can trigger a full CI run as well.

To run full CI, you can do one of these:

  • Comment /ready on the PR
  • Add ready label to the PR
  • Enable auto-merge.

🚀

@simon-mo simon-mo added the ready ONLY add when PR is ready to merge/full CI is needed label Jul 13, 2024
@@ -184,6 +184,9 @@ def __init__(

         self._verify_args()
         if self.use_beam_search:
+            # Lazy import to avoid circular imports.
+            from vllm.usage.usage_lib import set_runtime_usage_data
+            set_runtime_usage_data("use_beam_search", True)
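
For context, here is a minimal sketch of the pattern the diff relies on: a module-level store of usage data plus a setter, with the import deferred to the call site so the sampling-params module and the usage module do not import each other at load time. The function name matches the diff, but the body below is an assumption for illustration, not the actual vllm.usage.usage_lib implementation.

```python
# Illustrative sketch only; vLLM's real vllm/usage/usage_lib.py differs.
from typing import Any, Dict

# Module-level store for facts that only become known at runtime
# (e.g. "this deployment used beam search at least once"). Re-setting
# the same key is idempotent, which is why a boolean flag (not a
# counter) is enough for the question asked in the review below.
_runtime_usage_data: Dict[str, Any] = {}


def set_runtime_usage_data(key: str, value: Any) -> None:
    """Record a runtime usage fact to attach to later usage reports."""
    _runtime_usage_data[key] = value


if __name__ == "__main__":
    # Mirrors the call added in the diff above.
    set_runtime_usage_data("use_beam_search", True)
    print(_runtime_usage_data)  # {'use_beam_search': True}
```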
Collaborator (reviewer):
IIUC, we don't track the number of beam search requests or their ratio, but rather whether the server has received at least one beam search request, right?

simon-mo (Collaborator, Author):

Correct. I think in the end I want to get to some understanding of "% of vLLM deployments using beam search".

simon-mo (Collaborator, Author):

Verified that the data has been received from testing.

@simon-mo simon-mo merged commit 32c9d7f into vllm-project:main Jul 15, 2024
68 of 73 checks passed
dtrifiro pushed a commit to opendatahub-io/vllm that referenced this pull request Jul 17, 2024
xjpang pushed a commit to xjpang/vllm that referenced this pull request Jul 24, 2024
Alvant pushed a commit to compressa-ai/vllm that referenced this pull request Oct 26, 2024 (Signed-off-by: Alvant <alvasian@yandex.ru>)
Labels: ready (ONLY add when PR is ready to merge/full CI is needed)
Projects: None yet
Participants: 2